Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_0697 |
Symbol | |
ID | 6974094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 790930 |
End bp | 792357 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643390226 |
Product | transcriptional regulator, XRE family |
Protein accession | YP_002275102 |
Protein GI | 209542873 |
COG category | [R] General function prediction only |
COG ID | [COG3800] Predicted transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.280462 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.896724 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCCGA CCCGCATCGG TCACATCATT CGCCGGCTTC GGTCCGAGCG GTCCCTGTCG CAGCAGGGGC TGGCGACGCG GCTGGGCATT TCGCCCAGCT ACCTGAACCT GATCGAACAC GATCAGCGCA GCGTCACGGC CTCGCTGCTG ATCAAGCTGA CCCGGGCGCT GGACGTATCC ATCGAAGCCC TGTCCGGCGT GGACGAACAG CGATTGCGGG GATTGCTGCA CGAGGCCCTG TCCGACCCGC TGCTGGGCGC GCACGCGGTG CCGGCGCAGG AGATCGCGGT CCTGGCGGCA CAACCCGAGG CCGCGCGGGC GGTGCTGACC CTGCATCAGG CCTTTCGTGC CGCCCATCAT GACGCGACAC GCCTGATGCT GCCGACCGGC ACGCGGGTCA TCCTGCCGCA GGAAGAAGCA CGGCTGGTCT ATGACGAGCG GCTGAACTTC TTCCCGGAAC TGGAGGCGGC GGCCGAACAG GTGCGCGCCG ACATGGCCCG CGCGGCGCGG CTGCCGGACG GCGAGGCCCT GCCGCCGTCG GAAAGCAACC ACATGATCGC GACCCGGCTG CGCCAGCACC ACGGCATCGT GGTGCGGATC GCGGCCCTGG ACGGCGCGTT CCGCCGTTAC GATCCCGACA GCCGGCTTCT GCAGCTTTCC GATCTGCTGC CGCGTGAAAG CCGGGGGTTC CAGCTGGCCT TCCAACTGAT GCTGATCGAG GGACGCGACG CGATCGAGCA TCTGCTGCAG GATATCGCAC CCAGCACGCA GGAGGCGCGG ATCGTCATCC AGATCGGGCT GGTGAACTAC GCGGCGGCGG CATTGCTGAT GCCCTATTCG CCCTTCCTGG CCGCGGCAAC GGCCCTGCGC CACGATCTGG ACATCCTGTC CGCGCGGTTC GGCGTATCCT ACCAGCAGGC GGCGCAGCGC CTGTCCACAT TGCAGAAACC CGGCGAACGC GGGGTGCCGT TCTTCTTCGT GCGGACCGAT CCGGCGGGGA ACATGACGAA GAGCTTTTCC GCCTGCGGCT TTCCGGTGCC GCGCCAGGGC CATTCCTGCC CGCAATGGAA TGCCAATACC TGCTTTTCCA CCCCCGGCGT GATCCAGGCG CAGGTGGCGC AGTTCACCGA CGGCCGGACC TTTCTGTGTT TCGCGCGGAC CGTGACGGGC ATTTCCACCG GCTGGAACGA CGTGCAGCCC GTTCATGCCG TGGCCATGGG CTGCGACATC ACCCGCGCTG CCGAAATCGT GTATTCCGAT CGCCTGAATT TACAGGCGCC GGCGATTCCT GTCGGTATAT CCTGCCATTT ATGTGATTGG ACCGAATGTC GTTCGCGGGC CTTTCCTCCC CTGCATCACC GGTTGGCGCC CGACGTCAAC CAGCGGGATT CGCTGCCGTT CACATTCGCA CCCGAACCGC CGGGGTGA
|
Protein sequence | MAPTRIGHII RRLRSERSLS QQGLATRLGI SPSYLNLIEH DQRSVTASLL IKLTRALDVS IEALSGVDEQ RLRGLLHEAL SDPLLGAHAV PAQEIAVLAA QPEAARAVLT LHQAFRAAHH DATRLMLPTG TRVILPQEEA RLVYDERLNF FPELEAAAEQ VRADMARAAR LPDGEALPPS ESNHMIATRL RQHHGIVVRI AALDGAFRRY DPDSRLLQLS DLLPRESRGF QLAFQLMLIE GRDAIEHLLQ DIAPSTQEAR IVIQIGLVNY AAAALLMPYS PFLAAATALR HDLDILSARF GVSYQQAAQR LSTLQKPGER GVPFFFVRTD PAGNMTKSFS ACGFPVPRQG HSCPQWNANT CFSTPGVIQA QVAQFTDGRT FLCFARTVTG ISTGWNDVQP VHAVAMGCDI TRAAEIVYSD RLNLQAPAIP VGISCHLCDW TECRSRAFPP LHHRLAPDVN QRDSLPFTFA PEPPG
|
| |