Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_0771 |
Symbol | |
ID | 6974168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 877600 |
End bp | 878511 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643390300 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002275176 |
Protein GI | 209542947 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.594317 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCAAT CGATCGAGGC ATTGCGCCAC CATATTGACA GGCATTGCAA GGCGGGTCGC GTCGAAACCG TGGTGCCCGG CCTGTCGCTG ATGCGCGCCG ACGCGCCGAC GCTGCCGGTC AGTTGCGTGT ACCAGCCCAC GCTTTGCCTG ATCGTGCAGG GCAGCAAGCA GGTGGTGTTG GGCGATCGGA TCTTCGCCTA TGACGCGCGG AACTACCTGA TCGCCACGGT GGACCTGCCG GTGACGGGCG GCGTCACGCA GGCGACGCCC GATTATCCCT ATCTGGCGCT GAGCCTGGCG CTGGACCCGT CGCGGATTGC CGCCCTGCTG CTGGACGTGC CCGCCGTGCT GGCCGAAACC AGGCCGGCGG CGGGCCTGGC CGTCAGCACG GTGACCGACA CGCTGCTCGA CCCCGTGGCG CGGCTGGTCG GGTTGCTGGA CCGGCCGGAG GACATCCCGG TGCTGGCGCC CCTGTTCGAG CAGGAGATCC TCTACCGGCT GCTGCAGGGC GACCAGGGCG GGATACTGCG ACAGGTTGCC CGCGCCGACA GCTATCTGTC CCATGTCCGC CGGGCGGTCG CCTGGATACG CGATCATTAC GCCGAGCCGT TCAGTATCGG TGACCTGGCG GCCCAAACGG GCATGAGTGC TTCGTCCTTC CACCGTCATT TCAAGGCGGT GACGATGATG AGCCCCCTGC AATACCGCAC GCGCATCCGC CTGCAGGAGG CACGGCGCAT GTTGCTGGCC GACGGGCAGG ACGCCGCCGG CATCGGCTTC GTGGTGGGCT ATGACAGCCC GTCGCAATTC AGCCGGGAAT ATCGCCGGAT GTTCGGCGTT CCACCCGCGC GTGACGCCGC GCGCCTGCGC CGGACGGACG GCGAGGCACG CGGCCTGGCC TATCCGCCCT GA
|
Protein sequence | MRQSIEALRH HIDRHCKAGR VETVVPGLSL MRADAPTLPV SCVYQPTLCL IVQGSKQVVL GDRIFAYDAR NYLIATVDLP VTGGVTQATP DYPYLALSLA LDPSRIAALL LDVPAVLAET RPAAGLAVST VTDTLLDPVA RLVGLLDRPE DIPVLAPLFE QEILYRLLQG DQGGILRQVA RADSYLSHVR RAVAWIRDHY AEPFSIGDLA AQTGMSASSF HRHFKAVTMM SPLQYRTRIR LQEARRMLLA DGQDAAGIGF VVGYDSPSQF SREYRRMFGV PPARDAARLR RTDGEARGLA YPP
|
| |