Gene Gdia_1968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1968 
Symbolrho 
ID6975394 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2183180 
End bp2184487 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content64% 
IMG OID643391497 
Producttranscription termination factor Rho 
Protein accessionYP_002276343 
Protein GI209544114 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.6615 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATCTCG CCGAACTCAA GGCCAAAACC CCCGCGGACC TTCTGGCGTA CGCCGAAAGC 
CTGCAGATCG AGAACGCGTC GTCCCTGCGC AAGCAGGACA TGATGTTCGC CATCCTCAAG
ACCCTCGCCG ACAATGATCA GGCGATCCAT GGCGAGGGAA CGCTGGAAAT CCTGCCCGAC
GGGTTCGGCT TCCTGCGCTC GCCCGAGGCC AATTACCTGC CCGGGCCGGA CGATATCTAT
ATCTCGCCCA GCCAGGTGCG GCGCTTCGGC CTGCGCACCG GCGACACGGT CGAGGGCCAG
ATCCGCGCCC CGCGCGACGG CGAGCGCTAT TTCGCGCTGC TGAAGGTCAA CACGATCAAT
TTCGAGCCGC CCGAGGCCGT CCGTCACCGG ATCAATTTCG ACAACCTGAC GCCCCTCTAC
CCCGAGCGCC GGCTGAAGAT GGAAGTCGAG GCCAACGCCG TGGGCGGCGA GCCCGAGAAG
GTCGAGAAGG GCAAGGGCGC CAAGTCCCAG CCCAAGGATT TCACCCCCCG CGTGATCGAC
CTGGTCTCGC CGATCGGCAT GGGGCAGCGC GCGCTGATCG TCGCCCCGCC GCGCACCGGC
AAGACCGTGA TGCTGCAGAG CATCGCCTCG TCGATTTCCG CCAACCACCC CGAAGTCTTC
CTGATCGTCC TGCTGATCGA CGAGCGCCCG GAAGAAGTCA CCGACATGGC CCGCTCCGTC
CGGGGCGAGG TCGTGTCCTC GACCTTCGAC GAGCCGGCGA CCCGCCACGT CCAGGTGACG
GAAATGGTGC TGGAGAAGGC CAAGCGGCTG GTCGAGCACA AGCGCGACGT CGTCATCCTG
CTGGATTCCA TCACCCGCCT GGCCCGCGCC TACAACACCG TGGTGCCGTC ATCGGGCAAG
GTGCTGACCG GCGGCGTGGA TGCCAATGCC CTGCAGCGCC CCAAGCGCTT CTTCGGCGCC
GCGCGCAATA TCGAGGAAGG CGGATCGCTG ACCATCATCG CCACCGCGCT GATCGATACC
GGCAGCCGCA TGGACGAGGT GATTTTCGAG GAATTCAAGG GCACCGGCAA CTCGGAACTC
ATCCTCGACC GCAAGCTGGC CGACAAGCGC ACCTTCCCGG CGATCGACAT CACCAAGAGC
GGCACCCGCA AGGAAGAATT GCTGGTCGAA CGCTCCGAAC TGTCGAAGAT GTGGGTCCTG
CGCCGCATCC TGGCGCCGAT GGGCACGATG GACGCGATGG ACTTTCTGCT GGACAAGCTG
AAATACAGCA AGACCAACCG GGATTTCTTC GACGCGATGA ATACCTGA
 
Protein sequence
MHLAELKAKT PADLLAYAES LQIENASSLR KQDMMFAILK TLADNDQAIH GEGTLEILPD 
GFGFLRSPEA NYLPGPDDIY ISPSQVRRFG LRTGDTVEGQ IRAPRDGERY FALLKVNTIN
FEPPEAVRHR INFDNLTPLY PERRLKMEVE ANAVGGEPEK VEKGKGAKSQ PKDFTPRVID
LVSPIGMGQR ALIVAPPRTG KTVMLQSIAS SISANHPEVF LIVLLIDERP EEVTDMARSV
RGEVVSSTFD EPATRHVQVT EMVLEKAKRL VEHKRDVVIL LDSITRLARA YNTVVPSSGK
VLTGGVDANA LQRPKRFFGA ARNIEEGGSL TIIATALIDT GSRMDEVIFE EFKGTGNSEL
ILDRKLADKR TFPAIDITKS GTRKEELLVE RSELSKMWVL RRILAPMGTM DAMDFLLDKL
KYSKTNRDFF DAMNT