Gene Gura_4065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_4065 
Symbolrho 
ID5165085 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp4722351 
End bp4723598 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content55% 
IMG OID640551544 
Producttranscription termination factor Rho 
Protein accessionYP_001232782 
Protein GI148266076 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000187673 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTTAC AAGAATTAAA GGGTAAAAAA ATCAATGAGC TGACCGCTAT TGCCAAGGGT 
CTGAATATTG AGGGCGCTTC CAGTCTGCGC AAGCAGGACC TGATTTTTGC CATCCTCAAT
GCCCAGACCG AGAAAAACGG CATGATCTTT GGCGAGGGTG TGCTTGAAAC CCTGCCGGAT
GGATTTGGTT TCTTGAGGGC GCCGGATTAC AACTATCTGC CGGGGCCGGA CGACATCTAT
GTCAGCCCAA GCCAGATCCG CCGTTTCAAC CTTCACACAG GCGATACCGT TTCCGGCCAG
ATCAGGCCTC CCAAGGAGGG TGAGCGTTAT TTCGCCCTGC TCAAGGTTGA ATCTGTCAAC
CATGAATCGC CGGATGTGGC GAGGGATAAG ATCCTCTTCG ACAACCTGAC GCCACTTTAT
CCCGAAGAGA AGCTGAAGCT TGAGACAACG CCCGACAATA TGCCGATGCG GGTGGTGGAG
CTGATAGCGC CCATCGGCAA GGGGCAGCGC GGTCTGATCG TCGCACCGCC CCGCACCGGC
AAGACCATGC TGATCCAGAA TATCGCCAAC TCCATTGCCG AAAACCACCC CGAAGTATTC
CTTATCGTCC TTCTCATCGA TGAACGTCCG GAAGAGGTGA CCGACATGCA GCGCTCGGTC
AACGGAGAGG TGATTTCCTC CACCTTCGAT GAGCCCGCCT CGCGTCATAT CCAGGTGGCG
GAGATGGTAA TCGAGAAGGC CAAACGGCTT GTCGAGCACA AGCGGGATGT GGTGATCCTC
CTTGACTCCA TTACCCGTTT GGCCCGTGCC TACAATACGG TCATTCCCCC TTCCGGCAAA
ATCCTTTCCG GCGGCGTCGA CTCCAATGCC CTGCACAAAC CGAAGCGCTT CTTCGGTGCA
GCCCGCAACA TCGAAGAGGG CGGCTCGCTC ACGATCATCG CCACTGCCCT GGTCGATACC
GGCAGCAAGA TGGATGAGGT CATCTTCGAA GAGTTCAAAG GGACCGGCAA CATGGAACTT
CATCTCGACC GCAAGCTGGT CGAGAAGCGT ACCTTCCCGG CTATCGACAT TAACAAGTCC
GGTACCCGCA AGGAGGAACT TCTCATCGAG AAAAGCGCCC TCAACCGGAT CTGGATTCTG
CGCAAGGTGC TCCATCCCAT GAATGTGGTG GACAGTATGG AATTCCTCAT CTCCAAGCTT
GAGGGGACCA AGGGTAACCA GGCGTTCCTT GATTCCATGA GTAAGTGA
 
Protein sequence
MNLQELKGKK INELTAIAKG LNIEGASSLR KQDLIFAILN AQTEKNGMIF GEGVLETLPD 
GFGFLRAPDY NYLPGPDDIY VSPSQIRRFN LHTGDTVSGQ IRPPKEGERY FALLKVESVN
HESPDVARDK ILFDNLTPLY PEEKLKLETT PDNMPMRVVE LIAPIGKGQR GLIVAPPRTG
KTMLIQNIAN SIAENHPEVF LIVLLIDERP EEVTDMQRSV NGEVISSTFD EPASRHIQVA
EMVIEKAKRL VEHKRDVVIL LDSITRLARA YNTVIPPSGK ILSGGVDSNA LHKPKRFFGA
ARNIEEGGSL TIIATALVDT GSKMDEVIFE EFKGTGNMEL HLDRKLVEKR TFPAIDINKS
GTRKEELLIE KSALNRIWIL RKVLHPMNVV DSMEFLISKL EGTKGNQAFL DSMSK