Gene EcolC_4220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4220 
Symbolrho 
ID6067806 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4662308 
End bp4663567 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content50% 
IMG OID641603652 
Producttranscription termination factor Rho 
Protein accessionYP_001727144 
Protein GI170022190 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00203599 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCTTA CCGAATTAAA GAATACGCCG GTTTCTGAGC TGATCACTCT CGGCGAAAAT 
ATGGGGCTGG AAAACCTGGC TCGTATGCGT AAGCAGGACA TTATTTTTGC CATCCTGAAG
CAGCACGCAA AGAGTGGCGA AGATATCTTT GGTGATGGCG TACTGGAGAT ATTGCAGGAT
GGATTTGGTT TCCTCCGTTC CGCAGACAGC TCCTACCTCG CCGGTCCTGA TGACATCTAC
GTTTCCCCTA GCCAAATCCG CCGTTTCAAC CTCCGCACTG GTGATACCAT CTCTGGTAAG
ATTCGCCCGC CGAAAGAAGG TGAACGCTAT TTTGCGCTGC TGAAAGTTAA CGAAGTTAAC
TTCGACAAAC CTGAAAACGC CCGCAACAAA ATCCTCTTTG AGAACTTAAC CCCGCTGCAC
GCAAACTCTC GTCTGCGTAT GGAACGTGGT AACGGTTCTA CGGAAGATTT AACCGCTCGC
GTACTGGATC TGGCATCACC TATCGGTCGT GGTCAGCGTG GTCTGATTGT GGCACCGCCG
AAAGCCGGTA AAACTATGCT GCTGCAGAAC ATTGCTCAGA GCATTGCTTA CAACCACCCG
GATTGTGTGC TGATGGTTCT GCTGATCGAC GAACGTCCGG AAGAAGTAAC CGAGATGCAG
CGTCTGGTAA AAGGTGAAGT TGTTGCTTCT ACCTTTGACG AACCCGCATC TCGCCACGTT
CAGGTTGCGG AAATGGTGAT CGAGAAGGCC AAACGCCTGG TTGAGCACAA GAAAGACGTT
ATCATTCTGC TCGACTCCAT CACTCGTCTG GCGCGCGCTT ACAACACTGT TGTTCCGGCG
TCAGGTAAAG TGTTGACCGG TGGTGTGGAC GCCAATGCCC TTCATCGTCC GAAACGCTTC
TTCGGTGCGG CGCGTAACGT GGAAGAGGGC GGCAGCCTGA CCATTATCGC GACGGCGCTT
ATCGATACCG GTTCTAAAAT GGACGAAGTT ATTTACGAAG AGTTTAAAGG TACAGGCAAC
ATGGAACTGC ACCTCTCTCG TAAGATCGCT GAAAAACGCG TCTTCCCGGC TATCGACTAC
AACCGTTCCG GTACCCGTAA AGAAGAGCTG CTCACGACTC AGGAAGAACT GCAGAAAATG
TGGATCCTGC GCAAAATCAT TCACCCGATG GGCGAAATCG ATGCAATGGA ATTCCTCATT
AATAAACTGG CCATGACCAA GACCAATGAC GATTTCTTCG AAATGATGAA ACGCTCATAA
 
Protein sequence
MNLTELKNTP VSELITLGEN MGLENLARMR KQDIIFAILK QHAKSGEDIF GDGVLEILQD 
GFGFLRSADS SYLAGPDDIY VSPSQIRRFN LRTGDTISGK IRPPKEGERY FALLKVNEVN
FDKPENARNK ILFENLTPLH ANSRLRMERG NGSTEDLTAR VLDLASPIGR GQRGLIVAPP
KAGKTMLLQN IAQSIAYNHP DCVLMVLLID ERPEEVTEMQ RLVKGEVVAS TFDEPASRHV
QVAEMVIEKA KRLVEHKKDV IILLDSITRL ARAYNTVVPA SGKVLTGGVD ANALHRPKRF
FGAARNVEEG GSLTIIATAL IDTGSKMDEV IYEEFKGTGN MELHLSRKIA EKRVFPAIDY
NRSGTRKEEL LTTQEELQKM WILRKIIHPM GEIDAMEFLI NKLAMTKTND DFFEMMKRS