Gene EcSMS35_4146 details

Gene Information

Locus tag: EcSMS35_4146
Symbol: rho
ID: 6144642
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4246064
End bp: 4247323
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 51%
IMG OID: 641618969
Product: transcription termination factor Rho
Protein accession: YP_001746101
Protein GI: 170683673
COG category: [K] Transcription
COG ID: [COG1158] Transcription termination factor
TIGRFAM ID: [TIGR00767] transcription termination factor Rho
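
The start, end, and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though the stated gene length agrees with it) and that the final codon is an untranslated stop. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Number of residues in an unspliced CDS, excluding the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(4246064, 4247323)
print(length)                     # 1260
print(protein_length_aa(length))  # 419
```

Both results match the Gene Length and Protein Length fields listed above.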


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCTTA CCGAATTAAA GAATACGCCG GTTTCTGAGC TGATCACTCT CGGCGAAAAT 
ATGGGGCTGG AAAACCTGGC TCGTATGCGT AAGCAGGACA TTATTTTTGC CATCCTGAAG
CAGCACGCAA AGAGTGGCGA AGATATCTTT GGTGATGGCG TACTGGAGAT ATTGCAGGAT
GGATTTGGTT TCCTCCGTTC CGCAGACAGC TCCTACCTCG CCGGTCCTGA TGACATCTAC
GTTTCCCCTA GCCAAATCCG CCGTTTCAAC CTCCGCACTG GTGATACCAT CTCTGGTAAG
ATTCGCCCGC CGAAAGAAGG TGAACGCTAT TTTGCGCTGC TGAAAGTTAA CGAAGTTAAC
TTCGACAAAC CTGAAAACGC CCGCAACAAA ATCCTCTTTG AGAACTTAAC CCCGCTGCAC
GCAAACTCTC GTCTGCGTAT GGAACGTGGT AACGGTTCTA CGGAAGATTT AACCGCTCGC
GTACTGGATC TGGCATCACC TATCGGTCGT GGTCAGCGTG GTCTGATTGT GGCACCGCCG
AAAGCCGGTA AAACCATGCT GCTGCAGAAC ATTGCTCAGA GCATTGCTTA CAACCACCCG
GATTGCGTAC TGATGGTTCT GCTGATCGAC GAACGTCCGG AAGAAGTCAC CGAGATGCAG
CGTCTGGTTA AAGGTGAAGT TGTTGCTTCT ACCTTTGACG AACCCGCATC TCGCCACGTT
CAGGTTGCGG AAATGGTGAT CGAGAAGGCC AAACGCCTGG TTGAGCACAA GAAAGACGTT
ATCATTCTGC TCGACTCCAT CACTCGTCTG GCGCGCGCTT ACAACACCGT TGTTCCGGCG
TCAGGTAAAG TGTTGACCGG TGGTGTGGAT GCCAATGCTC TGCATCGTCC GAAACGCTTC
TTTGGTGCGG CGCGTAACGT GGAAGAGGGC GGCAGCCTGA CCATCATCGC TACCGCGCTT
ATCGATACCG GTTCCAAGAT GGACGAAGTT ATCTACGAAG AGTTTAAAGG TACAGGCAAC
ATGGAACTGC ACCTCTCTCG TAAGATCGCT GAAAAACGCG TCTTCCCGGC TATCGACTAC
AACCGTTCCG GTACCCGTAA AGAAGAGCTG CTCACGACTC AGGAAGAACT GCAGAAAATG
TGGATCCTGC GCAAAATCAT TCACCCGATG GGCGAAATCG ATGCAATGGA ATTCCTCATT
AATAAACTGG CAATGACCAA GACCAATGAC GATTTCTTCG AAATGATGAA ACGCTCATAA
 
Protein sequence
MNLTELKNTP VSELITLGEN MGLENLARMR KQDIIFAILK QHAKSGEDIF GDGVLEILQD 
GFGFLRSADS SYLAGPDDIY VSPSQIRRFN LRTGDTISGK IRPPKEGERY FALLKVNEVN
FDKPENARNK ILFENLTPLH ANSRLRMERG NGSTEDLTAR VLDLASPIGR GQRGLIVAPP
KAGKTMLLQN IAQSIAYNHP DCVLMVLLID ERPEEVTEMQ RLVKGEVVAS TFDEPASRHV
QVAEMVIEKA KRLVEHKKDV IILLDSITRL ARAYNTVVPA SGKVLTGGVD ANALHRPKRF
FGAARNVEEG GSLTIIATAL IDTGSKMDEV IYEEFKGTGN MELHLSRKIA EKRVFPAIDY
NRSGTRKEEL LTTQEELQKM WILRKIIHPM GEIDAMEFLI NKLAMTKTND DFFEMMKRS
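
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the database's own pipeline. It relies on the fact that translation table 11 uses the same amino-acid assignments as the standard genetic code, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid
# assignments (shared by translation table 11). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAATCTTA CCGAATTAAA GAATACGCCG"))  # MNLTELKNTP
```

The printed residues match the first ten residues of the protein sequence above. Passing the full gene sequence to the same function should give the complete protein.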