Gene SbBS512_E4139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4139 
Symbolrho 
ID6269206 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3859999 
End bp3861258 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content50% 
IMG OID641727966 
Producttranscription termination factor Rho 
Protein accessionYP_001882393 
Protein GI187732455 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCTTA CCGAATTAAA GAATACGCCG GTTTCTGAGC TGATCACTCT CGGCGAAAAT 
ATGGGGCTGG AAAACCTGGC TCGTATGCGT AAGCAGGACA TTATTTTTGC CATCCTGAAG
CAGCACGCAA AGAGTGGCGA AGATATCTTT GGTGATGGCG TACTGGAGAT ATTGCAGGAT
GGATTTGGTT TCCTCCGTTC CGCAGACAGC TCCTACCTCG CCGGTCCTGA TGACATCTAC
GTTTCCCCTA GCCAAATCCG CCGTTTCAAC CTCCGCACTG GTGATACCAT CTCTGGTAAG
ATTCGCCCGC CGAAAGAAGG TGAACGCTAT TTTGCGCTGC TGAAAGTTAA CGAAGTTAAC
TTCGACAAAC CTGAAAACGC CCGCAACAAA ATCCTCTTTG AGAACTTAAC CCCGCTGCAC
GCAAACTCTC GTCTGCGTAT GGAACGTGGT AACGGTTCTA CGGAAGATTT AACCGCTCGC
GTACTGGATC TGGCATCACC TATCGGTCGT GGTCAGCGTG GTCTGATTGT GGCACCGCCG
AAAGCCGGTA AAACTATGCT GCTGCAGAAC ATTGCTCAGA GCATTGCTTA CAACCACCCG
GATTGTGTGC TGATGGTTCT GCTGATCGAC GAACGTCCGG AAGAAGTAAC CGAGATGCAG
CGTCTGGTAA AAGGTGAAGT TGTTGCTTCT ACCTTTGACG AACCCGCATC TCGCCACGTT
CAGGTTGCGG AAATGGTGAT CGAGAAGGCC AAACGCCTGG TTGAGCACAA GAAAGACGTT
ATCATTCTGC TCGACTCCAT CACTCGTCTG GCGCGCGCTT ACAACACTGT TGTTCCGGCG
TCAGGTAAAG TGTTGACCGG TGGTGTGGAC GCCAATGCCC TTCATCGTCC GAAACGCTTC
TTCGGTGCGG CGCGTAACGT GGAAGAGGGC GGCAGCCTGA CCATTATCGC GACGGCGCTT
ATCGATACCG GTTCTAAAAT GGACGAAGTT ATTTACGAAG AGTTTAAAGG TACAGGCAAC
ATGGAACTGC ACCTCTCTCG TAAGATCGCT GAAAAACGCG TCTTCCCGGC TATCGACTAC
AACCGTTCCG GTACCCGTAA AGAAGAGCTG CTCACGACTC AGGAAGAACT GCAGAAAATG
TGGATCCTGC GCAAAATCAT TCACCCGATG GGCGAAATCG ATGCAATGGA ATTCCTCATT
AATAAACTGG CCATGACCAA GACCAATGAC GATTTCTTCG AAATGATGAA ACGCTCATAA
 
Protein sequence
MNLTELKNTP VSELITLGEN MGLENLARMR KQDIIFAILK QHAKSGEDIF GDGVLEILQD 
GFGFLRSADS SYLAGPDDIY VSPSQIRRFN LRTGDTISGK IRPPKEGERY FALLKVNEVN
FDKPENARNK ILFENLTPLH ANSRLRMERG NGSTEDLTAR VLDLASPIGR GQRGLIVAPP
KAGKTMLLQN IAQSIAYNHP DCVLMVLLID ERPEEVTEMQ RLVKGEVVAS TFDEPASRHV
QVAEMVIEKA KRLVEHKKDV IILLDSITRL ARAYNTVVPA SGKVLTGGVD ANALHRPKRF
FGAARNVEEG GSLTIIATAL IDTGSKMDEV IYEEFKGTGN MELHLSRKIA EKRVFPAIDY
NRSGTRKEEL LTTQEELQKM WILRKIIHPM GEIDAMEFLI NKLAMTKTND DFFEMMKRS