Gene EcSMS35_2689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2689 
SymbolcsiE 
ID6146517 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2760769 
End bp2762049 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content52% 
IMG OID641617559 
Productstationary phase inducible protein CsiE 
Protein accessionYP_001744724 
Protein GI170683567 
COG category[K] Transcription 
COG ID[COG3711] Transcriptional antiterminator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCCTA CGCTTGCTCC ACCATCTGTC CTTTCGGCTC CCCAGCGCCG CTGCCAGATC 
TTGCTGACAC TCTTTCAGCC GGGGTTGACC GCCACCACGG CAACTTTCAG CGAGCTTAAT
GGTGTGGATG ATGATATTGC CAGTCTTGAT ATCAGCGAAA CAGGACGGGA GATCCTGCGC
TATCATCAAC TCACACTGAC TACTGGTTAT GACGGTAGCT ACCGGGTTGA AGGTACAGTG
CTTAACCAAC GTTTGTGTTT ATTTCACTGG CTACGACGTG GTTTCCGTCT GTGTCCGTCA
TTTATTACCA GCCATTTCAC CCCCGCCCTG AAGAGTGAAC TGAAGCGGCG CGGAATTGCG
CGTAACTTTT ACGACGATAC CAATCTACAA GCGTTAGTGA ATCTCTGCTC CCGACGGCTG
CAAAAACGCT TTGAAACGCG CGATATTCAT TTCCTGTGTC TGTATCTGCA ATATTGTTTG
CTGCAACACC ACGCTGGAAT TACGCCTCAG TTTAATCCGC TCCAACGTCG CTGGGCGGAA
TCCTGCCTTG AATTTCAGGT AGCGCAGGAA ATTGGACGCC ACTGGCAGCG TCGGGCGCTC
CAGCCTGTAC CGCCTGATGA GCCACTGTTC ATGGCATTAC TTTTTTCCAT GTTGCGGGTT
CCCGATCCAC TGCGGGATGC GCATCAGCGG GACAGACAAC TGCGTCAGTC TATCAAACGT
CTGGTAAACC ATTTTCGTGA GCTGGGAAAT GTTCGTTTTT ATGATGAACA GGGGTTGTGC
GATCAGCTTT ATACCCACCT CGCCCAGGCG TTAAATCGCA GTTTGTTTGC CATCGGTATT
GATAACACCC TGCCGGAAGA GTTCGCCCGC CTGTATCCAC GCCTGGTGCG CACCACCCGC
GCGGCGCTGG CCGGATTTGA AAGTGAATAC GGCGTCCATC TTTCTGATGA GGAAAGCGGT
CTGGTCGCGG TGATTTTCGG TGCCTGGCTA ATGCAGGAAA ACGACCTGCA TGAGAAACAG
ATTATTCTAC TGACCGGGAA TGATAGCGAG CGAGAAGCGC AGATTGAGCA GCAGCTACGC
GAACTAACGT TACTGCCGCT CAACATTAAA CATATGTCGG TAAAGGCATT TTTGCAGACA
GGCGCTCCGC GCGGCGCGGC ACTGATTATT GCTCCTTATA CCATGCCGTT ACCGCTCTTT
TCACCACCGC TGATCTATAC GGACCTGACG TTGACAACAC ATCAACAGGA GCAGATCCGC
AAAATGCTCG AATCAGCATG A
 
Protein sequence
MMPTLAPPSV LSAPQRRCQI LLTLFQPGLT ATTATFSELN GVDDDIASLD ISETGREILR 
YHQLTLTTGY DGSYRVEGTV LNQRLCLFHW LRRGFRLCPS FITSHFTPAL KSELKRRGIA
RNFYDDTNLQ ALVNLCSRRL QKRFETRDIH FLCLYLQYCL LQHHAGITPQ FNPLQRRWAE
SCLEFQVAQE IGRHWQRRAL QPVPPDEPLF MALLFSMLRV PDPLRDAHQR DRQLRQSIKR
LVNHFRELGN VRFYDEQGLC DQLYTHLAQA LNRSLFAIGI DNTLPEEFAR LYPRLVRTTR
AALAGFESEY GVHLSDEESG LVAVIFGAWL MQENDLHEKQ IILLTGNDSE REAQIEQQLR
ELTLLPLNIK HMSVKAFLQT GAPRGAALII APYTMPLPLF SPPLIYTDLT LTTHQQEQIR
KMLESA