Gene EcSMS35_2454 details


Gene Information

Locus tag: EcSMS35_2454
Symbol:
ID: 6145162
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2503935
End bp: 2505476
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 52%
IMG OID: 641617326
Product: hypothetical protein
Protein accession: YP_001744498
Protein GI: 170681615
COG category: [S] Function unknown
COG ID: [COG1288] Predicted membrane protein
TIGRFAM ID:
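
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(2503935, 2505476)
print(length_bp)                  # 1542
print(protein_length(length_bp))  # 513
```

Both values agree with the Gene Length and Protein Length fields of this record.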


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000219916
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAAGGGA ATATTTGCGC TATGTCCGCA ATCACTGAAT CTAAACCAAC AAGAAGATGG 
GCAATGCCCG ATACGTTGGT GATTATCTTT TTTGTTGCCA TTTTAACCAG CCTTGCCACC
TGGGTAGTTC CGGTGGGCAT GTTTGACAGT CAGGAAGTGC AGTATCAGGT TGATGGTCAA
ACAAAAACAC GCAAAGTCGT AGATCCACAC TCATTCCGCA TTCTGACTAA CGAAGCAGGC
GAACCTGAGT ATCACCGCGT ACAACTGTTC ACGACGGGCG ATGAACGCCC GGGTCTGATG
AACTTCCCGT TTGAAGGGTT AACCTCAGGA TCGAAATACG GGACAGCCGT TGGCATCATC
ATGTTTATGC TGGTGATTGG CGGCGCGTTT GGCATTGTGA TGCGTACAGG AACCATTGAT
AACGGTATCC TGGCGCTTAT TCGCCATACT CGCGGGAATG AAATTCTCTT TATTCCTGCG
CTGTTTATTC TGTTTTCACT TGGCGGTGCG GTATTTGGTA TGGGGGAAGA GGCCGTCGCC
TTTGCCATTA TCATCGCACC GCTAATGGTC CGGCTGGGCT ATGACAGTAT TACCACCGTC
CTGGTGACTT ATATTGCCAC GCAAATCGGT TTTGCCAGTT CGTGGATGAA CCCGTTTTGT
GTGGTCGTTG CTCAGGGGAT TGCCGGCGTT CCGGTGCTTT CTGGCTCCGG GTTGCGCATC
GTGGTATGGG TTATCGCCAC TCTGATTGGC CTGATCTTTA CCATGGTGTA CGCCTCACGA
GTGAAAAAGA ATCCTCTTCT GTCACGCGTG CATGAGTCCG ACCGCTTCTT TCGTGAAAAG
CAGGCGGATG TTGAACAACG TCCGTTTACC TTTGGTGACT GGCTGGTATT GATTGTCCTG
ACCGCCGTAA TGGTCTGGGT GATTTGGGGC GTGATCGTTA ATGCCTGGTT TATTCCAGAA
ATTGCCAGCC AGTTCTTCAC CATGGGTCTG GTGATTGGCA TCATCGGCGT CGTTTTCCGC
CTTAACGGCA TGACGGTTAA TACCATGGCT TCATCCTTTA CCGAAGGGGC GCGAATGATG
ATCGCCCCTG CCCTGCTGGT GGGTTTCGCC AAAGGGATTT TGCTGCTGGT CGGTAATGGT
GAAGCGGGTG ATGCCAGCGT GTTAAATACC ATCCTCAACA GCATTGCCAA TGCCATTAGC
GGTCTGGATA ACGCGGTCGC GGCCTGGTTT ATGTTGCTCT TCCAGGCAGT ATTTAATTTC
TTCGTGACGT CCGGTTCTGG TCAGGCGGCG TTAACCATGC CGTTACTGGC ACCGCTTGGC
GATCTGGTCG GTGTTAACCG TCAGGTTACC GTGCTGGCGT TCCAGTTTGG TGATGGCTTC
AGCCACATCA TTTACCCGAC CTCAGCTTCG TTAATGGCAA CGCTCGGTGT TTGCCGGGTG
GACTTCCGTA ACTGGCTGAA GGTGGGCGCG ACACTGCTTG GACTGCTGTT TATTATGTCC
AGCGTCGTGG TGATCGGCGC TCAGTTGATG GGCTACCACT AA
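
The record reports a GC content of 52% for this gene. A minimal sketch of how such a figure could be computed from a sequence in the space-separated layout used above is given here; `gc_percent` is a hypothetical helper, and the database may use a different rounding or counting convention.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First 10-base block of the gene sequence: 5 G + 1 C out of 10 bases.
print(gc_percent("GTGCAAGGGA"))  # 60.0
```

Passing the full gene sequence to the same function gives the gene-wide value.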
 
Protein sequence
MQGNICAMSA ITESKPTRRW AMPDTLVIIF FVAILTSLAT WVVPVGMFDS QEVQYQVDGQ 
TKTRKVVDPH SFRILTNEAG EPEYHRVQLF TTGDERPGLM NFPFEGLTSG SKYGTAVGII
MFMLVIGGAF GIVMRTGTID NGILALIRHT RGNEILFIPA LFILFSLGGA VFGMGEEAVA
FAIIIAPLMV RLGYDSITTV LVTYIATQIG FASSWMNPFC VVVAQGIAGV PVLSGSGLRI
VVWVIATLIG LIFTMVYASR VKKNPLLSRV HESDRFFREK QADVEQRPFT FGDWLVLIVL
TAVMVWVIWG VIVNAWFIPE IASQFFTMGL VIGIIGVVFR LNGMTVNTMA SSFTEGARMM
IAPALLVGFA KGILLLVGNG EAGDASVLNT ILNSIANAIS GLDNAVAAWF MLLFQAVFNF
FVTSGSGQAA LTMPLLAPLG DLVGVNRQVT VLAFQFGDGF SHIIYPTSAS LMATLGVCRV
DFRNWLKVGA TLLGLLFIMS SVVVIGAQLM GYH
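
The protein sequence follows from the gene sequence under translation table 11, which shares the standard codon assignments but accepts alternative start codons such as GTG; an alternative start is still read as methionine. The sketch below illustrates this on the first and last blocks of the gene. The start-codon set is a deliberately reduced assumption (ATG, GTG, TTG only), and `translate_cds` is an illustrative name, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Standard amino-acid assignments in TCAG codon order (also used by table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence up to, and excluding, the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        residue = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            residue = "M"  # an initiator codon is read as methionine
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence, starting at the GTG start codon.
print(translate_cds(
    "GTGCAAGGGA ATATTTGCGC TATGTCCGCA ATCACTGAAT CTAAACCAAC AAGAAGATGG"
))  # MQGNICAMSAITESKPTRRW

# Last four codons of the gene, ending in the TAA stop codon.
print(translate_cds("GGCTACCACT AA"))  # GYH
```

The two outputs match the beginning (MQGNICAMSA ITESKPTRRW) and the end (GYH) of the protein sequence listed above.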