Gene EcSMS35_2621 details

Gene Information

Locus tag: EcSMS35_2621
Symbol:
ID: 6143209
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2678124
End bp: 2680139
Gene Length: 2016 bp
Protein Length: 671 aa
Translation table: 11
GC content: 55%
IMG OID: 641617492
Product: hypothetical protein
Protein accession: YP_001744657
Protein GI: 170683380
COG category: [R] General function prediction only
COG ID: [COG1444] Predicted P-loop ATPase fused to an acetyltransferase
TIGRFAM ID:
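
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2678124
end_bp = 2680139

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(f"gene length: {gene_length_bp} bp")
print(f"protein length: {protein_length_aa} aa")
```

Under these assumptions the computed values agree with the listed gene length of 2016 bp and protein length of 671 aa.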


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGAAC TGACTGCGCT TCACACATTA ACAGCGCAAA TGAAACGTGA AGGGATCCGC 
CGCTTGCTGG TGTTGAGCGG GGAAGAGGGT TGGTGTTTTG ATCATGCGCT TAAGTTGCGT
GATGCCTTAC CTGGCGACTG GCTGTGGATT TCGCCGCAGC CAGATGCTGA AAACCACTGT
TCTCCCTCGG CACTACAAAC TTTACTTGGG CGCGAGTTCC GGCATGCGGT ATTCGACGCC
CGCCACGGCT TTGATGCCGC TGCCTTTGCG GCACTTAGCG GAACGTTGAA AGCGGGAAGC
TGGCTAGTGT TGTTACTCCC TGTATGGGAT GAGTGGGAAA ACCAACCTGA TGCCGACTCG
CTGCGCTGGA GTGATTGCCC TGACCCTATT GCGACGCCGC ATTTTGTCCA GCATTTCAAA
CGCGTACTTA CGGCGAATAA CGACGCTATC CTCTGGCGGC AAAACCAGCC GTTCTCGTTG
GCGCATTTTA CTCCCCGTAC TGACTGGCAC CCCGCGACCG GCGCACCACA GCCAGAACAA
CAGCAACTCT TACAGCAGCT ACTGACCATG CCATTGGGCG TGGCGGTGGT AACGGCTGCG
CGTGGGCGCG GTAAATCGGC GCTGGCAGGG CAACTCATTT CTCGTATTGC GGGTAGTGCG
ATTGTCACCG CGCCCGCAAA AGCGGCAACG TATGTACTGG CACAATTTGC GGGCGAGAAG
TTTCGCTTTA TTGCACCGGA TGCCTTGTTA GCCAGCGATG AGCAAGCCGA CTGGCTGGTG
GTCGATGAAG CCGCAGCCAT ACCTGCGCCG TTGTTGCATC AACTGGTATC GCGTTTTCCT
CGAACGTTGT TAACCACTAC GGTGCAGGGC TACGAAGGCA CCGGACGTGG TTTTTTGCTG
AAATTTTGCG CTCGCTTTCC GCATTTACAC CGTTTTGAAC TGCAACAACC GATCCGCTGG
GCGCAGGGAT GCCCGCTGGA GAAAATGGTC AGCGAGGCAC TGGTGTTTGA CGATGAAAAC
TTCACTCACG AACCACAAGG TGACATCGTC ATTTCTGCTT TTGAACAGAC GTTATGGCGA
AGCGAACCAG AAACGCCGTT AAAGGTATAT CAGCTATTGT CTGGCGCGCA CTACCGGACC
TCGCCGCTGG ATTTACGCCG GATGATGGAT GCACCAGGGC AACATTTTTT ACAGGCGGCT
GGCGAAAACG AGATTGCCGG AGCGCTGTGG CTGGTGGATG AGGGGGGATT ATCTCAACAA
CTCAGTCAGG CGGTATGGGC AGGTTTTCGT CGCCCTCGGG GTAATCTGGT GGCCCAGTCG
CTGGCGGCGC ACGGCAGCAA TCCACTGGCA GCGACATTGC GTGGACGGCG GGTAAGCCGG
ATAGCAGTTC ATCCGGCGCG TCAGCGGGAA GGCACAGGGC GGCAACTTAT TGCTGGTGCT
TTGCAATATA CGCATGACCT CGACTATCTT TCGGTGAGTT TTGGTTACAC CGGGGAGTTA
TGGCGTTTCT GGCAACGCTG CGGTTTTGTG CTGGTGCGAA TGGGTAATCA TCGTGAAGCC
AGCAGCGGTT GCTATACGGC GATGGCACTG TTACCGATGA GTGATGCGGG TAAACAGCTG
GCTGAACGTG AGCATTACCG TTTACGTCGC GATGCGCAAG CTCTCGCGCA GTGGAATGGC
GAAATGCTTC CCGTTGATCC ACTAAACGAT GCCGTCCTTT CTGACGACGA CTGGCTTGAA
CTGGCCGGTT TTGCTTTCGC TCATCGTCCG CTATTAACGT CGTTAGGTTG CTTAATGCGT
CTGTTACAAA CCAGCGAAAT GGCATTACCG GCGCTGCGTG GGCGTTTACA GAAAAACGCC
AGTGACGCGC AGTTATGTAC CACACTTAAA CTTTCAGGCC GTAAGCTGTT ACTGGTCCGT
CAGCGGGAAG AGGCCGCGCA GGCGCTATAC GCACTTGATG ATGTTCGCAC TGAGCGTTTG
CGCGATCGCA TAACGCAATG GCAATTTTTT CACTGA
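
The gene sequence is laid out in blocks of ten bases separated by spaces, so the whitespace has to be removed before any computation. The following is a minimal sketch of how a GC fraction could be computed from such text; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first row of the sequence is used here to keep the example short. The 55% figure in the record refers to the whole gene, so the value for a single row need not match it.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of G and C bases in a block-formatted sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGGCTGAAC TGACTGCGCT TCACACATTA ACAGCGCAAA TGAAACGTGA AGGGATCCGC"
first_row_gc = gc_fraction(first_row)
print(f"{len(clean_sequence(first_row))} bases, GC fraction {first_row_gc:.2f}")
```

To reproduce the record-level GC content, the same function would be applied to all rows of the gene sequence joined together.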
 
Protein sequence
MAELTALHTL TAQMKREGIR RLLVLSGEEG WCFDHALKLR DALPGDWLWI SPQPDAENHC 
SPSALQTLLG REFRHAVFDA RHGFDAAAFA ALSGTLKAGS WLVLLLPVWD EWENQPDADS
LRWSDCPDPI ATPHFVQHFK RVLTANNDAI LWRQNQPFSL AHFTPRTDWH PATGAPQPEQ
QQLLQQLLTM PLGVAVVTAA RGRGKSALAG QLISRIAGSA IVTAPAKAAT YVLAQFAGEK
FRFIAPDALL ASDEQADWLV VDEAAAIPAP LLHQLVSRFP RTLLTTTVQG YEGTGRGFLL
KFCARFPHLH RFELQQPIRW AQGCPLEKMV SEALVFDDEN FTHEPQGDIV ISAFEQTLWR
SEPETPLKVY QLLSGAHYRT SPLDLRRMMD APGQHFLQAA GENEIAGALW LVDEGGLSQQ
LSQAVWAGFR RPRGNLVAQS LAAHGSNPLA ATLRGRRVSR IAVHPARQRE GTGRQLIAGA
LQYTHDLDYL SVSFGYTGEL WRFWQRCGFV LVRMGNHREA SSGCYTAMAL LPMSDAGKQL
AEREHYRLRR DAQALAQWNG EMLPVDPLND AVLSDDDWLE LAGFAFAHRP LLTSLGCLMR
LLQTSEMALP ALRGRLQKNA SDAQLCTTLK LSGRKLLLVR QREEAAQALY ALDDVRTERL
RDRITQWQFF H
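
The protein sequence can be checked against the gene sequence by translating codons. The sketch below builds the codon-to-amino-acid mapping from the usual TCAG-ordered table; translation table 11 uses the same amino acid assignments as the standard code and differs in which codons may act as start codons, which does not matter here because the gene begins with ATG. The `translate` function is a hypothetical helper, and only the first row of the gene and the first 20 residues of the protein are compared.

```python
from itertools import product

# Codon table in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene and first two blocks of the protein, from the record.
gene_start = "ATGGCTGAAC TGACTGCGCT TCACACATTA ACAGCGCAAA TGAAACGTGA AGGGATCCGC"
protein_start = "MAELTALHTL TAQMKREGIR"

translated = translate(gene_start)
matches = translated == protein_start.replace(" ", "")
print(translated, matches)
```

Passing the complete gene sequence to `translate` would allow the same comparison against the full 671-residue protein.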