Gene EcSMS35_1209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1209 
Symbol 
ID6147283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1214150 
End bp1215190 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content42% 
IMG OID641616087 
Producthypothetical protein 
Protein accessionYP_001743270 
Protein GI170682042 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.852584 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.278976 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTACAT CGACAGTAAT TCCTGAAGAC ATCAAAACGC TAAAATCCGA CGTTAGCAAA 
TTAAAAAACG ATCAAGGAAG CTACGCAACA AAATCATATG TAGACAATAA AACAACATGG
AATAGTTATT GCAATGTAAT CTATGATCAA AAGACATTGC CGACCACTGG AACTATATTT
AGCGGTAAGA TTCACTTGTC AAATAAGACA GGAGAAACCG AAAACGCTTA TAGTGAGATG
TACACTAGAA AAAATATTGA CGGTACAAAA GATACAATGA CAAGGATTGT CACACACAAT
GGAACAAAAG GTATCTTTTG GGATTTTAGC GATCTTTACG GTGGAACATT AATTTTTCCA
GGCAGTGATG GTTACCTTAA GATGGGGAAC TGTCTCATGT CGTATGGTGT GCGGGGAAGT
AACGCGCTTA TTAAGTTTGA CTGCACAGAC ACATTACAAA TCAAATATGC CAATCATGGG
TCAACCATGA CAATCAACAC ACAGGGAACC GCTCATTCTG GCGCTACTAC TAGTTTGTGG
GGTAACTCTA CCCGTCCGGT TGTATATGAA GTTGGTGCTG ATGGTGGCGC TTATATGTTC
TATGCGCAGA AAAATACCGA TAACACCTAT ATGTTAAGCG TTAATGGTGC ATGTCATGCC
ACCGCATTTA ACCAGCATTC CGACCGGGAT CTGAAAGACA ACATTCAGGT GATCGATAAT
GCAACCGACC GCATCCGTAA AATGAACGGC TATACATACA CGCTTAAAGA AAACGGTATG
CCCTATGCTG GTGTCATTGC ACAGGAAGCT CTGGAAGCAA TCCCAGAAGT TGTAGGTTCC
GCAATGAAAT ATCAGGACGG TGCAAGCGGA TCGGAAGGTG AAGAAGGTGA ACGTTATTAC
ACAGTAGATT ATTCTGGTGT TACTGGCTTG CTTGTTCAGG TAGCCAGAGA GTCAGACGAC
AGAATAACAG CACTGGAAGA AGAAAACGCA GAATTAAGAC AAAGATTATC TGCAATTGAG
GCGGCGCTTG CGTCTAAATA A
 
Protein sequence
MATSTVIPED IKTLKSDVSK LKNDQGSYAT KSYVDNKTTW NSYCNVIYDQ KTLPTTGTIF 
SGKIHLSNKT GETENAYSEM YTRKNIDGTK DTMTRIVTHN GTKGIFWDFS DLYGGTLIFP
GSDGYLKMGN CLMSYGVRGS NALIKFDCTD TLQIKYANHG STMTINTQGT AHSGATTSLW
GNSTRPVVYE VGADGGAYMF YAQKNTDNTY MLSVNGACHA TAFNQHSDRD LKDNIQVIDN
ATDRIRKMNG YTYTLKENGM PYAGVIAQEA LEAIPEVVGS AMKYQDGASG SEGEEGERYY
TVDYSGVTGL LVQVARESDD RITALEEENA ELRQRLSAIE AALASK