Gene EcSMS35_1002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1002 
Symbolwzc 
ID6147421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1019915 
End bp1022077 
Gene Length2163 bp 
Protein Length720 aa 
Translation table11 
GC content52% 
IMG OID641615889 
Producttyrosine kinase 
Protein accessionYP_001743081 
Protein GI170681107 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR01007] capsular exopolysaccharide family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGAAA AAGTAAAACA ACATGCCGCT CCGGTAACGG GCAGTGATGA AATCGATATT 
GGTCGCCTGG TCGGCACCGT CATTGAAGCG CGCTGGTGGG TAATTGGCAT CACCGCCGTA
TTTGCCCTCT GTGCCGTGGT TTACACCTTC TTCGCCACGC CGATTTATAG TGCCGACGCA
CTGGTACAAA TCGAGCAAAA CAGCGGCAAT TCGTTAGTGC AGGACATTGG TTCGGCATTA
GCCAACAAAC CGCCAGCATC GGACGCCGAG ATCCAGTTGA TTCGTTCGCG TCTGGTGCTT
GGTAAAACGG TGGACGATCT CGACCTCGAT ATTGCAGTGA GCAAAAACAC GTTCCCTATT
TTCGGTGCGG GCTGGGATCG CCTGATGGGA CGCCAGAACG AGACGGTAAA AGTGACCACC
TTTAACCGCC CGAAAGAGAT GGCGGATCAG GTGTTTACGC TTAATGTGCT GGACGATAAA
AACTACACTC TGAGCAGCGA TGGCGGCTTT AGCGCCCGTG GGCAAGCGGG CCAGATGCTG
AAAAAAGAAG GCGTCACGCT GATGGTTGAA GCCATTCACT CCAGCCCGGG CAGTGAGTTT
ACCGTCACCA AATACTCCAC GCTGGGGATG ATCAACCAAC TGCAAAACAG CCTGACGGTA
ACGGAGAACG GCAAAGACGC AGGCGTACTG AGCCTGACTT ATACCGGTGA AGATCGCGAA
CAGATCCGCG ACATTCTTAA CAGCATCGCC CGTAACTATC AGGAACAAAA TATTGAGCGC
AAATCCGCGG AAGCGTCGAA AAGCCTCGCC TTCCTCGCGC AACAGTTACC GGAAGTACGT
AGCCGCCTTG ATGTTGCCGA AAATAAATTG AATGCCTTCC GTCAGGATAA AGATTCTGTT
GATCTGCCGC TGGAAGCAAA AGCGGTGCTC GATTCGATGG TGAACATCGA CGCACAATTG
AACGAACTGA CCTTTAAAGA AGCGGAAATC TCCAAGCTGT ACACCAAAGT TCACCCGGCG
TATCGCACGC TGCTGGAGAA ACGGCAGGCG CTGGAAGACG AAAAAGCCAA ACTGAACGGT
CGCGTAACGG CGATGCCGAA AACCCAGCAG GAGATTGTCC GTCTGACCCG CGATGTCGAG
TCTGGGCAGC AGGTCTATAT GCAGCTGCTG AATAAAGAGC AGGAGCTGAA AATCACCGAA
GCCAGCACCG TCGGCGATGT GCGCATTGTT GACCCGGCAA TCACCCAGCC AGGTGTGCTA
AAACCGAAGA AAGGGCTGAT TATCCTTGGG GCGATTATCC TTGGCCTGAT GCTCTCTATC
GTGGCTGTGC TGCTGCGCTC GTTGTTTAAT CGCGGCATTG AAAGCCCGCA AGTGCTGGAA
GAACATGGCA TTAGCGTCTA TGCCAGCATC CCGCTGTCGG AGTGGCAGAA AGCGCGCGAT
AGCGTCAAAA CCATCAAAGG GATTAAACGC TATAAACAGA GCCAGCTACT GGCGGTGGGG
AATCCAACCG ATCTGGCGAT TGAAGCCATC CGTAGTCTGC GTACCAGTCT GCACTTCGCG
ATGATGCAGG CGCAGAACAA TGTGTTGATG ATGACCGGGG TTAGCCCGTC AATCGGTAAA
ACCTTTGTCT GCGCCAACCT GGCGGCGGTG ATTAGCCAGA CCAATAAACG GGTGTTGTTG
ATCGACTGCG ATATGCGCAA AGGCTACACC CACGAGCTGT TGGGCACTAA TAACGTTAAT
GGCCTGTCGG AAATTCTGAT TGGACAGGGC GATATTACTA CTGCTGCTAA ACCGACCTCT
ATTGCCAAAT TTGACCTGAT CCCGCGCGGT CAGGTGCCGC CAAATCCTTC TGAACTGTTA
ATGAGCGAAC GCTTTGCCGA ACTGGTGAAG TGGGCGAGTA AAAACTACGA CCTGGTGTTG
ATTGATACGC CGCCGATTCT GGCAGTGACC GATGCGGCAA TTGTTGGTCG TCATGTCGGA
ACCACGTTAA TGGTGGCGCG TTATGCGGTC AACACATTGA AAGAAGTGGA AACCAGTCTG
AGCCGCTTTG AGCAAAATGG TATTCCGGTG AAAGGGGTGA TTCTGAACTC CATCTTCCGC
CGCGCCAGCG CGTATCAGGA TTATGGCTAT TACGAATACG AATATAAGTC GGATGCGAAA
TAA
 
Protein sequence
MTEKVKQHAA PVTGSDEIDI GRLVGTVIEA RWWVIGITAV FALCAVVYTF FATPIYSADA 
LVQIEQNSGN SLVQDIGSAL ANKPPASDAE IQLIRSRLVL GKTVDDLDLD IAVSKNTFPI
FGAGWDRLMG RQNETVKVTT FNRPKEMADQ VFTLNVLDDK NYTLSSDGGF SARGQAGQML
KKEGVTLMVE AIHSSPGSEF TVTKYSTLGM INQLQNSLTV TENGKDAGVL SLTYTGEDRE
QIRDILNSIA RNYQEQNIER KSAEASKSLA FLAQQLPEVR SRLDVAENKL NAFRQDKDSV
DLPLEAKAVL DSMVNIDAQL NELTFKEAEI SKLYTKVHPA YRTLLEKRQA LEDEKAKLNG
RVTAMPKTQQ EIVRLTRDVE SGQQVYMQLL NKEQELKITE ASTVGDVRIV DPAITQPGVL
KPKKGLIILG AIILGLMLSI VAVLLRSLFN RGIESPQVLE EHGISVYASI PLSEWQKARD
SVKTIKGIKR YKQSQLLAVG NPTDLAIEAI RSLRTSLHFA MMQAQNNVLM MTGVSPSIGK
TFVCANLAAV ISQTNKRVLL IDCDMRKGYT HELLGTNNVN GLSEILIGQG DITTAAKPTS
IAKFDLIPRG QVPPNPSELL MSERFAELVK WASKNYDLVL IDTPPILAVT DAAIVGRHVG
TTLMVARYAV NTLKEVETSL SRFEQNGIPV KGVILNSIFR RASAYQDYGY YEYEYKSDAK