Gene EcSMS35_2489 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2489 
Symbol 
ID6147095 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2536010 
End bp2537368 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content47% 
IMG OID641617361 
ProductSMC domain-containing protein 
Protein accessionYP_001744533 
Protein GI170684247 
COG category[R] General function prediction only 
COG ID[COG3950] Predicted ATP-binding protein involved in virulence 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000889729 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAATA TCCGCACGCT TAAGCTCACT AATCTGGGGC GGTTTGAAGA ACTTGAAGTT 
CATCTGGCTC CAGTGGAGGA GTTCAAGAGC AATGTGACCG TTTTTATTGG TAATAATGGT
GCAGGTAAAA CATCAATATT AAAATCGTTG GCAACCAGCC TGAGTTGGTT CGTTGCCCGA
GTTCGTACTG AAAAAGGTAA CGGTAGCCCT ATTCCTGAAG ACGCTATTCT GAACGGTAGG
AGTTCGGCGA CAATTGAACT TCAGGTACTG AATACGCATC CAGCGACGGA GGCCGCTACG
CCCTACCGTT GGTTGCTTGC CAGAACGGCC AGTGGGAAAA AATCGACCAC CGCCTCCAGC
CTGCAAGAGG CCAGCCAATT GGCTGCGTTT TATCGAGATC AATACACCCA GAATAGCGGG
GCATCCTTCC CGCTTATCGC CTTCTATCCC GTAGAACGTG TCGTGCTGGA TGTGCCGTTG
AAAATAAAAG AACGTCATAA TTTTTTGCAA CTGGATGGCT ACGATAACGC CCTGAATCAG
GGTATTGATT TCCGCCGTTT CTTTGAGTGG TTTCGCAATC GCGAAGATGC AGAAAATGAA
TCGGGCTTAC CCCAAGACGT TCTGGATAAG CTCAGTACCA GGATAGATCT CGATAACACC
GTCTTAAATG CATTAACGGC AATCATGGCC TCGTCCCGGG ATCGCCAGTT GACCGCCGTC
AGAACGGCCA TTAGTCGCTT TATGCCAGGG TTCAGCAACT TACGCGTCAG GCGTAAACCT
CGCCTGCATA TGTCGATTGA TAAAAATGGC CAGACACTGA ATGTGCTGCA ATTATCGCAG
GGTGAAAAAT CACTGATGGC GTTAGTCGGC GATATTGCTC GCCGCCTGGC AATGATGAAC
CCGATGTTAG AAAACCCGCT AAACGGCGAG GGAATTGTAT TAATTGATGA AGTGGACATG
CACCTGCATC CAACATGGCA GCGTACAATC ATCCAGCGTC TGACGACAAC ATTCCCACAT
TGCCAGTTTG TCTTAACAAC CCACTCTCCT TTAGTGATCA GTGATTACAA AGATGTGCTG
GTTTATTCTC TGGATAATGG CGAATTAACG CAGCTCCCGT CTCTGTATGG GCAAGATGCG
AATACTGTGC TTTTGAATGT GATGGATACG GATATTCGCA ATGCGACAGT GGCAGAAAAA
CTTAACGATC TTTTGGATCT GATTCAGAAA AACGACTTTA TCAACGCTAA CGCTCTTCTG
AATACGCTAA GCCTGGAACT TCCTGAAAAC CATCTTGAAC TGGTGAAAGC CAGAATGCTT
CTGCGCAAAC AGGAAATTAA ACATGCGCGA AATAACTAA
 
Protein sequence
MMNIRTLKLT NLGRFEELEV HLAPVEEFKS NVTVFIGNNG AGKTSILKSL ATSLSWFVAR 
VRTEKGNGSP IPEDAILNGR SSATIELQVL NTHPATEAAT PYRWLLARTA SGKKSTTASS
LQEASQLAAF YRDQYTQNSG ASFPLIAFYP VERVVLDVPL KIKERHNFLQ LDGYDNALNQ
GIDFRRFFEW FRNREDAENE SGLPQDVLDK LSTRIDLDNT VLNALTAIMA SSRDRQLTAV
RTAISRFMPG FSNLRVRRKP RLHMSIDKNG QTLNVLQLSQ GEKSLMALVG DIARRLAMMN
PMLENPLNGE GIVLIDEVDM HLHPTWQRTI IQRLTTTFPH CQFVLTTHSP LVISDYKDVL
VYSLDNGELT QLPSLYGQDA NTVLLNVMDT DIRNATVAEK LNDLLDLIQK NDFINANALL
NTLSLELPEN HLELVKARML LRKQEIKHAR NN