Gene EcSMS35_A0145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_A0145 
Symbol 
ID6106459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010488 
Strand
Start bp113148 
End bp114704 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content35% 
IMG OID641614884 
Producthypothetical protein 
Protein accessionYP_001740025 
Protein GI170650802 
COG category[H] Coenzyme transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG1120] ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGAAA AAAATAAAAA AACCATTACT GGTCAAGTCT TGAACTCCAT TAAAATTAAC 
AAACTAAAAT GTATAAATGG ATTAAACGAG ATTATTTTTA AACCTCATGC GTTAACCGCA
ATTTTAGGTC CAAATGGAAG CGGAAAATCA ACAATACTCC ATGCTATAGC AAGCATATAC
ATGCCTGAAG AAGGTTTCCC CGGTGAAGAC CATCGGTTAA TGCATTTTTT TCCACGCAGC
CCACATGCTG AATGGAACGG CAGTGACTTC ATTGTAAATT TAACTTATCG TAAAGATGGG
GTAATGATTG AAAATGAATT GAAAAATTAT GGAAAAGCAG ATATTCGAGG CTCACGATGG
ATTCAGATTT ATGCTCGTCG TCCTTTAAGA GAAGTTTACT ATCTTGGCAT TGATAAATGT
GTTCCTATAA TAGAATCAGA AAAAAAGAAT AATATTCAGT ATGAAACCAG CAGTGTCAGT
AATGATTTAA TAACAAACAT TCTTCATTAC GCCTCCTATA TACTTAATAA ACCATACACA
AGTTTCAATC AGCATCAACA ACCCAATGGA AAAATATTGA TTGGAGTTGA GTCAGGGGGA
CTGGCCTACT CATCATTGAG TATGAGCGCA GGAGAGCAAA AGATATTTTT AATTCTGGAA
ACAATATTGA AAGCGGACAA GAACGCCTTA ATATTAATTG ATGAATTAGA TTTATTGTTG
CATGACGAAG CATTGAAAAA GTTAATAGAA GTCATATCAT CTCATGCCAA AGATAAAAAC
AAACAAATCA TCTTTACTAC CCATAGAGAA ATGATAACCA CTTTATCAGA CAAGATAAAT
ATAAGACACG TCGTAAACAT TCAAGGCCGT AGTTATTCAT TTGAAGAAAC AAAACCTGAT
GCTATAAACA GATTAACAGG TGAATCAACG ACGCCTATTG AAATATATGT AGAAGATGAC
TTAGCTGTTG CCATAATAAA TAAAATATGT TCTTCACTTA AGGCCAGCAA ATATGTAAAA
ATATTTAAAT TTGGCGCAGC TTCAAACGCA TTCACTCTAC TTGCCAGTAC CCTTATTCGT
GGTGACAATC TATCTGGTAA ACTTTATATT CTTGATGGCG ATAAATATTC CACAGAAAAT
GAAAAGAAAA CTGCTCTTGA TAAAGTTTTC ACCGGCACAG AATCACGAAC TTATGAGTTA
AAAGCGGCAG CGGAAGGAAA AGTAAAACAA TTTAATCTCC CAAATGGGGT TAAGCCTGAA
CAATACATTC ATTATCTTAT TACAAATGTC CCACTGGACG GATTGGGTGG CGAATATTTA
GAAATAATCG AAGCAGCAAG GGATATCAGA GTTGAACTTG ATGCTCATAA TTATATCTCG
AATATTTTAA CTAAATTGGG CATTGATCGT CCGTCCGGGT TAACACGAGT GATGGATCTG
GCCTCCAGGC ATCCCGAATG GCATCAGTAT GTGAGTGAAG TCACTGACTG GTTACAGCCC
GTAGTTTCTG ATTTAATGGA ACGATTACCT GAGAATGATA CAGTAGATAT AACTTAA
 
Protein sequence
MAEKNKKTIT GQVLNSIKIN KLKCINGLNE IIFKPHALTA ILGPNGSGKS TILHAIASIY 
MPEEGFPGED HRLMHFFPRS PHAEWNGSDF IVNLTYRKDG VMIENELKNY GKADIRGSRW
IQIYARRPLR EVYYLGIDKC VPIIESEKKN NIQYETSSVS NDLITNILHY ASYILNKPYT
SFNQHQQPNG KILIGVESGG LAYSSLSMSA GEQKIFLILE TILKADKNAL ILIDELDLLL
HDEALKKLIE VISSHAKDKN KQIIFTTHRE MITTLSDKIN IRHVVNIQGR SYSFEETKPD
AINRLTGEST TPIEIYVEDD LAVAIINKIC SSLKASKYVK IFKFGAASNA FTLLASTLIR
GDNLSGKLYI LDGDKYSTEN EKKTALDKVF TGTESRTYEL KAAAEGKVKQ FNLPNGVKPE
QYIHYLITNV PLDGLGGEYL EIIEAARDIR VELDAHNYIS NILTKLGIDR PSGLTRVMDL
ASRHPEWHQY VSEVTDWLQP VVSDLMERLP ENDTVDIT