Gene EcSMS35_3708 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3708 
Symbol 
ID6147059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3771868 
End bp3772860 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content47% 
IMG OID641618534 
Producthypothetical protein 
Protein accessionYP_001745674 
Protein GI170683549 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0226] ABC-type phosphate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.371578 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAAT TCACCGGTGT TTTACTATTA GGTACGGCGC TACTGGCGGG ATGTGTCGAC 
CGGGAAGGGT ACTATAACAG CGTCAGGGAA GAAGACAGCC ATGGACTGAC GTCTCTGCGG
GGGCAACCTG AATTACGTTA CAACGATGAT TGGTCAAGAT GGCCGAGAGT GTACGGCGCT
ACAGCCTTAT ACCCGCTGTA TGCCTCCGCG TATTATGAAT TAGTACCCGA GCCAAAAGAT
AAGGATCGAA CCTCGCTGGC CTGGCAGGCG TATGGTTTGC AACAAACCCG AACAGCTGAA
GCCTACGATA GTCTGATTAA GGGTACCGCG ACGGTTATTT TTGTTGCACA ACCGTCGGAA
GGACAGAAAA AACGTGCAGA AGAAGCAGGT GTTAAACTCA AATATACCGC TTTCGCCCGC
GAAGCCTTTG TCTTTATCGT TGATATCAAA AACCCGGTTA ATTCACTTAG CGAACAGCAG
GTCAAAGACA TTTTTAGCGG TAAAGTGAGT CGCTGGAATA AAGTGGGCGG CGGTGACGAA
AGTATAAAAG TCTGGCAGCG GCCAGAAGAT TCTGGAAGCC AAACGGTTAT GAAGGGACTG
GTTATGCAAG ATACTCCAAT GCTGCCAGCC AAAAAATCCA CTGTTATCGA TCTTATGGGC
GGTTTAATTA CCGAAGTCGC CGACTACCAA AATACACCAT CTTCTATTGG ATACACCTTC
CACTATTACG TCACACGTAT GAATGACAAT ATGCTCAAAA TGCGCAAACA GATCAAACTG
CTGGCTATAA ATGGCGTTGC ACCTACAGAG GAAAATATCC GTAACGGCAC TTATCCGTAC
ATAATTCATG CTTATATGGT GACGCACGAA AACCCTACGC TGGAAACGCA GAAATTCGTC
GACTGGTTTT TAAGCCCGCA GGGACAGCAA TTGGTAGAGG ATGTGGGCTA TGTGCCGATT
TATGACGCAT CATCCGAATC ATCAGGACAA TAA
 
Protein sequence
MNKFTGVLLL GTALLAGCVD REGYYNSVRE EDSHGLTSLR GQPELRYNDD WSRWPRVYGA 
TALYPLYASA YYELVPEPKD KDRTSLAWQA YGLQQTRTAE AYDSLIKGTA TVIFVAQPSE
GQKKRAEEAG VKLKYTAFAR EAFVFIVDIK NPVNSLSEQQ VKDIFSGKVS RWNKVGGGDE
SIKVWQRPED SGSQTVMKGL VMQDTPMLPA KKSTVIDLMG GLITEVADYQ NTPSSIGYTF
HYYVTRMNDN MLKMRKQIKL LAINGVAPTE ENIRNGTYPY IIHAYMVTHE NPTLETQKFV
DWFLSPQGQQ LVEDVGYVPI YDASSESSGQ