Gene EcSMS35_3709 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3709 
Symbol 
ID6142839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3772886 
End bp3774391 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content47% 
IMG OID641618535 
Producthypothetical protein 
Protein accessionYP_001745675 
Protein GI170681684 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0226] ABC-type phosphate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.447391 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAACG GAAAATGGAT TTTGACCTCG CTGGTAATGA CTTTTTTTGG CATCCCCATA 
CTGGCGCAAT TTTTGGCAGC GGTTGTTGCC ATGCTGGGGG CCGGGCTTGC CGCTATTTTT
GATGTTTGCA ATTTACTCTT TACGCCAACA ATTTATCTTC TACTCAACGT CTTTATGCTG
ACGCTGGGCG CATTACTGCT ATTTTTCTCT GGGCGAGTGT GGGCGGGCGA TAGCGCACCA
GAAAACAGAG AAATAGCCGC CTGGCGACAA TGTCTTTTTT TAGTTCCCGC TTTATTAACC
CTGGTTGGCT GGATAATCAC GCTACATCTG GCAGATTATC AATTTCGCCA GATGGTTTCA
GGTTGGTTGG CAAACCTTAT GCTTCCCTGG TTGGGCGTTT TTACAGTCTC ATTCGTCGGT
GGTGAGTACT GGTGGATAGT CATTATTCCC GTTGGCGCGC ATATCAGTTT TTCACTGGGA
TACGGCTGGC CGACCAGACA CCCTTTAACC GGCACCTCCG GTCTACGTTG CCGTAATTTA
CTTCTGTTCA TTCTTCTCTT ACTGGGTATT GTCGCTGGTT ATCAGGCTTA TTTATATAAA
CAGCTTAATC CCGGCGTCGG TGTGCGTGAA AATATTGATA CCTGGGCCTG GCGACCCGAT
AAACTTAATA ATCAACTGAC GCCACTGCGT GGTAAACCAC AAATTCAATT TACGCAAAAC
TGGCCGCGAC TCGATGGCGC TACGGCGGCG TACCCCATTT ATGCCTCTGC ATTTTATGCA
TTAAGTGTAA TACCAGAGGA TTTTCACGTT TGGGATTATC TGGATAACTC TCGTACGCAA
GAAGCATATA ACAAAATCGT TAAGGGCGAT GCTGATATTA TTTTCGTGGC GCAACCTTCC
GATGGGCAAA AAAAACGCGC TGAGAAATCG GGCGTCACTT TGCTGTACAC GCCATTTGCC
CGTGAAGCGT TTGTTTTCAT CGTCAATGCG GATAATCCGG TTAATTCCCT GACTGAACAA
CAAGTGCGTG ACATTTTTAG TGGCGCAATT ACCAACTGGC GCACGGTTGG CGGTAACGAT
CAGGAGATCC AGACCTGGCA ACGCCCGGAA GACTCTGGCA GCCAGACAGT GATGCAATCG
CAGGTCATGA AAAATGTCCG CATGATCTCG CCGCAGGAAA CCGAAGTGGC AAGCATGATG
GAGGGAATGA TTAAAGTCGT TGCCGAATAC CGTAATACAA ACAACGCAAT AGGCTACACC
TTCCGCTATT ACGCAACACA AATGAATGCC GATAAAAATA TAAAACTGCT AGCGATTAAC
GGTATTGCAC CGACTGCGGA AAATATTCGT AACGGCAAAT ATCCGTATAT CATCGATGCC
TTTATGGTAA CGCGGGAAAA TACTACGTCA GAAACACAAA AACTGGTCGA ATGGTTTTTA
ACGCCGCAGG GACAGAGTCT GGTGGAAGAT GTGGGCTATG TGCCGCTGTA TCCAACAATG
AAATAA
 
Protein sequence
MQNGKWILTS LVMTFFGIPI LAQFLAAVVA MLGAGLAAIF DVCNLLFTPT IYLLLNVFML 
TLGALLLFFS GRVWAGDSAP ENREIAAWRQ CLFLVPALLT LVGWIITLHL ADYQFRQMVS
GWLANLMLPW LGVFTVSFVG GEYWWIVIIP VGAHISFSLG YGWPTRHPLT GTSGLRCRNL
LLFILLLLGI VAGYQAYLYK QLNPGVGVRE NIDTWAWRPD KLNNQLTPLR GKPQIQFTQN
WPRLDGATAA YPIYASAFYA LSVIPEDFHV WDYLDNSRTQ EAYNKIVKGD ADIIFVAQPS
DGQKKRAEKS GVTLLYTPFA REAFVFIVNA DNPVNSLTEQ QVRDIFSGAI TNWRTVGGND
QEIQTWQRPE DSGSQTVMQS QVMKNVRMIS PQETEVASMM EGMIKVVAEY RNTNNAIGYT
FRYYATQMNA DKNIKLLAIN GIAPTAENIR NGKYPYIIDA FMVTRENTTS ETQKLVEWFL
TPQGQSLVED VGYVPLYPTM K