Gene EcSMS35_3782 details


Gene Information

Locus tag: EcSMS35_3782
Symbol: pitA
ID: 6143022
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3849241
End bp: 3850740
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 54%
IMG OID: 641618608
Product: low-affinity inorganic phosphate transporter 1
Protein accession: YP_001745748
Protein GI: 170681103
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0306] Phosphate/sulphate permeases
TIGRFAM ID:
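
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(3849241, 3850740)
print(length)                     # 1500
print(protein_length_aa(length))  # 499
```

Both values agree with the Gene Length and Protein Length fields of this record.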


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTACATT TGTTTGCTGG CCTGGATTTG CATACCGGGC TGTTATTATT GCTTGCACTG 
GCTTTTGTGC TGTTCTACGA AGCCATCAAT GGTTTCCATG ACACAGCCAA CGCCGTGGCA
ACCGTTATCT ATACCCGCGC GATGCGTTCT CAGCTCGCCG TGGTTATGGC GGCGGTGTTC
AACTTTTTGG GTGTTTTGCT GGGTGGTCTG AGTGTTGCCT ATGCCATTGT GCATATGCTG
CCGACGGATC TGCTGCTTAA TATGGGATCG TCTCATGGCC TTGCCATGGT GTTCTCTATG
TTGCTGGCGG CGATTATCTG GAACCTGGGT ACCTGGTACT TTGGTTTACC TGCATCCAGC
TCTCATACGC TGATTGGCGC GATCATCGGG ATTGGTTTAA CCAATGCGTT GATGACCGGG
ACGTCAGTGG TGGATGCACT CAATATCCCG AAAGTATTAA GTATTTTCGG TTCTCTGATC
GTTTCCCCTA TTGTCGGCCT GGTGTTTGCT GGCGGTCTGA TTTTCTTGCT GCGTCGCTAC
TGGAGCGGCA CCAAGAAACG CGCCCGTATC CACCTGACCC CAGCGGAGCG TGAAAAGAAA
GACGGCAAGA AAAAGCCGCC GTTCTGGACG CGTATTGCGC TGATCCTTTC CGCTATCGGC
GTGGCGTTTT CGCACGGCGC GAACGATGGT CAGAAAGGTA TTGGTCTGGT TATGTTGGTA
TTGATTGGCG TCGCACCAGC AGGCTTCGTG GTGAACATGA ATGCCACGGG CTACGAAATC
ACCCGTACCC GTGATGCCAT CAACAACGTG GAAGCTTACT TTGAGCAGCA CCCTGCGCTG
CTGAAACAGG CTACCAGTGC TGATCAGTTA GTACCGGCTC CGGAAGCTGG CGCAACGCAA
CCTGCGGAGT TCCATTGCCA TCCGTCGAAT ACCATTAACG CGCTCAACCG CCTGAAAGGC
ATGTTGACTA CCGATGTGGA AAGCTACGAT AAGCTGTCGC TTGATCAACG TAGCCAGATG
CGCCGCATTA TGCTGTGCGT TTCTGACACT ATCGATAAAG TGGTGAAGAT GCCTGGTGTG
AGTGCTGACG ATCAGCGTCT GTTGAAGAAA CTGAAGTCCG ACATGCTTAG CACCATCGAG
TATGCGCCGG TGTGGATCAT CATGGCGGTC GCGCTGGCGT TAGGTATCGG TACGATGATT
GGCTGGCGTC GTGTGGCAAC GACTATCGGT GAGAAAATCG GTAAGAAAGG CATGACCTAC
GCTCAGGGGA TGTCTGCCCA GATGACGGCG GCAGTGTCTA TCGGCCTGGC GAGTTATACC
GGGATGCCGG TTTCCACTAC CCACGTACTC TCCTCTTCTG TCGCGGGGAC GATGGTGGTA
GATGGCGGCG GCTTACAGCG TAAAACCGTA ACCAGTATTC TGATGGCCTG GGTATTTACC
CTTCCGGCTG CGGTACTGCT TTCCGGCGGG CTGTACTGGC TCTCCTTGCA GTTCCTGTAA
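
The GC content reported for this gene is the percentage of G and C bases in the sequence above. A minimal sketch of that calculation follows. It assumes the sequence is pasted as plain text in blocks of ten bases, as shown here, so whitespace is stripped first. The helper name `gc_content` is illustrative.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Example on the first row of the gene sequence only; pass the full
# 1500 bp sequence to compare against the GC content field of the record.
first_row = "ATGCTACATT TGTTTGCTGG CCTGGATTTG CATACCGGGC TGTTATTATT GCTTGCACTG"
print(round(gc_content(first_row), 1))
```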
 
Protein sequence
MLHLFAGLDL HTGLLLLLAL AFVLFYEAIN GFHDTANAVA TVIYTRAMRS QLAVVMAAVF 
NFLGVLLGGL SVAYAIVHML PTDLLLNMGS SHGLAMVFSM LLAAIIWNLG TWYFGLPASS
SHTLIGAIIG IGLTNALMTG TSVVDALNIP KVLSIFGSLI VSPIVGLVFA GGLIFLLRRY
WSGTKKRARI HLTPAEREKK DGKKKPPFWT RIALILSAIG VAFSHGANDG QKGIGLVMLV
LIGVAPAGFV VNMNATGYEI TRTRDAINNV EAYFEQHPAL LKQATSADQL VPAPEAGATQ
PAEFHCHPSN TINALNRLKG MLTTDVESYD KLSLDQRSQM RRIMLCVSDT IDKVVKMPGV
SADDQRLLKK LKSDMLSTIE YAPVWIIMAV ALALGIGTMI GWRRVATTIG EKIGKKGMTY
AQGMSAQMTA AVSIGLASYT GMPVSTTHVL SSSVAGTMVV DGGGLQRKTV TSILMAWVFT
LPAAVLLSGG LYWLSLQFL
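
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on short fragments of the gene. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ only in which codons may act as start codons), and it assumes an unambiguous A/C/G/T sequence. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGCTACATT TGTTTGCTGG CCTGGATTTG"))  # MLHLFAGLDL
# Last four codons of the gene sequence, ending in the TAA stop codon.
print(translate("CAGTTCCTGTAA"))                      # QFL
```

The two outputs match the first ten and last three residues of the protein sequence above.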