Gene EcolC_0223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0223 
Symbol 
ID6066098 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp254317 
End bp255816 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content54% 
IMG OID641599624 
Productphosphate transporter 
Protein accessionYP_001723231 
Protein GI170018277 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0306] Phosphate/sulphate permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTACATT TGTTTGCTGG CCTGGATTTG CATACCGGGC TGTTATTATT GCTTGCACTG 
GCTTTTGTGC TGTTCTACGA AGCCATCAAT GGTTTCCATG ACACAGCCAA CGCCGTGGCA
ACCGTTATCT ATACCCGCGC GATGCGTTCT CAGCTCGCCG TGGTTATGGC GGCGGTGTTC
AACTTTTTGG GTGTTTTGCT GGGTGGTCTG AGTGTTGCCT ATGCCATTGT GCATATGCTG
CCGACGGATC TGCTGCTTAA TATGGGATCG TCTCATGGCC TTGCCATGGT GTTCTCTATG
TTGCTGGCGG CGATTATCTG GAACCTGGGT ACCTGGTACT TTGGTTTACC TGCATCCAGC
TCTCATACGC TGATTGGCGC GATCATCGGG ATTGGTTTAA CCAATGCGTT GATGACCGGG
ACGTCAGTGG TGGATGCACT CAATATCCCG AAAGTATTAA GTATTTTCGG TTCTCTGATC
GTTTCCCCTA TTGTCGGCCT GGTGTTTGCT GGCGGTCTGA TTTTCTTGCT GCGTCGCTAC
TGGAGCGGCA CCAAGAAACG CGCCCGTATC CACCTGACCC CAGCGGAGCG TGAAAAGAAA
GACGGCAAGA AAAAGCCGCC GTTCTGGACG CGTATTGCGC TGATCCTTTC CGCTATCGGC
GTGGCGTTTT CGCACGGCGC GAACGATGGT CAGAAAGGCA TTGGTCTGGT TATGTTGGTA
TTGATTGGCG TCGCACCAGC AGGCTTCGTG GTGAATATGA ATGCCACTGG CTACGAAATC
ACCCGTACCC GTGATGCCAT CAACAACGTC GAAGCTTACT TTGAGCAGCA TCCTGCGCTG
CTGAAACAGG CTACCGGTGC TGATCAGTTA GTACCGGCTC CGGAAGCTGG CGCAACGCAA
CCCGCGGAGT TCCATTGCCA TCCGTCGAAT ACCATTAACG CGCTCAACCG CCTGAAAGGC
ATGTTGACTA CCGATGTGGA AAGCTACGAC AAGCTGTCGC TTGATCAACG TAGCCAGATG
CGCCGCATTA TGCTGTGCGT TTCTGACACT ATCGACAAAG TGGTGAAGAT GCCTGGCGTG
AGTGCTGACG ATCAGCGCCT GTTGAAGAAA CTGAAGTCCG ACATGCTTAG CACCATCGAG
TATGCACCGG TGTGGATCAT CATGGCGGTC GCGCTGGCGT TAGGTATCGG TACGATGATT
GGCTGGCGTC GTGTGGCAAC GACTATCGGT GAGAAAATCG GTAAGAAAGG CATGACCTAC
GCTCAGGGGA TGTCTGCCCA GATGACGGCG GCAGTGTCTA TCGGCCTGGC GAGTTATACC
GGGATGCCGG TTTCCACTAC TCACGTACTC TCCTCTTCTG TCGCGGGGAC GATGGTGGTA
GATGGCGGCG GCTTACAGCG TAAAACCGTA ACCAGTATTC TGATGGCCTG GGTGTTTACC
CTTCCGGCTG CGGTACTGCT TTCCGGCGGG CTGTACTGGC TCTCCTTGCA GTTCCTGTAA
 
Protein sequence
MLHLFAGLDL HTGLLLLLAL AFVLFYEAIN GFHDTANAVA TVIYTRAMRS QLAVVMAAVF 
NFLGVLLGGL SVAYAIVHML PTDLLLNMGS SHGLAMVFSM LLAAIIWNLG TWYFGLPASS
SHTLIGAIIG IGLTNALMTG TSVVDALNIP KVLSIFGSLI VSPIVGLVFA GGLIFLLRRY
WSGTKKRARI HLTPAEREKK DGKKKPPFWT RIALILSAIG VAFSHGANDG QKGIGLVMLV
LIGVAPAGFV VNMNATGYEI TRTRDAINNV EAYFEQHPAL LKQATGADQL VPAPEAGATQ
PAEFHCHPSN TINALNRLKG MLTTDVESYD KLSLDQRSQM RRIMLCVSDT IDKVVKMPGV
SADDQRLLKK LKSDMLSTIE YAPVWIIMAV ALALGIGTMI GWRRVATTIG EKIGKKGMTY
AQGMSAQMTA AVSIGLASYT GMPVSTTHVL SSSVAGTMVV DGGGLQRKTV TSILMAWVFT
LPAAVLLSGG LYWLSLQFL