Gene EcolC_4010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4010 
Symbol 
ID6064565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4404561 
End bp4406192 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content55% 
IMG OID641603421 
Productsodium-dependent inorganic phosphate (Pi) transporter 
Protein accessionYP_001726936 
Protein GI170021982 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1283] Na+/phosphate symporter 
TIGRFAM ID[TIGR00704] Na/Pi-cotransporter
[TIGR01013] Phosphate:Na+ Symporter (PNaS) Family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0189192 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTAACGC TGCTTCACCT GCTTTCTGCC GTCGCCCTGC TGGTCTGGGG GACTCATATT 
GTTCGAACCG GCGTAATGCG CGTCTTCGGC GCGCGTTTGC GTACTGTCCT TAGCCGGAGC
GTCGAAAAAA AGCCGCTCGC CTTTTGCGCG GGGATCGGCG TTACCGCACT GGTACAGAGC
AGTAATGCCA CCACCATGCT GGTGACCTCG TTCGTCGCTC AGGATCTGGT AGCCCTCGCA
CCGGCTCTGG TCATTGTGCT GGGGGCAGAT GTCGGGACGG CGCTAATGGC GCGTATTCTC
ACCTTCGACT TATCCTGGCT GTCACCGTTA CTTATTTTTA TCGGCGTGAT TTTTTTCCTC
GGACGCAAAC AGTCACGCGC CGGGCAACTG GGCCGCGTCG GTATTGGTCT TGGGCTGATT
TTGCTAGCGC TGGAGTTGAT TGTGCAGGCC GTAACGCCGA TCACCCAGGC AAACGGCGTT
CAGGTGATCT TTGCCTCGCT GACCGGCGAT ATTCTGCTGG ATGCGCTGAT TGGCGCGATG
TTCGCCATTA TCAGCTACTC CAGCCTTGCT GCTGTATTGC TGACTGCGAC TCTGACCGCC
GCAGGCATTA TCTCCTTCCC CGTGGCGCTC TGTCTGGTGA TTGGTGCTAA CCTCGGTTCC
GGCCTGCTGG CGATGCTCAA CAACAGTGCC GCCAATGCCG CAGCCCGCCG TGTCGCGCTG
GGTAGTTTGC TGTTTAAGCT GGTGGGTAGC CTGATTATCC TGCCGTTTGT CCATTTGCTG
GCAGAGACAA TGGGGAAGTT GTCATTGCCA AAAGCGGAAC TGGTGATCTA TTTCCACGTC
TTCTACAACC TTGTACGTTG CCTGGTCATG CTGCCATTTG TTGACCCGAT GGCACGGTTT
TGCAAAACGA TTATTCGCGA TGAACCGGAA CTGGATACCC AGCTACGGCC TAAACATCTG
GATGTCAGCG CGCTGGATAC GCCCACGCTT GCTCTGGCGA ACGCCGCGCG CGAAACCCTG
CGCATTGGCG ACGCCATGGA ACAGATGATG GAAGGGCTGA ATAAAGTGAT GCACGGCGAG
CCACGGCAGG AGAAAGAGTT GCGTAAGCTG GCAGATGATA TCAACGTTCT CTATACCGCC
ATTAAGCTGT ATCTGGCGCG GATGCCAAAA GAGGAACTGG CAGAAGAAGA GTCGCGTCGC
TGGGCGGAGA TTATTGAAAT GTCGCTCAAC CTTGAACAGG CTTCCGATAT CGTCGAGCGC
ATGGGCAGCG AAATTGCTGA TAAATCGCTG GCAGCACGGC GGGCATTTTC GCTTGATGGC
TTGAAGGAAC TGGATGCGCT CTATGAGCAA TTGCTCAGTA ATTTAAAACT GGCAATGTCG
GTGTTCTTCT CTGGCGATGT CACCAGCGCT CGTCGTTTGC GTCGCAGCAA ACATCGTTTT
CGCATTCTTA ATCGCCGCTA TTCCCACGCC CACGTCGATC GCCTGCATCA GCAAAACGTG
CAGAGCATTG AAACCAGTTC GCTACATTTA GGCTTACTGG GAGATATGCA GCGCCTGAAC
TCGCTGTTTT GTTCGGTGGC TTACAGTGTG CTGGAACAGC CGGATGAAGA TGAAGGACGG
GACGAGTATT AA
 
Protein sequence
MLTLLHLLSA VALLVWGTHI VRTGVMRVFG ARLRTVLSRS VEKKPLAFCA GIGVTALVQS 
SNATTMLVTS FVAQDLVALA PALVIVLGAD VGTALMARIL TFDLSWLSPL LIFIGVIFFL
GRKQSRAGQL GRVGIGLGLI LLALELIVQA VTPITQANGV QVIFASLTGD ILLDALIGAM
FAIISYSSLA AVLLTATLTA AGIISFPVAL CLVIGANLGS GLLAMLNNSA ANAAARRVAL
GSLLFKLVGS LIILPFVHLL AETMGKLSLP KAELVIYFHV FYNLVRCLVM LPFVDPMARF
CKTIIRDEPE LDTQLRPKHL DVSALDTPTL ALANAARETL RIGDAMEQMM EGLNKVMHGE
PRQEKELRKL ADDINVLYTA IKLYLARMPK EELAEEESRR WAEIIEMSLN LEQASDIVER
MGSEIADKSL AARRAFSLDG LKELDALYEQ LLSNLKLAMS VFFSGDVTSA RRLRRSKHRF
RILNRRYSHA HVDRLHQQNV QSIETSSLHL GLLGDMQRLN SLFCSVAYSV LEQPDEDEGR
DEY