Gene EcolC_1262 details


Gene Information

Locus tag: EcolC_1262
Symbol:
ID: 6064788
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1379442
End bp: 1381169
Gene Length: 1728 bp
Protein Length: 575 aa
Translation table: 11
GC content: 50%
IMG OID: 641600677
Product: phosphoenolpyruvate-protein phosphotransferase
Protein accession: YP_001724255
Protein GI: 170019301
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria)
TIGRFAM ID: [TIGR01417] phosphoenolpyruvate-protein phosphotransferase
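
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon (both are assumptions, although they agree with the values listed). The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1379442, 1381169)
print(length, protein_length_aa(length))  # prints: 1728 575
```

Under these assumptions the result matches the listed Gene Length (1728 bp) and Protein Length (575 aa).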


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000565297
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000416053
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGATTTCAG GCATTTTAGC ATCCCCGGGT ATCGCTTTCG GTAAAGCTCT GCTTCTGAAA 
GAAGACGAAA TTGTCATTGA CCGGAAAAAA ATTTCTGCCG ACCAGGTTGA TCAGGAAGTT
GAACGTTTTC TGAGCGGTCG TGCCAAGGCA TCAGCCCAGC TGGAAACGAT CAAAACGAAA
GCTGGTGAAA CGTTCGGTGA AGAAAAAGAA GCCATCTTTG AAGGGCATAT TATGCTGCTC
GAAGATGAGG AGCTGGAGCA GGAAATCATA GCCCTGATTA AAGATAAGCA CATGACAGCT
GACGCAGCTG CTCATGAAGT TATCGAAGGT CAGGCTTCTG CCCTGGAAGA GCTGGATGAT
GAATACCTGA AAGAACGTGC GGCTGACGTA CGTGATATCG GTAAGCGCCT GCTGCGCAAC
ATCCTGGGCC TGAAGATTAT CGACCTGAGC GCCATTCAGG ATGAAGTCAT TCTGGTTGCC
GCTGACCTGA CGCCGTCCGA AACCGCACAG CTGAACCTGA AGAAGGTGCT GGGTTTCATC
ACCGACGCGG GTGGCCGTAC TTCCCACACC TCTATCATGG CGCGTTCTCT GGAACTACCT
GCTATCGTGG GTACCGGTAG CGTCACCTCT CAGGTGAAAA ATGACGACTA TCTGATTCTG
GATGCCGTAA ATAATCAGGT TTACGTCAAT CCAACCAACG AAGTTATTGA TAAAATGCGC
GCTGTTCAGG AGCAAGTGGC TTCTGAAAAA GCAGAGCTTG CTAAACTGAA AGATCTGCCA
GCTATTACGC TGGACGGTCA CCAGGTAGAA GTATGCGCTA ACATTGGTAC GGTTCGTGAC
GTTGAAGGTG CAGAGCGTAA CGGCGCTGAA GGCGTTGGTC TGTATCGTAC TGAGTTCCTG
TTCATGGACC GCGACGCACT GCCCACTGAA GAAGAACAGT TTGCTGCTTA CAAAGCAGTG
GCTGAAGCGT GTGGCTCGCA AGCGGTTATC GTTCGTACCA TGGACATCGG CGGCGACAAA
GAGCTGCCAT ACATGAACTT CCCGAAAGAA GAGAACCCGT TCCTCGGCTG GCGCGCTATC
CGTATCGCGA TGGATCGTAA AGAGATCCTG CGCGATCAGC TCCGCGCTAT CCTGCGTGCC
TCGGCTTTCG GTAAATTGCG CATTATGTTC CCGATGATCA TCTCTGTTGA AGAAGTGCGT
GCACTGCGCA AAGAGATCGA AATCTACAAA CAGGAACTGC GCGACGAAGG TAAAGCGTTT
GACGAGTCAA TTGAAATCGG CGTAATGGTG GAAACACCGG CTGCCGCAAC AATTGCACGT
CATTTAGCCA AAGAAGTTGA TTTCTTTAGT ATCGGCACCA ATGATTTAAC GCAGTACACT
CTGGCAGTTG ACCGTGGTAA TGATATGATT TCACACCTTT ACCAGCCAAT GTCACCGTCC
GTGCTGAACT TGATCAAGCA AGTTATTGAT GCTTCTCATG CTGAAGGCAA ATGGACTGGC
ATGTGTGGTG AGCTTGCTGG CGATGAACGT GCTACACTTC TGTTGCTGGG GATGGGTCTG
GACGAATTCT CTATGAGCGC CATTTCTATC CCGCGCATTA AGAAGATTAT CCGTAACACG
AACTTCGAAG ATGCGAAGGT GTTAGCAGAG CAGGCTCTTG CTCAACCGAC AACGGACGAG
TTAATGACGC TGGTTAACAA GTTCATTGAA GAAAAAACAA TCTGCTAA
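
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined before it can be measured. The following is a small Python sketch of that clean-up together with a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used here for brevity.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence shown above (6 blocks of 10 bases).
first_row = "ATGATTTCAG GCATTTTAGC ATCCCCGGGT ATCGCTTTCG GTAAAGCTCT GCTTCTGAAA"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 2))
```

Applying the same two functions to all rows of the gene sequence would give the full length and an overall GC fraction to compare with the Gene Length and GC content fields.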
 
Protein sequence
MISGILASPG IAFGKALLLK EDEIVIDRKK ISADQVDQEV ERFLSGRAKA SAQLETIKTK 
AGETFGEEKE AIFEGHIMLL EDEELEQEII ALIKDKHMTA DAAAHEVIEG QASALEELDD
EYLKERAADV RDIGKRLLRN ILGLKIIDLS AIQDEVILVA ADLTPSETAQ LNLKKVLGFI
TDAGGRTSHT SIMARSLELP AIVGTGSVTS QVKNDDYLIL DAVNNQVYVN PTNEVIDKMR
AVQEQVASEK AELAKLKDLP AITLDGHQVE VCANIGTVRD VEGAERNGAE GVGLYRTEFL
FMDRDALPTE EEQFAAYKAV AEACGSQAVI VRTMDIGGDK ELPYMNFPKE ENPFLGWRAI
RIAMDRKEIL RDQLRAILRA SAFGKLRIMF PMIISVEEVR ALRKEIEIYK QELRDEGKAF
DESIEIGVMV ETPAAATIAR HLAKEVDFFS IGTNDLTQYT LAVDRGNDMI SHLYQPMSPS
VLNLIKQVID ASHAEGKWTG MCGELAGDER ATLLLLGMGL DEFSMSAISI PRIKKIIRNT
NFEDAKVLAE QALAQPTTDE LMTLVNKFIE EKTIC
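
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is a plain-Python illustration, not the method used by the database. It builds the codon table from the standard amino-acid string, on the understanding that translation table 11 uses the same sense-codon assignments as the standard code and differs in which start codons are permitted. The function name `translate` is illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def translate(seq: str) -> str:
    """Translate a cleaned DNA sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, with block spacing removed.
print(translate("ATGATTTCAGGCATTTTAGCATCCCCGGGT"))  # prints: MISGILASPG
```

The output matches the first block of the protein sequence above, and the final bases of the gene (ATC TGC TAA) translate to IC followed by a stop, consistent with the protein ending in TIC.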