Gene Cag_1607 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1607 
Symbol 
ID3746473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2098294 
End bp2100057 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content46% 
IMG OID637774148 
Productphosphoenolpyruvate-protein phosphotransferase 
Protein accessionYP_379905 
Protein GI78189567 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTCCC GCAGCCGAAG AGAAGAGCGA ACCCCAAACG CATGTAACCA ATCTGAAACC 
CTATACAATG GCATAGGTGC CTCAAAAGGC ATTGCCATTG GCGAGTGTTA CGCCTTTATT
AAAGAGGAGA ACACGCATGA GCCAAGGGAA CTCAATGAGA AAAATTGCAA GGAGGAGGTT
GAACGCTTTT TAACCGCCTT AAGCCGCTCT GAGCATGAGC TTAAAAAAAT TGAGCAAGTA
ACCATCCGCA AGCTTGGCAA AAGCTATTCC AATCTTTTTC AAGCACAAAT TATGATGCTG
CACGATCCTG TGCTTGTGGA GACTATTTCA AAACGCATTG TAAGTGAACG CAAAGGCGCT
CATTTAGTGA TTGATGATGA ATTCAATAAA TATCTTCAGC ACTTTAAGAA CTCCGACCAT
ACGTTGTTTC AAGAGCGTGC CGATGATTTG CTTGATATTA AAAACCGCAT TATTCGCAAT
CTCGATATTC GCAAACTGCA CTCATGGATT CCCGAAGGGG TAATTGTTTG CTGCGATACC
CTTTCACCTG CCGATATTAT TCTTCTTAGC CGTAGCAACA TCAAGGGTTT TGTAACCGCA
ACGGGTGGAA AAACTTCCCA TATTTCGCTG ATTTGCAAAT CGCTCAAAAT TCCGATGGTG
GTGGGGCTGG GGCAAATTGC CGATAAAGTA GCAACGGGTA TGGCGGTAAT TATTGATGGT
GCCAATGGCA CCGTTATTAC CAATCCCTCG GCTGCAACGT TGGAGGAATA TTTCCTTAAA
CAAAAAAATG AGCAACAATC AAGCAGTAAC CTTCAAGCCG CCATGCTGCC CGCAACCACA
CAGTGTGGGG TTCGGGTAAG CTTTTATGCT AACATTGATT TTCGCGAAGA AGCCTTCTCT
CTTGCTGCTG CTGGTGCCGA AGGGGTTGGC TTGTTCCGTA GCGAAAATCT TTTTACCGAA
GGCACAAAAA CTCCAAAAGA AGAGGAGCAA TTTACTTGTT ACCGTGCTAT TGCCGATGCC
ATTGCTCCCA TGAGGCTTGA TGTTCGGCTG TTTGATATTG GCGGCGATAA ATTGCTCTAT
TCTCCCGTTA AAGAGATCAA TCCAAATCTT GGCTGGCGCG GCATTCGTAT TTTGCTTGAT
TTGCCTGAAA TTCTCGACAC CCAAATTCGT GCCATCTTAC AGGCAAACAC TCATGGCAAT
ATTGATATTC TTATTCCCAT GGTGATGTCG CTGCATGAAA TTCGCACGGT TCGAGAATCG
GTAGAACGGC AATTTGCGGA GTTGCAAGCT GAGCGTGACG GGCAGATTAC TCAACCCGGA
ATTGGCGCTA TGATTGAGCT ACCCGCTGCG GTTGAATTAA TTGAAGAAAT TACCCAATGT
GTTGATTTTA TTAGCATTGG CACCAATGAC CTTACGCAAT ACACGCTTGC CGTTGACCGC
AACAATGTGA TTGTGCAAGA TTTATTTGAC CGTTTTCATC CTGCCGTTAT GCGCCAAGTG
CATCGCATTA TTCAGGTAGC GAACAAAAAT CACTGCTGTG CTATGATGTG TGGCGATATG
GCATCCGATT CCCTTGCGCT GCCCTTTTTG CTTGGATGCG GTTTGCGCAA CTTTAGCGTT
GTGGTTTCTG AGATTGCTGA ACTTAAAGCG CATGTTGCTC GCTATGCACT TGCCGAAACC
GAAGCGCTTG CGCAAGAGTG TCTTGCTCTC AATAACCCAG CAGCAATTAA AGCGCGGCTT
GAAGCTTTCC AAGCCGCCCA TTAA
 
Protein sequence
MSSRSRREER TPNACNQSET LYNGIGASKG IAIGECYAFI KEENTHEPRE LNEKNCKEEV 
ERFLTALSRS EHELKKIEQV TIRKLGKSYS NLFQAQIMML HDPVLVETIS KRIVSERKGA
HLVIDDEFNK YLQHFKNSDH TLFQERADDL LDIKNRIIRN LDIRKLHSWI PEGVIVCCDT
LSPADIILLS RSNIKGFVTA TGGKTSHISL ICKSLKIPMV VGLGQIADKV ATGMAVIIDG
ANGTVITNPS AATLEEYFLK QKNEQQSSSN LQAAMLPATT QCGVRVSFYA NIDFREEAFS
LAAAGAEGVG LFRSENLFTE GTKTPKEEEQ FTCYRAIADA IAPMRLDVRL FDIGGDKLLY
SPVKEINPNL GWRGIRILLD LPEILDTQIR AILQANTHGN IDILIPMVMS LHEIRTVRES
VERQFAELQA ERDGQITQPG IGAMIELPAA VELIEEITQC VDFISIGTND LTQYTLAVDR
NNVIVQDLFD RFHPAVMRQV HRIIQVANKN HCCAMMCGDM ASDSLALPFL LGCGLRNFSV
VVSEIAELKA HVARYALAET EALAQECLAL NNPAAIKARL EAFQAAH