Gene Apar_0600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0600 
Symbol 
ID8413457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp668626 
End bp670329 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content50% 
IMG OID645022175 
Productphosphoenolpyruvate-protein phosphotransferase 
Protein accessionYP_003179621 
Protein GI257784404 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.12696 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.706095 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGAGG GAGTCAATGC GTCTAACGGA ATCGGCATCG GTGCTGCTCA GGTTGCTGTA 
GATCCAGATC TTACTTTTAC TCCGCACACT GTTGAAGACA CTGCTGCAGA GAAGGCTCGT
TACGCAGAGG CTGTAACTAA GTTCATCGCA CAGACTAATG CGCAAATTGA GCGTATGACT
AAGACGGTGG GTGAGGAAGC TGCTGCAATT ATGGGTGCTC ACATTGAGTT TGCAGAGGAT
GAGGGCATCA AAGAGGCCGT TAACAGCGCT ATTGACGGTG GTACCTGTGT TGAGCAGGCT
GTAAGCGACG CATATGACAT GTATTACAAC ATGTTCCTTG GCATGGAGGA TGAACTCTTC
CGTGAGCGCG CAGCAGACGT TGCAGACGTA AAGACTGGTC TTCTCGCTGA TCTTCTTGGC
AAAGAGGTTG TTGACCTCTC TACGCTTCCA GAGAACTCCG TCATTGTCTG CCGTGAGCTG
ACTCCTTCAA TGACCGCAGA TATTGATAAG GACCACGTTG CAGGTATTGT TACCGAGACT
GGTGGTCGTA CTTCTCACTC CGCAATCATT GCTCGTGCTC TTGAGATCCC TGCAGTTCTT
TCTGTTCCTA ACATCACTTC TGAGGTTGCA ACTGGCAACG CTATTGTTGT CGACGGCACT
AACGGCAAGG TTGTTGTTAA TCCTTCTGAG GCTGAGCTTG CTGAGTACAA GGCTCAGGCT
GAGGCTTACG CTGCAGAGAA GGCTGCTCTT GAGGCTTATC GCGGTAAGGA GACCGTAACT
GCTGACGGCA TTAAGGTTCT GCTTGTTGCT AATATCGGTA ATCCAGATGA CGCTAACGGC
GCAGTTGACG CTGATGCTGA GGGTATCGGT CTTTTCCGCT CCGAGTTCCT GTTCATGGAT
GCAAAGGAGC TGCCAAGCGA GGAAGAGCAG TTTGCTGCTT ATCAGAAGGT TGCTCTGCGC
ATGAAGGATA AGCCAGTCAT CATCCGTACT CTTGATGTCG GTGGTGATAA AGAGATTCCT
TATCTCAACA TGAAGGCTGA GGAGAATCCA TTCATGGGCT TCCGCGCTAT TCGCTACTGC
CTTAACAATG CTGAGCAGTA CAAGGTTCAG CTCCGCGCCC TTCTCCGTGC ATCTGCATTT
GGTGACATCA AGATTATGCT TCCTCTTGTC ACTACTGTTG ACGAGGTTCG TCAGGCAAAA
GCTCTTGTTG AGGAGTGCAA GGGTGAGCTT GACGCTAAGG GTGTAGCATA CAACAAAGAT
ATTGAGGTAG GCACCATGAT CGAGACTCCA TCTGCATCTC TGATTGCAGA TAAGCTGGCT
CGTGAGTGCG ATTTCTTCTC CATTGGTACC AATGACCTTA TTGGCTACAC CATGTGCGCC
GACCGTGGCA ATGATCGCGT TGCATATCTC TACGAGGTCT ATCAGCCATC CGTCCTCCGT
TCCCTCAAGT ACCTCATTGG TGAGGGTAAC AAAAAGAAGA TTATGGTTGG CATGTGCGGT
GAGGCAGCTG CAGATCCACT GCTCATCCCA GTCCTTCTTT CCTTTGGCCT GGATGAGTTC
TCCGTCTCTG CTCCATCTGT CCTGCGTACC CGCAAGACCA TTGCAGCTTG GACAAAGGCT
GAGGCAGACG AGCTTACTGC TCGTGTCATG GAGCTTGATA CCGCAGCTGA GGTTAAGGCT
CTGCTTGAGC GTGAAGCTCG CTAA
 
Protein sequence
MFEGVNASNG IGIGAAQVAV DPDLTFTPHT VEDTAAEKAR YAEAVTKFIA QTNAQIERMT 
KTVGEEAAAI MGAHIEFAED EGIKEAVNSA IDGGTCVEQA VSDAYDMYYN MFLGMEDELF
RERAADVADV KTGLLADLLG KEVVDLSTLP ENSVIVCREL TPSMTADIDK DHVAGIVTET
GGRTSHSAII ARALEIPAVL SVPNITSEVA TGNAIVVDGT NGKVVVNPSE AELAEYKAQA
EAYAAEKAAL EAYRGKETVT ADGIKVLLVA NIGNPDDANG AVDADAEGIG LFRSEFLFMD
AKELPSEEEQ FAAYQKVALR MKDKPVIIRT LDVGGDKEIP YLNMKAEENP FMGFRAIRYC
LNNAEQYKVQ LRALLRASAF GDIKIMLPLV TTVDEVRQAK ALVEECKGEL DAKGVAYNKD
IEVGTMIETP SASLIADKLA RECDFFSIGT NDLIGYTMCA DRGNDRVAYL YEVYQPSVLR
SLKYLIGEGN KKKIMVGMCG EAAADPLLIP VLLSFGLDEF SVSAPSVLRT RKTIAAWTKA
EADELTARVM ELDTAAEVKA LLEREAR