Gene Apre_0200 details

Gene Information

Locus tag: Apre_0200
Symbol:
ID: 8396951
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 226527
End bp: 227516
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 39%
IMG OID: 644994538
Product: protein-N(pi)-phosphohistidine--sugar phosphotransferase
Protein accession: YP_003151973
Protein GI: 257065717
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3444] Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIB
TIGRFAM ID: [TIGR00824] PTS system, mannose/fructose/sorbose family, IIA component
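
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which the record does not state but which agrees with the listed gene length. It also assumes a complete CDS that ends in a single stop codon. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 226527
END_BP = 227516


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues in a complete CDS: number of codons minus one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 990
    print(protein_length(length))  # 329
```

Under these assumptions both results agree with the Gene Length (990 bp) and Protein Length (329 aa) fields.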


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000259121
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTAGGAA TAATACTCGC AAGCCACGGC GGTTTTGCCG ATGGTATCAA AGAATCCGCT 
CAAATGATCT TTGGCGAGCA AGAAAAATTC GAATCAGTAT GCCTACTACC TTCAATGGGA
CCCGATGATT TTAGAGCAAA TCTCGAAAAA GCCATTGAAA AATTAGATAC TGAAGAGATT
CTTTTCTTGG TTGACCTTTG GGGCGGTACA CCATTTAACC AAAGCTCAAA TTTATTTGAG
GGAAACGAGG ATAAATGGGC AATCGTTGCT GGCATGAACC TTCCTATGGT TATAGAAGCT
TTAAGCGAGA GATTTACTGC AGAGAAATCT CATGATATAG CAAAGGCTAT AGTAGGATCA
GCCAAAGAAG GAGTTAAGAT TAAGCCGGAA GATCTTAACC CAGTAGAAGA AGCGAAAACA
GAAGTCAAGG AAGATAATAT TCCTAAGGGA TCTATCCCAG AAGGAACAGT TCTTGGAGAT
GGTAAGATCG ATATTGGTCT TGCAAGAATA GACACAAGAC TTCTCCACGG ACAAGTCGCT
ACAAGCTGGA CAAAGTCAAT AAATCCTGAC AGAATCATAG TTGTAAGTAA TAGCGTAAGC
AAAGACGAGC TAAGAAAGAA CATGGTAATG GAAGCAGCTC CTCCAGGAGT TAAGGCTCAC
GTAATCCCTA TTTGGAAGAT GAAGGAGATT ATGGATGATC CACGTTTTGG AGCAACTCGT
GCTTTATTAT TGTTTGAAAA ACCACAAGAT GTCCTAGAAT TCCTAGAACT AGGCGGAAAG
CTAGATAAGG TTAACCTAGG ATCAATGGCT TACAAACAAG GAGATATCAA CCTTACAAAC
GCTGTTTCAA TGAATGCTGA TGATGTTAAA TGTTTTGACA AAATCCTAGA ATACGGAATA
AAGATCGATG TCAGAAAAGT TCCAGCAGAC AAGAACGAAA ATTTCGACAA CTTGATGAAA
AAAGCTAAAA GAGAATTAAA TATTAATTAA
 
Protein sequence
MVGIILASHG GFADGIKESA QMIFGEQEKF ESVCLLPSMG PDDFRANLEK AIEKLDTEEI 
LFLVDLWGGT PFNQSSNLFE GNEDKWAIVA GMNLPMVIEA LSERFTAEKS HDIAKAIVGS
AKEGVKIKPE DLNPVEEAKT EVKEDNIPKG SIPEGTVLGD GKIDIGLARI DTRLLHGQVA
TSWTKSINPD RIIVVSNSVS KDELRKNMVM EAAPPGVKAH VIPIWKMKEI MDDPRFGATR
ALLLFEKPQD VLEFLELGGK LDKVNLGSMA YKQGDINLTN AVSMNADDVK CFDKILEYGI
KIDVRKVPAD KNENFDNLMK KAKRELNIN
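
The two sequences above can be checked against each other by translating the gene sequence and comparing the result with the protein sequence. The following is a minimal sketch, not the database's own method. It relies on translation table 11 assigning the same amino acids to codons as the standard genetic code, with the difference lying in the permitted start codons. The helper names `clean`, `gc_percent` and `translate` are hypothetical. The input is the first three 10-base blocks of the gene sequence.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; stop codons appear as '*'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    first_blocks = "ATGGTAGGAA TAATACTCGC AAGCCACGGC"
    print(translate(first_blocks))  # MVGIILASHG
```

Passing the full gene sequence to `translate` should give the protein sequence followed by `*` for the terminal TAA stop codon, and `gc_percent` on the full sequence can be compared with the GC content field.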