Gene Apar_0464 details

Gene Information

Locus tag: Apar_0464
Symbol:
ID: 8413313
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 533833
End bp: 534831
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 46%
IMG OID: 645022032
Product: Protein-N(pi)-phosphohistidine--sugar phosphotransferase
Protein accession: YP_003179486
Protein GI: 257784269
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2893] Phosphotransferase system, mannose/fructose-specific component IIA
        [COG3444] Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIB
TIGRFAM ID: [TIGR00824] PTS system, mannose/fructose/sorbose family, IIA component
            [TIGR00854] PTS system, mannose/fructose/sorbose family, IIB component
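
The coordinates and lengths listed above are mutually consistent, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed 999 bp) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of the source database.

```python
# Values copied from the record above; helper names are hypothetical.
START_BP = 533833
END_BP = 534831


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 999 332
```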


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 2.7828e-07
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.826743
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTTAGTA TCGTTCTAGC CAGTCATGGT AAGTTTGCCG AGGGTATTAA AGATTCTGGC 
AGCATGATTT TTGGACCACA GGAAGGCGTT GTAGCTATTA CGCTTACTCC TGATATGGGT
CCAGATGACC TGCATCAGAA GATTCTCGAC GCAATTACCA CGCTTGAAGA TCAAGAGCAC
GTTCTGTTCT TGGTTGACCT ATGGGGTGGC ACTCCCTTTA ACCAGATTTC TCGTGTTCTT
GAGGAGGAGG GTAAAGAAGA TTGGGTTGCT GTTACTGGTC TTAACCTCCC AATGCTTATA
GCAGCATATG GTTCTCGCCT TGGCGTAGAT ACCGCCACCG AGGTAGCAAA AGAGATTTTC
TCTGAAGCTC GTATGGGCGT AAAGATTAAG CCAGAAGAGC TTGAGCCACA AGAGGCAACA
CCTACTGATG TTCCTGCTGT TGCAACACCT AAGGGAGCAA TTCCTGTGGG AACCGTAATT
GGTGATGGCA AGCTTAAGAT TGTTCTTGCT CGTATTGATA CCCGTCTTCT GCATGGTCAG
GTTGCAACTA CCTGGACAAA GATGACCAAG CCAGACCGTA TCATTGTTTG TTCTGATGGT
GTTGCACAAG ATGAGCTTTG CAAGACCATG ATTGTTCAAG CAGCTCCTCC AGGAGTGCAC
GTTCACGTTG TTCCTATTAA GAAGATTATT GAGATTGCTC ACGATACTCG TTTTGGTAAT
ACCAAGGCAA TGCTCCTGTT CGAGACTCCT CAGGACATGC TTCGTGTCAT TGAAGGTGGC
GTAGAAATCA AGGAAGCTAA TCTTGGTTCT ATTGCTCACT CTGTTGGTAA GGTTGTTGTT
ACCAACGCAG TTGCTATGGA CGAAGACGAT GTAAAGACCC TTGAGGCTAT CCGAGAGTGC
GGTACAAAGT TTGATGTCCG TAAGGTTCCT GCAGACAGTG CAGAAAACTT TGATGCAATG
CTTAAAAAAG CTAAGTCCGA GCTTGCTAAT CGTAAGTAA
 
Protein sequence
MVSIVLASHG KFAEGIKDSG SMIFGPQEGV VAITLTPDMG PDDLHQKILD AITTLEDQEH 
VLFLVDLWGG TPFNQISRVL EEEGKEDWVA VTGLNLPMLI AAYGSRLGVD TATEVAKEIF
SEARMGVKIK PEELEPQEAT PTDVPAVATP KGAIPVGTVI GDGKLKIVLA RIDTRLLHGQ
VATTWTKMTK PDRIIVCSDG VAQDELCKTM IVQAAPPGVH VHVVPIKKII EIAHDTRFGN
TKAMLLFETP QDMLRVIEGG VEIKEANLGS IAHSVGKVVV TNAVAMDEDD VKTLEAIREC
GTKFDVRKVP ADSAENFDAM LKKAKSELAN RK
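
The protein sequence above can be related to the gene sequence with a short translation sketch. The Python below is a minimal illustration, not the pipeline used to produce this record. It assumes the standard codon assignments shared by NCBI translation table 11 and treats the table's alternative initiation codons (GTG here) as methionine when they occur in the first position. The names `CODON_TABLE`, `START_CODONS`, and `translate_cds` are hypothetical, and a codon containing anything other than A, C, G, or T raises a `KeyError`.

```python
# Codon table in NCBI order (bases T, C, A, G); the amino acid assignments
# are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons assumed for table 11; all are read as Met at position 0.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop."""
    dna = "".join(seq.split()).upper()
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is translated as methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGGTTAGTA TCGTTCTAGC CAGTCATGGT"))  # MVSIVLASHG
```

Passing the full 999 bp gene sequence to the same function should yield the 332-residue protein, since the sequence ends in a TAA stop codon.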