Gene Apar_0140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0140 
Symbol 
ID8412986 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp159752 
End bp160942 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content45% 
IMG OID645021710 
Productphosphopentomutase 
Protein accessionYP_003179167 
Protein GI257783950 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1015] Phosphopentomutase 
TIGRFAM ID[TIGR01696] phosphopentomutase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTAAAC GTGTTTTTGT GGTTGTTCTT GATAGTTTTG GCATTGGTGA AGAACCAGAT 
GCAGCAGCAT ATGGAGATGA GGGCAGCAAT ACCCTCTGCG CATGCGCAAC TAGCGGAGTT
CTTAATATTC CAAATATGAC AAAGCTGGGT CTTCTTAATA TTGATGGCGC GCTTGATACT
GTCTATGACG TTGATTTGAA GCCAATTGAG TCTCCTATGG GCGCATATGC TCGTATGCAG
GAGAAATCTG CTGGTAAAGA TACCACCGTT GGTCACTGGG AGATGGCAGG CGTCATTTCT
CCAAAGAAGT TCCCAACCTA TCCTGATGGC TTCCCGCAGG AGGTCATTGA GGAGTTTGAG
CAGAAGACCG GTCGCAAGGT TCTGTGCAAT AAACCTTACT CTGGAACTGA TGTAATTAGA
GATTTTGGTA AAGAGCACGT AGAGACTGGT GCTTTGATTG TTTACACTTC AGCAGATTCT
GTTTTCCAGA TTGCTGCACA TGAGGATGTG GTAAGCCCCG AGAAACTCTA TGAGTATTGC
CGCATTGCAC GTGAGATTCT GCAGGGTGAG CATGGTGTTG CTCGCGTTAT TGCTCGTCCT
TTTGAGGGAG AGTGGCCATA TCAGCGCACT TCTCGCCGTC ATGACTTCTC GCTTGAGCCA
ACTGGCACAA CTATGCTTGA CCGTCTCAAG GAGAACGGCT TTGACGTTCT TTCCATTGGC
AAAATTTATG ACATCTTTGC TCATCGTGGT ATGACTGAGT TTGAGTTTAC TACCTGCAAT
GCAGATGGAA TTCAGAAGAC TATTGAGGCT ACCAGTAAAG ACTTTAACGG TCTATGCTTC
ACTAACCTTG TTGATACTGA CATGATTTAC GGTCACCGCA ACGATCCTGT GGGATATTCC
AATGCTCTTT CTTACTTTGA TGAGCATATC CCTCAGATTA TTAAGGGTCT TCGTGAAGAT
GATCTGTTTA TCATTACCGC AGATCATGGC TGTGATCCTG TGACGCCATC TACTGACCAC
TCTCGTGAGT ATGTACCACT GCTTATCACA GGTCCAAAAG TCAAGCCAAA TACTAACCTT
GGTACAACCA CCACATTTGC CGACATGGCC GAGACAATTC TTGATTATTT TGGCGTTGAG
CAACTTGGTG TTGGAACTTC TCATCTATCA GAGATTCTTA AGGAGAATTA A
 
Protein sequence
MPKRVFVVVL DSFGIGEEPD AAAYGDEGSN TLCACATSGV LNIPNMTKLG LLNIDGALDT 
VYDVDLKPIE SPMGAYARMQ EKSAGKDTTV GHWEMAGVIS PKKFPTYPDG FPQEVIEEFE
QKTGRKVLCN KPYSGTDVIR DFGKEHVETG ALIVYTSADS VFQIAAHEDV VSPEKLYEYC
RIAREILQGE HGVARVIARP FEGEWPYQRT SRRHDFSLEP TGTTMLDRLK ENGFDVLSIG
KIYDIFAHRG MTEFEFTTCN ADGIQKTIEA TSKDFNGLCF TNLVDTDMIY GHRNDPVGYS
NALSYFDEHI PQIIKGLRED DLFIITADHG CDPVTPSTDH SREYVPLLIT GPKVKPNTNL
GTTTTFADMA ETILDYFGVE QLGVGTSHLS EILKEN