Gene Apar_1219 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1219 
Symbol 
ID8414098 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1365893 
End bp1368364 
Gene Length2472 bp 
Protein Length823 aa 
Translation table11 
GC content54% 
IMG OID645022813 
Productputative phosphoketolase 
Protein accessionYP_003180237 
Protein GI257785020 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3957] Phosphoketolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.514131 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.618192 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCAGC CCGTGTACGG TACACCTTGG CAGACCCTGA ATCAGCCTGT ATCTGAGGAA 
GAACTTGCCG GCGTCGATAG GTATTGGCGC GCAGCTAATT ACCTGTCTGT TGGTCAGATT
TATCTTCGCA GTAACCCTCT TATGAAGGAT GGTTTTTCTC GCGAGGATGT TAAGCACCGT
CTGGTTGGCC ACTGGGGAAC CACCCCAGGT CTAAACTTCC TGTTCGGCCA CGTCAACCGT
TTCATTCGCG ACCATAACCA GAACACCATC TTCCTCATGG GTCCTGGTCA CGGTGGACCT
GCAGGTACTG CACAGTCCCT GCTTGACGGA ACTTATCGTG AGACCTACCC ATTTATCACT
GACGATGAAG AGGGTCTCCA GAAGTTCTTC CGCCGCTTCT CTTATCCTGG CGGAATTCCT
TCCCACTATG CACCTGAGAC CCCAGGTTCC ATCCACGAGG GCGGCGAGCT GGGTTACACC
TTGTCTCACG CATACGGCGC TGTCTTGGAC AATCCATCCC TGCTTGCAGT TGCTGTTGTC
GGCGACGGTG AGGCAGAGAC CGGTCCACTT GCAACTTCTT GGCAGACCAA CAAGTTCATG
GACCCACTGA CCGATGGTAT TGTTCTTCCA ATCCTGCACC TCAACGGCTA CAAAATTGCC
AACCCAACCA TTCTTGCTCG TATTTCCGAT GGTGAACGCG ACGAGTTCTT CCGCGGCATG
GGCTATCACC CATATAATTT TGTTGCTGGC TTTGACGACG AGGATCACGC ATCCATCCAC
CGTCGTTTTG CAGCTCTGCT TGAGGCTGTC TTCAATGAGA TCTGCGCTAT CAAGACTCGC
GCAGCCGCAG GTGACGCATC GCGTCCTTAC TACCCAATGA TCATCTTCCG CACCCCTAAG
GGTTGGACTT GCCCTCCTTA TATTGATGGC AAGAAGACCG AGGGCTCTTG GCGCGCACAC
CAGGTTCCAC TGGCATCAGC TCGCGACACT GAGGCTCACT TCCAGGTTCT TCGTGATTGG
ATGAGCTCCT ACAATCCAGA GACCCTCTTC ACCGAGAAGG GCGCAATCCG TCCAGAGGTT
ACTTCCTTCA TGCCTAAGGG TGACCTCCGT CTTGGCGCTA ACCCTAACGC AAATGGTGGT
ACAATCCGTC GCAACCTCGT TCTTCCTGAT GCAAAGAAGT ACGAGATTCC AGTTGCCGAG
AAGGGCCACG GCTTTGGTGC TACTGAGGCA ACGCGCGTTC TTGGCGAGTA CACCGCTGAG
CTTATTAACA GCAACCGTTC TGAGTTCCGT ATCTTCGGAC CTGATGAGAC CGCTTCTAAC
CGTCTGCAGC CTTCCTTCCA GGTAACCGAC AAGCAGTGGT TTGGTGGCTT TAACGACGAC
TTTGAGAACG ACGAGCATAT TTCCCCAGTT GGTAACGTTA TTGAGCAGCT TTCCGAGCAC
CAGTGCGAGG GTCTGCTTGA GGGCTACACC CTAACTGGCC GCCACGGCAT TTGGTCTAGC
TACGAGTCAT TTGTCCACAT CATTGACTCA ATGATCAACC AGCACGCTAA GTGGCTTGAG
GCTACCGTCC GCCACATTCC ATGGCGCAAA CCTATTTCTG CTCTTAACCT GGTTCTTTCC
AGCCACGTTT GGCGTCAGGA CCACAACGGA TTCTCTCACC AGGATCCTGG TTTTGTTGAC
ATCATGCTTA ACAAGAGCTT CAACAACGAC CACATCACCA ACATCTACTT CCCAGCAGAC
GCTAACCTTC TGCTTGCAGT TGGCGAGAAG TGCTATACCT CCACAAACTG CATCAACGCT
ATCTTTGCTG GTAAGCAGCC TGCTCCAACG TGGGTAACCC TCGACGAGGC TCGCGAGGAG
CTTTCAGTTG GCGCTAAGGA GTGGAAGTGG GCTTCCAACG CTGAGGCTGG CGAGGAGGAC
ATTGTCCTTG CATCCTGCGG CGACGTTCCT ACTCAGGAGC TCCTTGCTGC TCTGGACATG
CTTGGTAAGC TGGGCATCAA GGCTCGCTTC GTTAACGTTG TTGACCTGCT CAAGATTCAG
AACGCTTCCG AGAACAACGA GGCTCTTTCT GACGAGGAGT TCACAAAGCT CTTCTCCGCA
GACAAGCCAG TTCTCTTCGC ATTCCACGCT TACGCTGGTT CTGTCCGTCG CCTGATTTGG
AACCGTCCAA ACCACGACAA CTTCAACGTT CACGGCTACG AGGAGCAGGG TTCCACCACA
ACTCCTTACG ACATGCTACG CTTGAACAAC ATGGACCGCT GGGCACTTGC TGCTGACGCT
CTCCGCATGA TTGACGCTCA GAAGTGGGCA GATCAGATTG ACGAGTGGGA GAAGTTCCGC
ACCGAAGCCT TTGAGTTTGC TGTCGAGAAG GGCTATGATC ACCCAGCATT TACTGATTGG
TCTTGGCCAG ACGCTTCCGA TGCTGATCAG AGCATCTCTG CAACTCAGGC AACCGCAGGC
GACAACGAGT AA
 
Protein sequence
MAQPVYGTPW QTLNQPVSEE ELAGVDRYWR AANYLSVGQI YLRSNPLMKD GFSREDVKHR 
LVGHWGTTPG LNFLFGHVNR FIRDHNQNTI FLMGPGHGGP AGTAQSLLDG TYRETYPFIT
DDEEGLQKFF RRFSYPGGIP SHYAPETPGS IHEGGELGYT LSHAYGAVLD NPSLLAVAVV
GDGEAETGPL ATSWQTNKFM DPLTDGIVLP ILHLNGYKIA NPTILARISD GERDEFFRGM
GYHPYNFVAG FDDEDHASIH RRFAALLEAV FNEICAIKTR AAAGDASRPY YPMIIFRTPK
GWTCPPYIDG KKTEGSWRAH QVPLASARDT EAHFQVLRDW MSSYNPETLF TEKGAIRPEV
TSFMPKGDLR LGANPNANGG TIRRNLVLPD AKKYEIPVAE KGHGFGATEA TRVLGEYTAE
LINSNRSEFR IFGPDETASN RLQPSFQVTD KQWFGGFNDD FENDEHISPV GNVIEQLSEH
QCEGLLEGYT LTGRHGIWSS YESFVHIIDS MINQHAKWLE ATVRHIPWRK PISALNLVLS
SHVWRQDHNG FSHQDPGFVD IMLNKSFNND HITNIYFPAD ANLLLAVGEK CYTSTNCINA
IFAGKQPAPT WVTLDEAREE LSVGAKEWKW ASNAEAGEED IVLASCGDVP TQELLAALDM
LGKLGIKARF VNVVDLLKIQ NASENNEALS DEEFTKLFSA DKPVLFAFHA YAGSVRRLIW
NRPNHDNFNV HGYEEQGSTT TPYDMLRLNN MDRWALAADA LRMIDAQKWA DQIDEWEKFR
TEAFEFAVEK GYDHPAFTDW SWPDASDADQ SISATQATAG DNE