Gene Apar_0861 details

Gene Information

Locus tag: Apar_0861
Symbol:
ID: 8413727
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 956160
End bp: 957824
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 47%
IMG OID: 645022444
Product: Dak phosphatase
Protein accession: YP_003179881
Protein GI: 257784664
COG category: [R] General function prediction only
COG ID: [COG1461] Predicted kinase related to dihydroxyacetone kinase
TIGRFAM ID: [TIGR03599] DAK2 domain fusion protein YloV
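
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. Neither convention is stated on this page, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 956160
end_bp = 957824

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1665 bp
print(protein_length, "aa")   # 554 aa
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.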


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00152626
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.95295
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATTTCAA ATGTCATTCG TACATGCTTC CCTGTAGCAG CACTTGCTGT TGCAGACAAA 
GCAGAAGAGA TCAACAAGCT CAATGTTTTT CCTGTACCAG ATGGAGACAC CGGCACTAAC
ATGTCCCTTA CCCTGGGTAC CGTTGTTCGT GAGGTTCAAG ATCTTCCTCA AGATGCAAGT
ATGCAGGATA TTGCAAAGGC AATTACTCAC GGCTCTCTGA TGGGTGCTCG CGGTAACTCC
GGCGTTATTA CTTCTCAGAT TCTTCGCGGT ATCGCTGAGG GTCTTTGTGA CGTTAAGAAT
CCTGAGGCAG TAACTCCTAA GGATATTGCA CACGCATTCC GCCGCGGTAA AGAAGTTGCT
TTTAAGGCCG TTAGAAAGCC AGTTGAGGGT ACTATTCTTA CCGTTTTAAA GGACGTTTCT
GCTAAGGCAG ACTCTCTTGA GAAGTCTCAG CTTACCCCAG CAGAGGTCTT AGATGCCCTT
GTTGTTGAGG CATATGAATC CGTTGCCCGC ACTCCTGAGC TTCTTCCTGT TCTTAAAGAG
AACGGTGTTG TTGACTCTGG TGCATTTGGT TTTGCAACTT TCCTTGAGGG CTTTGTAAAC
GCTGTTACTG GTAAGACTGA AACTACTGAC TTTCAGACAA CTGTTTCAGT TTCTGACGCT
AAGGCTGCCA CCAGTGCAAA GGTTGAGATT GAGCTCAACG ATGACTGGGA GGGTTCTGAG
TACCGTTATT GCAATGAGTT CCTTTTCAAG GCAGATAGTC CTGACTTTGA CGAGGAAGCA
GCTTTGAATT TCCTTGCAAC TATGGGTGAT TGCGAGCTTC TTGTTGGCGC AAACCCAGAC
TACAAGATTC ACGTTCACTC AAATACTCCT AATAAGGTTC TTGAGTACAT GCTTCAGTAT
GGTCAGATTT TTGAGGTCTT TATTCATAAT ATGGACCTTG AGGCTAAGGA GCGTACCGAG
AAAATCGCTG AAGATAAGAA GGCTGCAGCA GTACCTAAAA AAGAGCTTGG CTTTGTTGCG
GTAACCGCTG GTTCTGGCGC CGCATCAATC TTGAAGTCTC TTGGCGTGGA CGTTGTTGTT
TCTGGTGGTC AGACCATGAA TCCTTCAACT GCAGATATTC TTGCTGCAAT TGAAGGGGCC
AATGCAGAAC AAGTTATTGT TATGCCAAAT AACTCTAACA TTCGTATGGC GGCAGAGGCG
GCTGCAAGTG CATGCGAGAA TATTAAGGTT GCAGTTATTC CAACCAAGTC TGTTCTTCAG
GCATTTGCTG CAATGTTTGT CGTTGCAGAT GGCGTTCCAT TTGAAGAGCT TGTCGAAGAG
ATGACTGATG CTATTTCTGG CGTTCGTTAT GGCGAGGTAA CTACCGCAGT TCGCGATTCT
TCCGCAGCTG ACGGTACTCC TATCCATGAT GGTGACGTCA TGGGTATCCA GGGAGGCTCC
ATTGATGTTG TCGGCTCCGA TGTCATGAAG GTCACGCTTG ATCTTATTGC AAAGATGCAA
GAGGAGGAAG AGGGTGACAA CCTCACCATT CTTGCAGGTG AGGATTTCTC TGATGAGCAG
CTCGATTTTC TCGCTAGCAG AGTCGAGGAA GCTTATCCAG ATCTTGAGGT TGACGCTCAG
CGCGGCGAGC AGCCACTCTA TCCAGTTATC TTCTCTATTG AGTAG
 
Protein sequence
MISNVIRTCF PVAALAVADK AEEINKLNVF PVPDGDTGTN MSLTLGTVVR EVQDLPQDAS 
MQDIAKAITH GSLMGARGNS GVITSQILRG IAEGLCDVKN PEAVTPKDIA HAFRRGKEVA
FKAVRKPVEG TILTVLKDVS AKADSLEKSQ LTPAEVLDAL VVEAYESVAR TPELLPVLKE
NGVVDSGAFG FATFLEGFVN AVTGKTETTD FQTTVSVSDA KAATSAKVEI ELNDDWEGSE
YRYCNEFLFK ADSPDFDEEA ALNFLATMGD CELLVGANPD YKIHVHSNTP NKVLEYMLQY
GQIFEVFIHN MDLEAKERTE KIAEDKKAAA VPKKELGFVA VTAGSGAASI LKSLGVDVVV
SGGQTMNPST ADILAAIEGA NAEQVIVMPN NSNIRMAAEA AASACENIKV AVIPTKSVLQ
AFAAMFVVAD GVPFEELVEE MTDAISGVRY GEVTTAVRDS SAADGTPIHD GDVMGIQGGS
IDVVGSDVMK VTLDLIAKMQ EEEEGDNLTI LAGEDFSDEQ LDFLASRVEE AYPDLEVDAQ
RGEQPLYPVI FSIE
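
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, in which GTG is one of the alternative initiation codons and is read as methionine at the start of a CDS. The following is a minimal sketch of that translation, not the database's own pipeline. The `translate` helper and the constant names are hypothetical. The codon table is the standard one, which table 11 shares for elongation, and the start-codon set is the one NCBI lists for table 11.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same
# codon-to-residue assignments for elongation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons listed by NCBI for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at '*'."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    usable = len(dna) - len(dna) % 3    # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")         # alternative start codon codes for Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break                       # stop codon ends translation
        protein.append(residue)
    return "".join(protein)


# First 30 nucleotides of the gene sequence shown above.
prefix = "GTGATTTCAA ATGTCATTCG TACATGCTTC"
print(translate(prefix))  # MISNVIRTCF
```

The output matches the first ten residues of the protein sequence listed above.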