Gene Apar_1061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1061 
Symbol 
ID8413934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1204714 
End bp1205880 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content48% 
IMG OID645022650 
ProductN-acetylglucosamine-6-phosphate deacetylase 
Protein accessionYP_003180080 
Protein GI257784863 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1820] N-acetylglucosamine-6-phosphate deacetylase 
TIGRFAM ID[TIGR00221] N-acetylglucosamine-6-phosphate deacetylase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.346962 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACAT TTGCAGTTAA AGCAGATAAG TTCTTTTTAC CAGGAGCAAC TTCTGGTCCA 
GGATATTTGC TCGTCGAAGA CGGCATATTT GGTCATTTCA CTAAAGAAAA GCCAGAGTGT
GAGATTATTG ACCGCACCGG TTCTTGGGTA GCTCCTGGTC TTGTTGATAC GCATATCCAC
GGTTTTCTCG ACCATGACAT TATGGATTGC GATCCTGACG GCGTCATTGA GATTGCTCAG
GGTCTGCTCT CTAATGGCGT AACTTCTTGG CTTCCCACAA CACTGACCGC AAGCGTTGAG
CAGACTGGTG ATGCTTGTGA GTCCGTTGCT GACGCAGCAG AGGGAATTGC GGCAAATGGT
ATTGATGCTG CTCGCATCCA GGGAATCTTT CTAGAGGGAC CATTCTTTAC CGAGAAGCAC
AAGGGAGCTC AAAATCCTGC GTACTTTCTT GACCCAGATG TGGATGTCTT TGATGAATGG
CAGGAGCGCG CTGATGGTTG GATTGCCAAG ATAGCTATTG CTCCAGAGCG CGATGGTGCT
CCAGAGTTCT GTGCAGAGAT GGCAGACCGT GGTGTTCATG TAGCCTTGGG ACACTCTGAT
GCAACTTTTG AAGAGGCTCT TGCATGTGTA AATGCTGGTG CTGATATCTT TGTTCATACT
TATAACGGCA TGAGTGGTCT TCACCATCGT GAGCCGGGTA TGGTGGGCGC TGCAATGACT
ACCCACGGTA CTTATGCAGA GGCAATTTGC GACGGTCACC ACCTTAATCC TATTGCAGTT
CGCGCTCTTG TGAATGCAAA GGGAGCAGAT CATACCGTTC TCATTACCGA TTGCATGCGC
GCAGGCGGTA TGCCTAATGG TCAGTACAAT CTTGGTGATT TCCCCGTTGT TGTTGAAGGT
GGGACTGCTC GCCTGATGGA TGACTCTCAC AGTCTTGCTG GCTCAATCCT TCGTCTGTTT
GAAGGCGTAA AGAACGTCTA TGACTGGGGA GTTGTATCTG CTGAAGAGGC AGTTCGCATG
GCTTCAGAAA ACCCAGCTCG CTCCTGTGGA ATTGATGATG TTTGCGGCTT TATTCGTCCT
GGATACGATG CAGACTTTAT TGTTATTACT AAGAATCTTC AACTTGAAGA GACGTTCCTT
GGTGGAAAGA GTGTCTACAA GGCTTAA
 
Protein sequence
MSTFAVKADK FFLPGATSGP GYLLVEDGIF GHFTKEKPEC EIIDRTGSWV APGLVDTHIH 
GFLDHDIMDC DPDGVIEIAQ GLLSNGVTSW LPTTLTASVE QTGDACESVA DAAEGIAANG
IDAARIQGIF LEGPFFTEKH KGAQNPAYFL DPDVDVFDEW QERADGWIAK IAIAPERDGA
PEFCAEMADR GVHVALGHSD ATFEEALACV NAGADIFVHT YNGMSGLHHR EPGMVGAAMT
THGTYAEAIC DGHHLNPIAV RALVNAKGAD HTVLITDCMR AGGMPNGQYN LGDFPVVVEG
GTARLMDDSH SLAGSILRLF EGVKNVYDWG VVSAEEAVRM ASENPARSCG IDDVCGFIRP
GYDADFIVIT KNLQLEETFL GGKSVYKA