Gene Apar_1016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1016 
Symbol 
ID8413888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1152055 
End bp1152957 
Gene Length903 bp 
Protein Length300 aa 
Translation table11 
GC content45% 
IMG OID645022605 
ProductHAD-superfamily hydrolase, subfamily IIB 
Protein accessionYP_003180036 
Protein GI257784819 
COG category[R] General function prediction only 
COG ID[COG0561] Predicted hydrolases of the HAD superfamily 
TIGRFAM ID[TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily
[TIGR01484] HAD-superfamily hydrolase, subfamily IIB 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.650618 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCAGAGC TTTCAACAGA AGAACTTATT ACTCAAAAAC AGTCTTGGCT TAATCAGGAT 
GTATCAAATC TGAAAGTAGA AATCGTCTTC TCCGATATGG ATGAGACATT TCTCCATACC
GATAAATCTC TTGTTGCTGA TAATATGGCA ATGCTAGACC GCCTGGCAGA TCTAGGTATT
CCTTTTGTAC CTTGTTCCGG TCGAGCGTTT AACGTGCTGC CAAAAGAGGT TGTGAATCAT
CCTGCTTCTA AGTATGCGAT TTCTTGTGAC GGTGCCGTTA TTAGAGACCT TTCTACTCAG
ACAGTGCTTT ACGAGTCTAC TGTTACCAAG GAACAGGCAC TTGAGTTGTA TCAAAAGCTC
AAGAATTATC CTGTGACATT TGATATTTTT GCTGATGGCG GAGTTTACCT AGAGCGCTCA
AGGCACAACC TTATTTCTCA GCTTGGTATT CCTGCAGATC ACGCGGCGTT TATGCAGCGT
TCTCGTACAC CTTTTGACCA AAGCGTCGAG GAATTTATTA ATAGCGTATC TACCATTGAA
CGACTTGGCT CCTATTGGTC AGTTTCAGAA GGCGAGAAGT ACCAAGCACT TGTAGCAGAT
GCTGTGGCTT CGGTAAAAGG ATTGAGTGGT ACGGCTTCCA TGAGTATGGG AATGGAAATC
TTGGCTGACG GCACTTCAAA GGGCGCCGCA CTTCAATGGC TCTGCAATCA TCTTGGCATT
TCAACAGAAA ATGCTGCAGC CTTTGGCGAC TCGCTTAATG ATGTTGCTAT GCTTTCTGCT
GCAGGAATGG GCACAGCAGT TGCTAATGCT CGTCGTGAGG ATTTTGAAGC TGCAACCTAT
ATCACCTTAA CTAACGATGA GGCGGGTTTC TCGCGATTTT TAAAGTGGGC ACTGGGGGAG
TAA
 
Protein sequence
MSELSTEELI TQKQSWLNQD VSNLKVEIVF SDMDETFLHT DKSLVADNMA MLDRLADLGI 
PFVPCSGRAF NVLPKEVVNH PASKYAISCD GAVIRDLSTQ TVLYESTVTK EQALELYQKL
KNYPVTFDIF ADGGVYLERS RHNLISQLGI PADHAAFMQR SRTPFDQSVE EFINSVSTIE
RLGSYWSVSE GEKYQALVAD AVASVKGLSG TASMSMGMEI LADGTSKGAA LQWLCNHLGI
STENAAAFGD SLNDVAMLSA AGMGTAVANA RREDFEAATY ITLTNDEAGF SRFLKWALGE