Gene Arth_0515 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0515 
Symbol 
ID4447000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp549897 
End bp550727 
Gene Length831 bp 
Protein Length276 aa 
Translation table11 
GC content64% 
IMG OID639688312 
ProductHAD family hydrolase 
Protein accessionYP_830014 
Protein GI116669081 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0647] Predicted sugar phosphatases of the HAD superfamily 
TIGRFAM ID[TIGR01452] phosphoglycolate/pyridoxal phosphate phosphatase family
[TIGR01457] HAD-superfamily subfamily IIA hydrolase, TIGR01457
[TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGAGT CAGACGAAGT ACGTTCATCA GCAGCGGTTT ACCGAAGCGG CCAGGAAATC 
GAATGCTGGC TGACTGACAT GGACGGCGTC CTGGTCCACG AAAACCAGCC GATCCCGGGC
GCCGCTGAAC TGATCCAGCG CTGGGTGGAC ACCTCCAAGC GTTTCCTGGT GCTCACCAAC
AACTCCATCT TCACGCCCCG CGACCTGGCC GCGCGCCTGC GTTCCTCCGG CCTGGAGATC
CCCGAGGAGA ACATCTGGAC TTCGGCCCTG GCCACCGCCC AGTTCCTCAA GGACCAGGTG
CGCGGCTCGG ATTCCGGGAA CCGCGCCTAC ACTATCGGCG AGGCAGGGCT TACGACGGCG
CTGCACGAGG CCGGCTTCAT CCTCACCGAC CAGAACCCGG ACTTTGTGGT GCTTGGCGAG
ACACGCACCT ACTCCTTCGA GGCCATCACG ATGGCCATCC GGCTAATCCT GGCAGGCGCC
CGCTTCATCG CCACCAACCC GGATGCCACG GGCCCGTCCA AAGACGGCCC CATGCCCGCC
ACCGGAGCCA TCGCGGCGCT GATTACCAAA GCCACCGGCC GTGAGCCCTA CATTGTGGGC
AAGCCGAACC CCATGATGTT CCGTTCGGCC ATGAACCAGA TCGACGCCCA TTCCGAGACC
ACCGCCATGA TCGGCGACCG GATGGACACC GACATCATCG CCGGCATGGA GGCCGGGCTG
CACACGGTGC TGGTCCTCAG CGGAATCACC CACAAGGACG ACATTGCCGC CTATCCGTTC
CGGCCCAACC AGATCCTGAA CTCGGTGGCA GACCTCAAGA GCCAGATCTA G
 
Protein sequence
MAESDEVRSS AAVYRSGQEI ECWLTDMDGV LVHENQPIPG AAELIQRWVD TSKRFLVLTN 
NSIFTPRDLA ARLRSSGLEI PEENIWTSAL ATAQFLKDQV RGSDSGNRAY TIGEAGLTTA
LHEAGFILTD QNPDFVVLGE TRTYSFEAIT MAIRLILAGA RFIATNPDAT GPSKDGPMPA
TGAIAALITK ATGREPYIVG KPNPMMFRSA MNQIDAHSET TAMIGDRMDT DIIAGMEAGL
HTVLVLSGIT HKDDIAAYPF RPNQILNSVA DLKSQI