Gene Arth_1800 details


Gene Information

Locus tag: Arth_1800
Symbol:
ID: 4445664
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2014919
End bp: 2015878
Gene Length: 960 bp
Protein Length: 319 aa
Translation table: 11
GC content: 63%
IMG OID: 639689618
Product: D-isomer specific 2-hydroxyacid dehydrogenase, NAD-binding
Protein accession: YP_831290
Protein GI: 116670357
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0111] Phosphoglycerate dehydrogenase and related dehydrogenases
TIGRFAM ID:
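
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends with a single stop codon that is not counted in the protein length. The function and variable names are hypothetical and are not part of the source record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2014919
end_bp = 2015878
reported_gene_length_bp = 960
reported_protein_length_aa = 319


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == reported_gene_length_bp)
print("protein length matches:", computed_protein_length == reported_protein_length_aa)
```

Under these assumptions both comparisons agree with the reported Gene Length and Protein Length values.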


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00330988
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCATGG CTTCGCTGAA CCTCACCGAG CCAGGCCGCA ACGGCCGCAT TGCAGTGACC 
CCCCGCTCCT TGTCAGACGG GGGGCACCCT GCCCTGCAGA AGCTGGAACG CGCAGGATAC
GAACTGGTTT ATCCGTCTCC CGGTGCAGTG CCCAATGAAG ATCAGATCCG CGCCGGCGTG
TCGGAGTGCG TCGGCTACCT TGCGGGCACC GAACGCCTTT CCGGACAGGT ACTGGAGGAC
CTTACTCGGC TGAAAGCCAT CTCCCGGAAC GGCGTCGGCG TGGATTCGAT CGATGTCGAA
GCGGCCGAGC GTCTGGGGAT CAACGTACTC ACCGCGCCAG GCGCCAACTC GCAGGGAGTA
GCGGAACTTA CCATCGCACT GATTCTGGCC GGGAGCCGCA GCATCCCCTG GCACGATGCC
CAGCTGAAGT CGGGCCAATG GAACCGCCGG CCCGGCAATG AAGTGTCAGG GAAAGTCCTT
GGTCTGATCG GATGCGGCCA GATCGGCCGG CGGGTTGCGA CGATGGCGCT TGGACTAGGC
ATGAAGGTGA TTGCCTTCGA CGAATATCCC GTGACATCGT TCGCTCCTTC GCCCGACTTC
TCATGGGCAC CACGGGAGCG TGTTTTGTCA TCGAGCCACG TCGTATCGCT GCACACTCCG
CCGTCCGGGC AACCGGTTCT CGGAGCCGCG GCAATCCGGC TGCTCCAATG GGGTACCGGC
GTCATCAACA CTGCGCGGGC ATCCCTGATC GACGACGAGG CGCTGCTACA GGCTCTCGAC
TCCGGGCAGG TCGAGTATCT GGCCACCGAC GTGTTCAGTT CCGAACCCCC TGCACCCAGC
CGGCTGATTA CGCACCCGAG GGTCATCACA ACGCCGCACA TCGGTGGATA CACTAAGGAA
AGCGTGGACC GAGCCACACA GGCCGCTGTG GACAACCTGC TTCACGCCCT CGCCACCTAG
 
Protein sequence
MSMASLNLTE PGRNGRIAVT PRSLSDGGHP ALQKLERAGY ELVYPSPGAV PNEDQIRAGV 
SECVGYLAGT ERLSGQVLED LTRLKAISRN GVGVDSIDVE AAERLGINVL TAPGANSQGV
AELTIALILA GSRSIPWHDA QLKSGQWNRR PGNEVSGKVL GLIGCGQIGR RVATMALGLG
MKVIAFDEYP VTSFAPSPDF SWAPRERVLS SSHVVSLHTP PSGQPVLGAA AIRLLQWGTG
VINTARASLI DDEALLQALD SGQVEYLATD VFSSEPPAPS RLITHPRVIT TPHIGGYTKE
SVDRATQAAV DNLLHALAT
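
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks used above, so spaces and line breaks must be ignored. The helper name `gc_percent` is hypothetical, and the example call uses only the first two blocks of the gene sequence rather than the full 960 bp.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence, copied from the record.
fragment = "ATGTCCATGG CTTCGCTGAA"
print(f"GC content of fragment: {gc_percent(fragment):.1f}%")
```

Passing the full gene sequence instead of the fragment gives a value that can be compared with the reported GC content.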