Gene Arth_0642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0642 
Symbol 
ID4446890 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp687442 
End bp688470 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content68% 
IMG OID639688441 
Productdehydrogenase 
Protein accessionYP_830141 
Protein GI116669208 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCAATT CCCAGCAGCA ATCTGAACAC CGTCCCCAAG CCCGGGCCTA TTGGACTGTC 
GGCCACGAAA AAGGCGAGCT CCGCACGGAA GAGCTGCCCG CGCCGGGCCC GGGTGAGGCG
CTGGTCCGCG CCCTGTATTC GGGAATCAGC AAAGGCACCG AAACCGTGGT CCACTGCGGC
AAAGTACCGC CACGGGTTGC CGAACAAATG CGGGCGCCCC TGCAGGAGGG GTCCTTCCCG
TCGCCGGTGA AGTTCGGTTA CCTCTCCGTG GGAATCGTGG AGGACGGCCC GGAGGGCTGG
GTGGGCCGTA CCGTGTTCTG CCTGCACCCG CACCAGGACC GCTACATTGT TCCGGTCGAG
TCCCTGACCG TGGTCCCGGA GAACGTCCCG GCCCGGCGGG CGGTTCTCAC CGGAACCGTC
GAAACGGCCG TCAACGCCCT GTGGGAAGCC GGCCCACGGC TCGGTGACCG CGTCGCCGTC
GTCGGCGCGG GCCTGGTGGG CGGCATGGTG GCCACCCTCC TTCGCACCTT CCCGCTGCAA
CGGCTCCAGC TGGTGGATGT CGATCCGGCG AAGCGGGCGT TCGCCGATGC CCTCGGCGTC
GAATTCGCCA ACCCCAACGA CGCACTGGCC GACTGCGACA TCGTTATCCA CTGTTCAGCT
TCCCAGGAAG GGCTCGAACG CAGCCTCCAG CTGGTGGGCG ACGAAGGGGA CGTCATTGAA
ATGTCCTGGT ACGCCGACCG CAAGGTCACC ATCCCGCTGG GGGAGGACTT CCACGCCCGC
CGGCTCTCCA TCCGCGCAAG CCAGGTGGGA GTGGTGGCAC GCGCCCGCCG CCACCGCCGG
ACCAACGCCG ACCGGCTGGC GCTCGCCGTG TCCCTGCTCA GCGATCCCGT CTACGACACG
TTCCTCACCG GCGCATCGTC GTTTGCGGAA CTTCCCGCCG TCGTGCACGA GCTGGCCGAG
GGCCGCCTGG ACGCCCTCTG CCACGTCATC GAATACCCTT CCGAACACCC CGCCGAAGAC
AAGAGGTAG
 
Protein sequence
MINSQQQSEH RPQARAYWTV GHEKGELRTE ELPAPGPGEA LVRALYSGIS KGTETVVHCG 
KVPPRVAEQM RAPLQEGSFP SPVKFGYLSV GIVEDGPEGW VGRTVFCLHP HQDRYIVPVE
SLTVVPENVP ARRAVLTGTV ETAVNALWEA GPRLGDRVAV VGAGLVGGMV ATLLRTFPLQ
RLQLVDVDPA KRAFADALGV EFANPNDALA DCDIVIHCSA SQEGLERSLQ LVGDEGDVIE
MSWYADRKVT IPLGEDFHAR RLSIRASQVG VVARARRHRR TNADRLALAV SLLSDPVYDT
FLTGASSFAE LPAVVHELAE GRLDALCHVI EYPSEHPAED KR