Gene Arth_2001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2001 
Symbol 
ID4445480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2255748 
End bp2257268 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content69% 
IMG OID639689810 
Productaldehyde dehydrogenase 
Protein accessionYP_831482 
Protein GI116670549 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.230641 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGCGG CCACGCACGA CGCTCCCGCT GTGGCCGCCC GTGCCGGGCT CAAGTTGCCG 
TATACGCACG TGGGGGACAT CTTCGTCGAC GGTGCCTGGA CCCCGGCACG CGGCACCGGC
CGCAACCCGG TCACCGATCC CGCCACGGGC GAGGTCTGGG GCTCCGTTCC GGACGGATCC
CCTGAAGACG TCGACGCCGC CGTCGGCTCC GCCCGCAGGG CTTTCGACGA CGGCATGTGG
CCGCGGCTGA CGCCGTCCGA GCGGGCGGCG TACCTGCTGC GCATCGCCGA GGAAGTGGAG
AAGCGCGCCG AGGAGCTCTC GCTGACGAAC ACCCGGGAGA ACGGCTCGCC GGTCTCGGAA
TCCGCGGGGG CCGCGGCCAA CGCCGCGGGC ATCTTCCGGT ACTTCGCCAC CCTCGCCGGT
TACCTTGAAC GCGAGGACGT GCGGGCTTTC CCCCAGGGCG GCGGCGAGTC CGTGGTCCGA
CGCGAACCAA TCGGTGTGTG CGCGCTGATC GCGCCCTGGA ACTTCCCGAT CAACCTCGTC
GTGATCAAGC TGGCTCCGGC GCTGCTGGCC GGCTGTACCG TGGTGATCAA GCCGGCGTCG
CCAACGCCGC TCTCGCTCCG GGTGATCATC GACGCCGTGG CCGCGGCCGG CGTCCCGGCC
GGCGTCGTCA ACCTGGTGAC CGGCTCCGGC CGGCTGGGGG ACTCCCTGGT GAAGCACCCG
GGCGTGGACA AGGTCGCCTT CACCGGTTCC ACCCCGGTCG GCCGGAAGAT CGCGGCGGCC
TGCGGCGAAC TGCTGCGCCC TGTCACGCTG GAGTTGGGCG GAAAGTCAAG CGCAATCGTG
CTGCCGGACG CGGACCTGGA CGCGATGTCC AAGGTGCTGA TCCGGTCCTC GATGCGCAAC
ACCGGGCAGA CCTGCTACAT CTCCACCCGG ATCCTGGCCC CCGCCAGCCG TTACGAGGAA
GTGGTGGACA TGGTCACCAG CACCATCGCC GCCGGCAAGC AGGGAGACCC GCTGGATCCG
GATACGGTCT TTGGCCCGTG CGCCACCGAG TCGCAGTATA GGACCGTGCT GGAGTACGTG
GAATCGGGTC TGGCCGAAGG CGCCCGCGCC ACCACCGGCG GACGTGCGGC GTCACTGGGA
GGCGGACTCG AAGGCGGGTA CTTCGTGGAA CCGACCGTGT TCGCCGACGT CACCCCGGAG
ATGCGGATCT CCCGGGAGGA GATTTTCGGC CCGGTCATCT GCATCCTGAA GTACAACGAC
GCCGGCGGCA GCGCTGATGA GGCCGTGGAG TTGGCGAACA ACACCGAGTT CGGTCTGGGC
GGCCTGGTGT TCGGCGCAGA CCCGGAAGCG GCGCTGGCCG TGGCGGACCG GATGGACACC
GGCTCGGTAG GCATCAACTT TTTCGCCTCC AACCACGCCG CGCCCTTCGG CGGACGTCAC
GATTCCGGCC TCGGCACCGA GTACGGCGTC GAGGGCCTCA ACGCCTACCT GAGCTACAAA
TCCATTCACC GGAAGGTGTA G
 
Protein sequence
MTAATHDAPA VAARAGLKLP YTHVGDIFVD GAWTPARGTG RNPVTDPATG EVWGSVPDGS 
PEDVDAAVGS ARRAFDDGMW PRLTPSERAA YLLRIAEEVE KRAEELSLTN TRENGSPVSE
SAGAAANAAG IFRYFATLAG YLEREDVRAF PQGGGESVVR REPIGVCALI APWNFPINLV
VIKLAPALLA GCTVVIKPAS PTPLSLRVII DAVAAAGVPA GVVNLVTGSG RLGDSLVKHP
GVDKVAFTGS TPVGRKIAAA CGELLRPVTL ELGGKSSAIV LPDADLDAMS KVLIRSSMRN
TGQTCYISTR ILAPASRYEE VVDMVTSTIA AGKQGDPLDP DTVFGPCATE SQYRTVLEYV
ESGLAEGARA TTGGRAASLG GGLEGGYFVE PTVFADVTPE MRISREEIFG PVICILKYND
AGGSADEAVE LANNTEFGLG GLVFGADPEA ALAVADRMDT GSVGINFFAS NHAAPFGGRH
DSGLGTEYGV EGLNAYLSYK SIHRKV