Gene Arth_0036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0036 
Symbol 
ID4447506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp42175 
End bp43791 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content69% 
IMG OID639687830 
Productaldehyde dehydrogenase 
Protein accessionYP_829537 
Protein GI116668604 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACTCA CCGGACATTC CCTGATCGCC GGGCAGGCCG TCGCCGGCGA AGGCAAGACT 
GCCTTTGGCT TCAACCCCGC CAGCAACGAA CAGCTTGAGC CCGCCTACAC CCTGCTCACC
GAGGACCAGC TCAAAGCCGC CACCGCCGCG GCCGGCGAAG CCTACCCCTC CTTCAGCACA
CTCGATCCCG AAACCCACGC GAGCTTCCTG GAAGCCATCG CGGACAACAT CGAGGCCATC
GGCGACGACC TGATCGTCCG CGCCGGACAG GAGACCGGAC TGCCCGCAGC CCGACTACAA
GGTGAACGTG CCCGCACCAC GGGGCAGCTC CGGCTGTTCG CGAACGTTGT CCGCCAGGGC
GATTTCCGCG GCGTCCGCAT CGACCCGGCC CTGCCGGAAC GCACGCCGCT CCCCCGCGCC
GACATCCGCC AGCGCCAGAT CCCGCTGGGA CCCGTGGCGG TGTTCGGTGC CAGCAACTTC
CCGCTGGCCT TCTCGACGGC GGGCGGAGAC ACCGCTTCGG CCCTCGCCGC CGGCTGTCCC
GTAGTCTTCA AAGCCCACAA CGCCCACCCC GGCACGGGCG AACTTGTCGG CCAGGCCATC
GTCAAAGCCG TCCGCGATTC CGGGCTCCAC CCTGGCGTGT TCTCGCTGAT CTACGGCCCC
GGCAGCAGCA TCGGCCAGGC CCTTGTGGCG GACCCGGCCA TCAAGGCTGT GGGCTTCACC
GGCTCGCAGA GCGCCGGCAT TGCGCTGATG CGCACCGCAG CAGCCCGCCC GGAGCCCATC
CCGGTCTACG CGGAAATGTC CTCGCTCAAC CCGGTCTTCG TGTTCCCCGG CGCCCTCACC
GGCTCCGCCG AGCAGATCGA CGCACTGGCG CAGCAGTACG TCACCGCCGT CACCGGCAGC
TCCGGACAGC TCTGCACCTC CCCCGGCCTG CTGTTCGCCC CCGCAGGTGA GCTGGGCGAC
AAACTGGCTG CCGCCGTCGG ACGCGCAGTA TCCGCCTGCG CCGGCCAGAC CATGCTGACC
GCCGGCATCG CCGGTTCGTG GAACAGCGGG GCCGAGACGC TCGGCTCAGC CGACAACGTG
ACCGTCGTCG GCCAGGGAAC CGCCGGACCC ACCGAAAACG CACCGGCCCC CACCATCTTC
GGGACCGACA TCGCCGACTT CGTCAGCAAC CATGTCCTGC ACGCCGAGAT CTTCGGCGCG
GCCAGCCTGG TGATCCGCTA CTCCACCGCC GGGGAACTGA TCGAGGCCAC CAACCGGCTC
GAGGGGCAAC TCACCGCATC CCTGCAGCTC ACCGAAGAGG ACTACCCGAC GGCGGCGCAA
CTGCTGCCCG CCCTGGAACA GAAGGTGGGG CGGATCATCG TCAACGGTTG GCCCACCGGC
GTCGAAGTGG GTCACGCCAT GGTCCATGGC GGCCCCTTCC CGGCGACGTC GGACACGCGG
ACGACGTCGG TCGGCACCCT GGCGATCAAC CGATTCCTCC GGCCGGTCGC CTACCAGAAC
CTGCCCCAGG AACTGCTCCC GGCTCCGCTG CAGGACGCCA ACCCGTGGCA CCTGAACCGC
CGGATCGACG GCACGGTCGA AGCCGCAGCC GACGCAGAAG ATAAGGTCAA CGCATGA
 
Protein sequence
MTLTGHSLIA GQAVAGEGKT AFGFNPASNE QLEPAYTLLT EDQLKAATAA AGEAYPSFST 
LDPETHASFL EAIADNIEAI GDDLIVRAGQ ETGLPAARLQ GERARTTGQL RLFANVVRQG
DFRGVRIDPA LPERTPLPRA DIRQRQIPLG PVAVFGASNF PLAFSTAGGD TASALAAGCP
VVFKAHNAHP GTGELVGQAI VKAVRDSGLH PGVFSLIYGP GSSIGQALVA DPAIKAVGFT
GSQSAGIALM RTAAARPEPI PVYAEMSSLN PVFVFPGALT GSAEQIDALA QQYVTAVTGS
SGQLCTSPGL LFAPAGELGD KLAAAVGRAV SACAGQTMLT AGIAGSWNSG AETLGSADNV
TVVGQGTAGP TENAPAPTIF GTDIADFVSN HVLHAEIFGA ASLVIRYSTA GELIEATNRL
EGQLTASLQL TEEDYPTAAQ LLPALEQKVG RIIVNGWPTG VEVGHAMVHG GPFPATSDTR
TTSVGTLAIN RFLRPVAYQN LPQELLPAPL QDANPWHLNR RIDGTVEAAA DAEDKVNA