Gene Arth_3744 details

Gene Information

Locus tag: Arth_3744
Symbol:
ID: 4443757
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 4218992
End bp: 4220515
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 67%
IMG OID: 639691568
Product: aldehyde dehydrogenase (acceptor)
Protein accession: YP_833219
Protein GI: 116672286
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
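
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4218992, 4220515)
print(length)                     # 1524
print(protein_length_aa(length))  # 507
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.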


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCACCG CAACGCTATT CATCGGCGGC ACCTGGAGTG CAGCGTCCGA CGGCGGCACC 
CGTGAGATCC GCTGTCCGGC CGACGGCGAA CTGGCCGGCG TCGTCTCCGA GGCCAACGCC
TACGACGCTG TCCGCGCCGT GGCAGCCGCC AGGGCGGCGT TCGACGGCGG CGAATGGGCA
GGTGTCCCGG CCTTGGAACG CGGGTCCTTC CTGCTGCGCG TCGCGGCGAG GTTGCGGGAG
CGCAAGGACG AGTTTGCCCG CGCCGAAACG CTGGACACCG GCAAGCGCCT GGTGGAGAGC
GAAATCGACA TGGACGACAT CGCCAACTGC TTCGAGTACT TCGGCAAAAT CGCGGGCCAG
GACTCCGGAC GGCTGGTCGA CGCGGGCAGC AGCACCGTGG TCAGCCGGAT CGAATACGAA
CCCGTAGGCG TCTGTGCGCT GATCGCCCCC TGGAACTACC CGCTGCTGCA GGCCGCCTGG
AAGATCGCCC CGGCGCTCGC GGCAGGCTGC AGCTTCGTGC TCAAGCCGAG CGAACTGACG
CCCCACACCG CCATCCTGAT GATGCAGGTC CTGGAAGAAC TGGGACTGCC GTGCGGGGTC
GCCAACCTTG TCCTGGGCGA CGGGAAAACC GTGGGTTCAG TGCTTTCCGG AAACCCGGAT
GTTGACCTCG TCTCCTTCAC GGGCGGGCTC GAAACCGGCA AGACCATCGC AGCTTCGGCC
GCCGCCACGG TCAAGAAGGT AGCGCTGGAG CTGGGCGGCA AGAACCCCAA CATCATCTTC
GCTGACGCCG ACTTCGACGC CGCCCTGGAC AACGCGCTCA ACGCCGCCTT CGTGCACTCC
GGCCAGGTCT GCTCCGCCGG CTCGCGCCTG ATTGTCGAGG AATCCATTGC CGAACGGTTC
GTGGACGAGC TGGTCCGCCG TGCGGAGCAG ATCCGCCTGG GCGGCCCCTT CGATCCCGAC
GCCGAGACAG GGCCGCTGAT CTCCGCAGCC CACCGCGACA AGGTGACCGC CTACGTGGAC
AAGGGCGTCG CCGAGGGTGC ACGCCTGCGC TGCGGCGGTA CGTGGGGCGA CGGCGAGCTC
AAAAAGGGCT ACTACTACCT GCCCACCGTC CTGGACCAGG TCACCAGCGG CATGTCCGTG
CTGAAGGATG AGGCTTTCGG TCCGGTGGTC ACCGTGGAAA CCTTCAGCAC CGAAGAAGAA
GCCGTGCGGC TGGGCAATGA CACCCACTAC GGCCTGGCCG GCGCCGTCTG GAGCCAGAAT
GCCGGCAAGA GCCAACGCGT GGCCCGCAAG CTGCGCCACG GCACCATCTG GATCAACGAT
TTCCACCCCT ACCTGCCGCA GGCGGAATGG GGCGGCTTCG GCCAGTCCGG CGTCGGGCGC
GAACTCGGAC CCACGGGCCT GGCTGAATAC CAGGAAGCCA AGCACGTCTA TCACAACATC
GATCCGCAGG TGACGGGCTG GTTCGCGGAC CCTGGCACAG CCGGGAACAC AGCCGGGAAC
ACAGTCACCG CAGAGGGGAA CTAA
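
The GC content field can be recomputed from a sequence printed in this layout. The sketch below is an illustration only: `gc_percent` is a hypothetical helper, it assumes the sequence contains only A, C, G and T plus whitespace between the 10-base blocks, and it is applied here to the first row of the gene sequence rather than the whole gene.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring the whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above, copied with its block spacing.
first_row = "ATGACCACCG CAACGCTATT CATCGGCGGC ACCTGGAGTG CAGCGTCCGA CGGCGGCACC"
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content field of the record.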
 
Protein sequence
MTTATLFIGG TWSAASDGGT REIRCPADGE LAGVVSEANA YDAVRAVAAA RAAFDGGEWA 
GVPALERGSF LLRVAARLRE RKDEFARAET LDTGKRLVES EIDMDDIANC FEYFGKIAGQ
DSGRLVDAGS STVVSRIEYE PVGVCALIAP WNYPLLQAAW KIAPALAAGC SFVLKPSELT
PHTAILMMQV LEELGLPCGV ANLVLGDGKT VGSVLSGNPD VDLVSFTGGL ETGKTIAASA
AATVKKVALE LGGKNPNIIF ADADFDAALD NALNAAFVHS GQVCSAGSRL IVEESIAERF
VDELVRRAEQ IRLGGPFDPD AETGPLISAA HRDKVTAYVD KGVAEGARLR CGGTWGDGEL
KKGYYYLPTV LDQVTSGMSV LKDEAFGPVV TVETFSTEEE AVRLGNDTHY GLAGAVWSQN
AGKSQRVARK LRHGTIWIND FHPYLPQAEW GGFGQSGVGR ELGPTGLAEY QEAKHVYHNI
DPQVTGWFAD PGTAGNTAGN TVTAEGN
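
The correspondence between the gene sequence and the protein sequence can be spot-checked by translating codons directly. This is a minimal sketch, not the database's own method: `translate` is a hypothetical helper, and it uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in which start codons it permits, which does not matter here because this gene starts with ATG).

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and the final row of the gene sequence above.
print(translate("ATGACCACCG CAACGCTATT CATCGGCGGC"))  # MTTATLFIGG
print(translate("ACAGTCACCG CAGAGGGGAA CTAA"))        # TVTAEGN
```

Both fragments match the first block and the last seven residues of the protein sequence listed above.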