Gene Arth_1856 details


Gene Information

Locus tag: Arth_1856
Symbol:
ID: 4445615
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2088060
End bp: 2089577
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 66%
IMG OID: 639689671
Product: aldehyde dehydrogenase
Protein accession: YP_831343
Protein GI: 116670410
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
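
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record, but both reproduce the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2088060, 2089577)
print(length)                  # 1518
print(protein_length(length))  # 505
```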


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.267986
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAACCA AGAGCTACAC GCTCGACGAT GTCGAGCCCA CCTACCGCCC GATGATCATC 
GACGGCGAAG ACGCCCAGGC ACAAAGCGGC CAGACCTTCA CCCGACTGAG CCCGGCGCAT
GAAGTCGAAG TCACCTCCTT CCCGGAGGGC GGGCATGAAG ACGTAAACCG TGCCGTCACT
GCCGCCAGAA AAGCCCTCGA CCGCGGCTGG CGCCAGTCCA CCGGATCCGA ACGGTCCAAG
CTGCTCCTGA AGGTCGCCGA TCTCGTCCGC CGCGACGCCG AAGCATTGTC CCTGGCCGAA
ACGCTGGAGA CCGGTAAACC GATCACCCAG TCACGCAACG AAGTCTCCGG CACCGCCGAG
CTATGGGAGT ATGCGGCCAG TCTGGCCCGC AACACCCACG GCGATGCCCA CAACGCCCTC
GGCCAGGACA CCCTGGCAAT GGTCGTCCAC GAGCCCATCG GCGTCGTCGG CATGATCACG
CCCTGGAACT TCCCGCTGCT GATCATCAGC CAAAAACTGC CGTTCGCCCT CGCCGCCGGC
AACACCGCCG TGATCAAACC CAGCGAAAGC ACCTCAGCGA CCACCGTCAT GCTGGGCCAG
CTCATCCGCG AAGCGGGTTT CCCGGCCGGC GTCGTCAACA TCGTCACCGG CGGCCGCGTC
GTGGGCGCGG CCATTGCCGA ACACCCCGGA ATCGACATGA TCAGCTTCAC CGGCTCCACC
GGCGTGGGCA AAGGCATAGC CTCCGCCGCC GGCCGGGACC TCAAAAAGGT CGAACTCGAA
CTCGGCGGGA AAAACCCGCA AATCATCACC GCCAACGCCG ATTTCACCGC CGCGGTCGAC
GCCGGTGTCT TCGGCGGCTA CTTCAACGTC GGCCAGTGCT GCAACTCCGG CAGCCGCCTC
ATCGTGCACC GCTCCATCGC CGACGAATTC GCATCCGCCG TCGTCGAGCG TGCCCAGCAC
ATGCGCGTCG GTGACCCGCT CAAGGCCGAG ACCCTCGTCG GTTCCCTTGT CAACGACGCC
CAGCTCGCCG TCGTCGAACG CTACGTGGCC GAAGGCCGCG ACGCCGGCGC GCACCTGCTC
ACCGGCGGTG ACCGGCTCGA TACCGGCCTC AACGGCCGCT TCTACCAGCC CACCGTCTTC
ACCGACGTCA CCGCCCATAT GACCATTGCC ACCGACGAGA TCTTCGGGCC CGTCCTGTCC
GTGCTGCCCT ACGACACCCT GGAGGAGGCC ATTGGGATCG CCAACTCCAC CTCGTTCGGC
CTGTCCGCCG GCATCTGGAG CAACGACATC AACGAAGCCC TGACCGCCGC CCGCGACCTG
CGCGCCGGGA CCGTCTGGGT CAACCGCTGG ATGGACGGAT ACCCCGAAGT GCCGTTTGGC
GGCTACGGAC ACAGCGGCAT CGGCCGCGAA CTCGGCCGCC AGGCCCTCGC CGAGTTCAGC
GAACTGAAAA CCATCCAACT CCAGGTCGGC ATCCGTGAAA ACCGCTGGGT CGACGCCCCC
GACGCACCCC GCCGCTAA
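
A GC content figure such as the one listed for this gene can be recomputed from the sequence text. This is a minimal sketch, assuming the sequence is pasted as shown here, in space-separated blocks of ten bases across several lines. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the short input is a toy example rather than the gene itself.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input: 6 of the 8 bases are G or C.
print(gc_percent("ATGC GGCC"))  # 75.0
```

Passing the full gene sequence to `gc_percent` and rounding the result allows a comparison with the GC content field in the Gene Information section.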
 
Protein sequence
METKSYTLDD VEPTYRPMII DGEDAQAQSG QTFTRLSPAH EVEVTSFPEG GHEDVNRAVT 
AARKALDRGW RQSTGSERSK LLLKVADLVR RDAEALSLAE TLETGKPITQ SRNEVSGTAE
LWEYAASLAR NTHGDAHNAL GQDTLAMVVH EPIGVVGMIT PWNFPLLIIS QKLPFALAAG
NTAVIKPSES TSATTVMLGQ LIREAGFPAG VVNIVTGGRV VGAAIAEHPG IDMISFTGST
GVGKGIASAA GRDLKKVELE LGGKNPQIIT ANADFTAAVD AGVFGGYFNV GQCCNSGSRL
IVHRSIADEF ASAVVERAQH MRVGDPLKAE TLVGSLVNDA QLAVVERYVA EGRDAGAHLL
TGGDRLDTGL NGRFYQPTVF TDVTAHMTIA TDEIFGPVLS VLPYDTLEEA IGIANSTSFG
LSAGIWSNDI NEALTAARDL RAGTVWVNRW MDGYPEVPFG GYGHSGIGRE LGRQALAEFS
ELKTIQLQVG IRENRWVDAP DAPRR
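
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is a plain-Python illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons), and it assumes the input starts at the first base of the reading frame. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments are shared by
# the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(raw: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(raw.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGGAAACCA AGAGCTACAC GCTCGACGAT"))  # METKSYTLDD
# The final blocks end in the TAA stop codon.
print(translate("GACGCACCCC GCCGCTAA"))  # DAPRR
```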