Gene Arth_3095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3095 
Symbol 
ID4444328 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3468547 
End bp3469977 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content66% 
IMG OID639690922 
Productaldehyde dehydrogenase 
Protein accessionYP_832574 
Protein GI116671641 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR03374] 1-pyrroline dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0472146 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTCCAAA CCTTGCAGAA CTTCATAAAC GGCGAGTTCG TCACGCCCGC CGGCACCGGA 
CTGCTGGACA TCGTGAACCC CGCCAACGGT GAGGTGGTGG CGAAGTCGCC CATCTCCGTG
CAGGCCGACG TCGACGCCGC CATGACGGCA GCCAGCGAGG CGTTCAAATC CTGGAAGCAC
GTCACCCCGG GCCAGAGGCA GCTGATGCTC CTCAAGCTTG CCGACGCCGT CGAGGCCAAC
AGCGACGAAC TCGTTGAAGC CCAGCACCGC AACACCGGCC AGGTCCGCAG CCTCATCGCC
TCCGAGGAAG TCGCCGCCGG GGCAGACCAG CTCCGCTTCT TCGCCGGCGC CGCCCGCATC
ATGGAGGGCA AGTCCGCCGG CGAGTACTTC GAAGGCCACA CCTCCTACGT GCGCCGCGAA
CCCATCGGCG TCGTGGCCCA GGTTGCCCCC TGGAACTATC CGTTCCTGAT GGCCATCTGG
AAGATCGGCC CCGCGCTCGC CGCCGGCAAC ACCGTGGTGC TCAAGCCCTC GGACACCACG
CCGGAATCCA CCCTGGTCCT GGCCCGCCTG GCCGGTGGGA TCTTCCCGGC AGGCGTCCTG
AACGTCGTCC TCGGCACCGG CGAAACCGGC GCCATGATGG TGGAGCACAA GGTCCCCGGC
CTCGTTTCCA TCACCGGATC CGTCCGTGCG GGCATCGCAG TGGCTTCGGG AGCGGCCAAG
GGCCTCAAGC GGGCGCACCT GGAGCTCGGC GGCAAAGCCC CGGCCATCGT CTTCAAGGAT
GCCGACATCA AGAAGAGTGC AGCGGCCATC GCCGAGTTCG CCTTCTTCAA CGCGGGCCAG
GACTGCACGG CCATCACCCG GGTGCTGGTC GAGGACTCAG TCCACGACGA CGTCGTGGCA
GCCATGGTGG AACACACCAA GACCCTGCAC ACCGGCTCGC AGAACGACGA AGACAACTAC
TTCGGCCCGC TGAACAACGT GAACCACTTC AACGCCGTGA CGTCTGTGGT GGAGCACCTG
CCGGAGAACT GCAAGATTGT CACCGGCGGC CACCGCGCGG GGGAGAAGGG CTTCTTCTTC
GAACCCACCA TCATCACCGG GGCCAAGCAG ACCGATGACG TCGTCCAGAA AGAAACCTTC
GGGCCCGTCA TTACCGTCCA GAAGTTCAGC ACCGAGGCGG AAGCCGTGGA GCTGGCCAAC
GACGTCGACT ACGCCCTGGC CTCCAGCGTC TGGACCACGG ACCACGGCAC GGCCATGCGC
GTCAGCCGCG ACCTGGACTT CGGCGCGGTG TGGATCAACA CCCACATCCT GCTGACCGCG
GAAATGCCGC ACGGCGGCTT CAAACAGTCC GGCTACGGCA AGGACCTCTC CATGTACGGC
GTCGAGGACT ACACGCGCAT CAAGCACGTG ATGAGCGCAC TCGACGCGTA A
 
Protein sequence
MVQTLQNFIN GEFVTPAGTG LLDIVNPANG EVVAKSPISV QADVDAAMTA ASEAFKSWKH 
VTPGQRQLML LKLADAVEAN SDELVEAQHR NTGQVRSLIA SEEVAAGADQ LRFFAGAARI
MEGKSAGEYF EGHTSYVRRE PIGVVAQVAP WNYPFLMAIW KIGPALAAGN TVVLKPSDTT
PESTLVLARL AGGIFPAGVL NVVLGTGETG AMMVEHKVPG LVSITGSVRA GIAVASGAAK
GLKRAHLELG GKAPAIVFKD ADIKKSAAAI AEFAFFNAGQ DCTAITRVLV EDSVHDDVVA
AMVEHTKTLH TGSQNDEDNY FGPLNNVNHF NAVTSVVEHL PENCKIVTGG HRAGEKGFFF
EPTIITGAKQ TDDVVQKETF GPVITVQKFS TEAEAVELAN DVDYALASSV WTTDHGTAMR
VSRDLDFGAV WINTHILLTA EMPHGGFKQS GYGKDLSMYG VEDYTRIKHV MSALDA