Gene BBta_1974 details

Gene Information

Locus tag: BBta_1974
Symbol:
ID: 5152121
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 2039164
End bp: 2040624
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 67%
IMG OID: 640556915
Product: NAD-dependent aldehyde dehydrogenase
Protein accession: YP_001238071
Protein GI: 148253486
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
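
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that contributes no amino acid; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the gene record.
start_bp = 2039164
end_bp = 2040624

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue to the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1461 486
```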


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.489378
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGCCA ATGTGAAACT GCCTGCCGCA AAGGGTGTCT TCATCGACAA CAAATGGCAG 
CCCGCGCAAT CCGGTCGTAC CATTGCGATG CTGGCACCGG CGACCGGACA GGTGATTGCC
TCGATCGCGG CTGGCGATGC GGCGGACATT GATCTCGCCG TCGCGGCGGC GCGCCGCGCG
CTCGAAGGAT CGTGGGGGCG GCTTGCTGCG GTCGAGCGCG GGCGGCTGCT GTCGAAGCTG
GGCCGGCTCG TCGAGGACCA TGCGGAGGAG CTGGCGAAGC TCGAGGCCGC CGACACCGGC
AAGCCGATGA AGCAGGCCAG AGCGGACGTG GTCGCGGTGG CGCGCTACTT CGAATATTAT
GGCGGCGCCG CCGACAAGGT TCATGGCGAC ACCATTCCAT TCCTTGACGG CTTCTTCGTC
ACGACGGTCT ATGAGCCGCT CGGCGTCACC GGCCACATCA TTCCCTGGAA CTATCCGGGA
CAGATGTTCG GCCGCACCGT TGCGCCGGCG CTGGCGATGG GCAACGCCAC GGTGATCAAG
CCCGCCGAAG AGGCCTGCCT GGTGCCGCTG CGGCTCGCGG AGTTGGCCGC GGAGGCTGGC
TTCCCGGCCG GCGCGATCAA TGTCGTGCCG GGTCTCGGCG AAGAGGCTGG CGCAGCCTTG
TCGGCGCATG ATGGCATCGA CTTCATCTCG TTCACCGGCA GCCCGGAGGT CGGCACTCTG
GTACAGACGG CGGCCGCCAA GAACCATATC GGCTGCACGC TGGAACTCGG CGGCAAGTCG
CCGCAGATCG TGTTCGCTGA CGCCGATCTC GAAGCGGCGC TGACCTCCGT TGCTGCGGCC
ATCGTGCAGA ATGCCGGGCA GACCTGCTCG GCCGGCTCGC GCGTGCTTGT CGAGCGCACG
ATCTGGGATC GCTTCCTGGC CGATCTCACG CTGCGGTTCG AAAAGATCAC GGCTGGCACG
CCGGAGATGG ATCTCGATCT CGGCCCGGTC ATCAGTGCGG TCCAGAAGAC GCGCATCGAG
AGCATGCTGG CGCGCGCGGA AAGCGGCGGC GCCAGGCGCG TGGCCTCGGG CAGGATCGCC
GAGGGCGTCC CGAGCGAGGG CTTCTACGTT GCTCCGGCGC TTTACCAGCA CGTCGCGCGC
GACTCCGAAC TGGCACGCGA GGAGGTGTTC GGTCCGGTGC TGGCAGCGAT GCCATTCGAT
GACGAGGCCG ACGCGATCAG GCTCGCCAAT GCCACCGATT TCGGCCTGGT CGCCGGCGTC
TGGTCGGGCG ACGGCTCGCG CGCCATGCGC GTGGCGCGCA AGGTTCGGGT CGGTCAGATG
TTCGTCAATG GCTATGGCGC CGGCGGCGGC ATCGAGCTGC CGTTCGGCGG CATGAAGCGC
TCCGGCCACG GCCGCGAGAA GGGTTTTGAG GCGCTGTACG AGCTTGCGGC GATGAAGACC
CTGATCGTCA AGCACGGTTA G
 
Protein sequence
MNANVKLPAA KGVFIDNKWQ PAQSGRTIAM LAPATGQVIA SIAAGDAADI DLAVAAARRA 
LEGSWGRLAA VERGRLLSKL GRLVEDHAEE LAKLEAADTG KPMKQARADV VAVARYFEYY
GGAADKVHGD TIPFLDGFFV TTVYEPLGVT GHIIPWNYPG QMFGRTVAPA LAMGNATVIK
PAEEACLVPL RLAELAAEAG FPAGAINVVP GLGEEAGAAL SAHDGIDFIS FTGSPEVGTL
VQTAAAKNHI GCTLELGGKS PQIVFADADL EAALTSVAAA IVQNAGQTCS AGSRVLVERT
IWDRFLADLT LRFEKITAGT PEMDLDLGPV ISAVQKTRIE SMLARAESGG ARRVASGRIA
EGVPSEGFYV APALYQHVAR DSELAREEVF GPVLAAMPFD DEADAIRLAN ATDFGLVAGV
WSGDGSRAMR VARKVRVGQM FVNGYGAGGG IELPFGGMKR SGHGREKGFE ALYELAAMKT
LIVKHG
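
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that relationship, not the database's own pipeline. It builds the codon table by hand, relying on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first and last blocks of the gene sequence are used as input.

```python
from itertools import product

# Codon assignments in NCBI "TCAG" order (third base varies fastest).
# Table 11 shares this codon-to-amino-acid mapping with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0; whitespace is ignored, '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 60 bases of the gene sequence, as printed in the record.
first_block = "ATGAATGCCA ATGTGAAACT GCCTGCCGCA AAGGGTGTCT TCATCGACAA CAAATGGCAG"
# Final 21 bases of the gene sequence, ending in the stop codon.
last_block = "CTGATCGTCA AGCACGGTTA G"

print(translate(first_block))  # MNANVKLPAAKGVFIDNKWQ
print(translate(last_block))   # LIVKHG*
```

The two outputs match the opening and closing residues of the protein sequence listed above, with `*` standing for the terminal TAG stop codon.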