Gene BBta_5141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_5141 
Symbol 
ID5153364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp5374176 
End bp5375336 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content66% 
IMG OID640559917 
Productputative alcohol dehydrogenase 
Protein accessionYP_001241044 
Protein GI148256459 
COG category[C] Energy production and conversion 
COG ID[COG1454] Alcohol dehydrogenase, class IV 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.237581 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.0395804 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGACTGC ATCAATATCC TCCGATGGAG CAGGTCATTT TCGGCAAGCC GGCGGGCCTC 
GCGTTGCGCG AGGAGGCCGA GCGGAAGGGC GCGCAGCGTG TCTTCCTGAT CGCAAGCCGG
ACGCTGAACA CGACGACGGA CGAGATCGAC AAGATCAGGT CGGCGCTGGG CGAACGCTAT
GCGGGCACGT TCGATCAGGT GCCGCAGCAC ACCACGCGGG ATTCGGTGGT CGAGGCCGCC
GGCCATGCCG CGCAGGCGAA GGCCGATCTC GTCGTCGCGA TCGGCGGCGG CTCGGTGGTC
GACGCCGCCA AGATCGTGCT GATGTGCCTC GAGCACAGCA TCACGGATGC GTCCGGGCTC
GATGGTTTCG AGCTGGTCTC GACGCCCCAA GGACCGCGTC CGGGGCCATT CCGCAATCCC
AAGGTGAGGA TGATCGCCAT CCCGAGCACG CTGTCCGGCG GAGAGTACAA TGCCGGCACG
CTGGTGACCG ATACACGTCG CAAGCTCAAG CAGATCTTCG TGCATCCGCT GATGATGCCG
ATCTCGATCA TTCTGGATCC GGCCATCACC GTGCACACGC CGAGAACCCT GTGGCTCGGC
TCGGGCACGC GAGCGATGGA TCACGGCATC GAGGCGGTTT GCTCGCCCCG CGGCAACCCG
CTGGTCGAGA GCGTCTGCCT GCGCGGTCTC CGCTATCTCT ATGATGGTCT GCTGGCCTAT
GCCGACAACG CGGACAGTCT CGAAGCCCGC CAGATGTGCC AGCTCGGATC CTGGCTGTCG
GCCTTCGGGC TGCAATGCCG CGTGCCGATG GGCGCCAGCC ACGCGATCGG GCACGTCCTC
GGCGGGACCT GCGACGTGCC GCATTATCTG TGCACGGCAG TGATGATGCC GAGCGTGCTG
AAATACAACA AACCGGCGAC CGGCGCCGCG CAGAAGCTGT TGGCCGAGGC GTGGCACGAA
CCGGAGGCCG ACGCCAGCGA GGTCTTCGCA CGCTTCATCG CCCGCCTCGG ATTGCCGACG
CGGCTGGCCG ATGTCGGCGT CACGGAGGAT CGCTTCGGCC TGATCGGAAA CAACGCGATG
CTCTCGGTCT TTACGCCCGC CAACCCGCGG CCGATCAAGG GGCCGGACGA CGTCGTCGAG
ATTCTTCGGC TGGCGGCATA G
 
Protein sequence
MGLHQYPPME QVIFGKPAGL ALREEAERKG AQRVFLIASR TLNTTTDEID KIRSALGERY 
AGTFDQVPQH TTRDSVVEAA GHAAQAKADL VVAIGGGSVV DAAKIVLMCL EHSITDASGL
DGFELVSTPQ GPRPGPFRNP KVRMIAIPST LSGGEYNAGT LVTDTRRKLK QIFVHPLMMP
ISIILDPAIT VHTPRTLWLG SGTRAMDHGI EAVCSPRGNP LVESVCLRGL RYLYDGLLAY
ADNADSLEAR QMCQLGSWLS AFGLQCRVPM GASHAIGHVL GGTCDVPHYL CTAVMMPSVL
KYNKPATGAA QKLLAEAWHE PEADASEVFA RFIARLGLPT RLADVGVTED RFGLIGNNAM
LSVFTPANPR PIKGPDDVVE ILRLAA