Gene BBta_4041 details

Gene Information

Locus tag: BBta_4041
Symbol:
ID: 5152499
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 4240241
End bp: 4241341
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 69%
IMG OID: 640558874
Product: hypothetical protein
Protein accession: YP_001240013
Protein GI: 148255428
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0845] Membrane-fusion protein
TIGRFAM ID: [TIGR01730] RND family efflux transporter, MFP subunit
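
The gene and protein lengths above follow from the start and end coordinates. A minimal Python sketch of that arithmetic is shown here; it assumes the coordinates are 1-based and inclusive and that the CDS length includes the stop codon, which is consistent with the values in this record but is not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4240241, 4241341)
print(length)                  # 1101
print(protein_length(length))  # 366
```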


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.0513022
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACATGC TGACGCCGGA TCGATCCAGC GCGATTGCCG CCAACCATCC ACCTGCCCCG 
CCGGCCGCCG ACCGCGGCCG GCGTCGCTTT CCCTGGCCTG GCCTGCTCAC GATGCTGCTG
GCTCTCGCCG CGACGGCCGC GCTGGCGGCA TGGTGGCTGC GCGGACCTGC GATCACGGTC
GTTCGCATCA CGCGCGGCAC GGCCGCCGAG ATCGTCTATG CCACCGGAGC GGTCGAGCCG
GAGACATGGT CGCGCACCAC GCCGCTGGTG CGCGGCCGCA TCGTCGAGCG CTGTCGCTGC
GAGGGCAAGG CGGTCAAGGC TGGTGACCTG CTGGCGCGAC TCGACGACAA GGAGGCGCTG
GCCACCCTGA ACGATCTGCG CGCGCAGGAG GAGTTTCAGC GGCACGAATT CGAGCGGCAG
TCACAGCTGC TGGCGCGCGG CGCGTCCACG TCGCAGGCCT ATCAGCGCTC CGAAAGCGAC
CTCGCCCGCA TCCGCGCCCA GATCGCGGCG CAGAGCCAGC GCCTCGATTA TTTCAGGCTG
GTGTCGCCGA TGGATGGCGT CGTGCTCAAG GAGGATGGCG AGGTCGGCGA CATGGTCGAC
CCCGGCACCA TCCTCTATCG CGTCGGCCTG GAGAAGCCGC TCTGGGTGAT CGCCGACGTC
AATGAAGAGG ACATCCCCCG CGTCCAGGTC GGGCAGAAGG CGCTGCTGCG CAGCGATGCG
TTCGGCAGCC AGGCGCTGCC CGGAACGGTG AAGCAGATCA CGCCGGCGGG CGATCCCGTT
GCCAAGACCT TCCGCGTCCG GATCGGCCTG CCCGACGACA CGCCGCTGCG TGTCGGCATG
AGCGTCGAGG CCAACATCGT CAGCCGCGAG AAGGCCGATG TGCTGCTCGT GCCGGCCAAT
GCCGTCGTGA ATAACGGCCT GCTGGTCATT GACGACGGCC GTGTGCACCG TCGGAAGGTC
CGCGTCGGCA TCAGAGGCTC CGCGGCCGTG GAGATCCTCG AAGGCGCGCA GCAGGGCGAG
ATGGTGGCAG CCCCCGCGAC TGCCGACATC AAAGACGGCG AACGGGTCCG GCCGATGGAG
ATCGAGACTG CAGCGCCATG A
 
Protein sequence
MDMLTPDRSS AIAANHPPAP PAADRGRRRF PWPGLLTMLL ALAATAALAA WWLRGPAITV 
VRITRGTAAE IVYATGAVEP ETWSRTTPLV RGRIVERCRC EGKAVKAGDL LARLDDKEAL
ATLNDLRAQE EFQRHEFERQ SQLLARGAST SQAYQRSESD LARIRAQIAA QSQRLDYFRL
VSPMDGVVLK EDGEVGDMVD PGTILYRVGL EKPLWVIADV NEEDIPRVQV GQKALLRSDA
FGSQALPGTV KQITPAGDPV AKTFRVRIGL PDDTPLRVGM SVEANIVSRE KADVLLVPAN
AVVNNGLLVI DDGRVHRRKV RVGIRGSAAV EILEGAQQGE MVAAPATADI KDGERVRPME
IETAAP
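
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in Python, not the database's own method. It builds the codon table from the NCBI-ordered amino acid string, which is the same for translation table 11 as for the standard code; table 11 differs only in its permitted start codons, and that does not matter here because this gene starts with ATG. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third position fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence in this record.
first_line = "ATGGACATGC TGACGCCGGA TCGATCCAGC GCGATTGCCG CCAACCATCC ACCTGCCCCG"
last_line = "ATCGAGACTG CAGCGCCATG A"

print(translate(first_line))  # MDMLTPDRSSAIAANHPPAP
print(translate(last_line))   # IETAAP
```

Each of the preceding 18 lines holds 60 bases, so the last line starts on a codon boundary and can be translated on its own. Passing the whole gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) checks the full record in the same way.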