Gene BBta_3743 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_3743 
Symbol 
ID5154142 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp3919904 
End bp3920902 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content66% 
IMG OID640558581 
Productputative glycerophosphoryl diester phosphodiesterase 
Protein accessionYP_001239727 
Protein GI148255142 
COG category[C] Energy production and conversion 
COG ID[COG0584] Glycerophosphoryl diester phosphodiesterase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.166139 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.0832321 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGAT TTGGAAAGTT GAGCGGGCTG GTGCTGCTGA TCTGTGCCGG CTTCATGTTC 
GTCAACAATA CCAATCTCTG GGTGGCGAAG AGAGCAGGGC AGCCGCTGCT GCTGGCGCAT
CGCGGGCTGT CGCAGCAATT CGATCGCACC GGACTGACTG GCGACACCTG CACCGCCGCG
CGCATGCTGC CGCCGACGCA CGACTATCTC GAGAACACCA TCGCCTCGAT GCGCGCGAGC
TTCGCGGCGG GCGCCGATAT CGTCGAATTC GACGTGCATC CGACCACCGA CGGGCGCTTT
GCGGTGTTCC ATGACTGGAC GCTCGACTGC CGCACCGACG GTAAGGGCGT CACGCGGGAG
CATGCGATGA GGGAGCTGCG ACGCCTCGAC ATCGGCTATG GCTACACCGC CGATGGCGGC
CGGACCTTTC CGTTCCGTGG CAAAGGCATC GGCCTGATGC CGTCGCTGGA TGATGTGCTC
GCGACGTTTC CGGATCGGTC GTTCCTGATC AACATCAAGA GCAACGATCC GGCGGAAGGT
CGGCAGCTTG CAGCGGCTCT GGCGCCAATC GATGCGGCGC AACGCGGCCG TCTGATGGTC
TATGGCGGTG ACGCGCCGAT CGCGGCCCTC AGGGCGGCGC AACCCGAGCT GAGGCTCATG
TCGCGGAGCA TGCTGAAGAG CTGCTTGCTG CGCTATATCG GTTATGGCTG GACCGGGCTG
GTGCCCTCCG CCTGCCGCAC CATGATGGTG CTGGTCCCGA TCAATGTCGC GCCGTGGCTC
TGGGGGTGGC CGGATCGGTT CCTGGCACGA ATGGCGGACG CCGGCAGCCG TGTCTTCGTC
ATCGGCGCGT ATCGCGGCGA GGCATTTTCC GCGGGCCTCG ATACGCCGGC CGAGATCGCC
CGGTTGCCCA CGGCATTTGG CGGCGGCGTG ATGACCGATG AGATCGAAAC CGTCGCGCCG
CTGCTGAAGC GCGCTCCGCC GGGCGGCGCA TCCAACTGA
 
Protein sequence
MSRFGKLSGL VLLICAGFMF VNNTNLWVAK RAGQPLLLAH RGLSQQFDRT GLTGDTCTAA 
RMLPPTHDYL ENTIASMRAS FAAGADIVEF DVHPTTDGRF AVFHDWTLDC RTDGKGVTRE
HAMRELRRLD IGYGYTADGG RTFPFRGKGI GLMPSLDDVL ATFPDRSFLI NIKSNDPAEG
RQLAAALAPI DAAQRGRLMV YGGDAPIAAL RAAQPELRLM SRSMLKSCLL RYIGYGWTGL
VPSACRTMMV LVPINVAPWL WGWPDRFLAR MADAGSRVFV IGAYRGEAFS AGLDTPAEIA
RLPTAFGGGV MTDEIETVAP LLKRAPPGGA SN