Gene BBta_3954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_3954 
Symbol 
ID5151540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp4154389 
End bp4155606 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content68% 
IMG OID640558789 
Producthypothetical protein 
Protein accessionYP_001239930 
Protein GI148255345 
COG category[S] Function unknown 
COG ID[COG1652] Uncharacterized protein containing LysM domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.10149 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.332371 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGTCA GCGGCAAAGC GGTGATCCGC ATCGTCCTCG GCGTGGTGGC CGTCGCCGTG 
ACGGCGGTCG CGCTGATCTT CGTGTTCGAC CCAGCCATCA ATTTTGATTT TCCGCCCTCC
GCCAGGGAGA AGACGGCAGC GGGCGCAGGG GCGCCGGCCC AGTCCGGATC GGGCGGGGTC
GCCGTGCAAA CTCCTGGCGC TCAGGACCAG GATCGCGCTG CAAGCCCGCT CGCCAAGATG
CAGCAGCAGG CGGGTGGTCT CGCAGACGTG CTGAGCCCGC TCGTTCCTCA GCCGAGTGCC
GATAGCGATT TGCCGGCGTT TGACGTGGTC AATGTCGACC CATCGGGTGA CGCCGTCGTC
GCGGGCCGGG CGACGCCTGG GGCCGCCGTG GAACTGCTGC GCAATGGCGA GGTCCACGAT
CGCACCGTTG CGGATCGTTC CGGGCAGTTT GCCATGGTTC CACGGCGGCT CCCTGCGGGG
ACCTACGATC TGACCTTGCG GGCGAAGCTC GCGGATGGGC GAGAGCTGTC CTCGAAACAG
AGCGTTGCCG TCATCGTCGA GGCTGGCCGA CAGCCGGCGA CCGTGGCCCT GCTCGCGCCC
GGCGAACCGA CCCGCGTTCT CTCGAAGCCT TCGGGATCGA TCCCGCAGGC GCTCGCGGTG
GACGCGGTCG ACGTCGAGCC GAGCGGCATC CTCCGCGTCA GCGGCCGCGC GCGGCCGGGG
GCGACCGTGA GACTTTACCT GAATGATCGC CTGATCTCCT CGGCGATCGC AGCCGCCGAT
GGACGTTTGA ACATCACCAT CGGCAAGGAC GTGGCGCGCG CCAGCAACGA CCGGATCCGG
CTCGACGAGG TGGATCCGAA GTCGGGATCG GTTCAGGCCC GGGCCGAGGT GCCGTTAAGT
CTGCCGGAGG ACTCGACGAC CGCCTCGCTG CCATCAGCCG CGGGAGCTGC AGGCAAGGCG
AACGGCAGTC CGGCAGCTCA GCGCACAGCG CTCGCAGCCG CGGGTGGTTC CCAGGACACG
GTCGCTCATG CCGGCGGAGG TGAGCCGAAG ATGATCACCG TCACCGTCGC CCGCGGCGAC
AGCCTGTGGC ACATCAGCCG GCGCCTGCTC GGCGGTGGGA CGCGCTACGC CGTGATCTAC
AAGGCCAATC GCGAGCAGAT CCGCAGTCCC GACCTGATCT ATCCGGGTCA GGTGTTCCTG
CTGCCGGCCA AGCGATAA
 
Protein sequence
MIVSGKAVIR IVLGVVAVAV TAVALIFVFD PAINFDFPPS AREKTAAGAG APAQSGSGGV 
AVQTPGAQDQ DRAASPLAKM QQQAGGLADV LSPLVPQPSA DSDLPAFDVV NVDPSGDAVV
AGRATPGAAV ELLRNGEVHD RTVADRSGQF AMVPRRLPAG TYDLTLRAKL ADGRELSSKQ
SVAVIVEAGR QPATVALLAP GEPTRVLSKP SGSIPQALAV DAVDVEPSGI LRVSGRARPG
ATVRLYLNDR LISSAIAAAD GRLNITIGKD VARASNDRIR LDEVDPKSGS VQARAEVPLS
LPEDSTTASL PSAAGAAGKA NGSPAAQRTA LAAAGGSQDT VAHAGGGEPK MITVTVARGD
SLWHISRRLL GGGTRYAVIY KANREQIRSP DLIYPGQVFL LPAKR