Gene BBta_2149 details


Gene Information

Locus tag: BBta_2149
Symbol:
ID: 5155300
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 2227912
End bp: 2229036
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 63%
IMG OID: 640557085
Product: hypothetical protein
Protein accession: YP_001238241
Protein GI: 148253656
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
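
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates and a protein length that does not count the terminal stop codon. Both are assumptions, although they agree with the values listed here. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming one trailing stop codon that is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2227912, 2229036)
print(length, protein_length(length))  # 1125 374
```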


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.393338
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGGTT GGAAACAGAG CAACGCGGCT GCAGCGAAGC CGCCCAGCCC GGAGCCGTCG 
CCGGCGCTTG CGCCGGTTGC GCCCTCACAG CAGGACGAGT TGATGCGCCG GTGGATCGCG
TTCGCCGGCA TGCAGCAGCG CGTGATCCGG ACGCTTGTCA GCGAGATCCA GCAGACCTCC
GCCGTGGTGG AGACCGAGGC GGACAGTCTG AGCAGCCGGT TTCAGCGTCT CGCCGTCTGC
GCGGGGCAGC AGACGGAACG CGTCGAAAGC CTGAGCAAGC TCGCGATGGG CATCGAGGTC
GATGGCGAGG CGATCGCGAT CGATCGCATC GCGGGTCTGC TCGAGGAGAC ATTGAGCGAC
GTCGTCGAGA AGATCCTGCT GCTGTCGAAG GACGCGATGT CGATGGTCTA CGCGTTGAGC
GAGCTCAACG GGAACGTCAA CCGCGTCGAC TCCTGCATGG AAGAGTTGAA CAAGATCAAC
CGCGTCACCA ATATGCTGGC GCTCAATGCC AGGATCGAGG CGGAGCGGGC TGGAACGGCG
GGCGCAGCGT TTCGTGTGGT CGCCGGTGAG GTCCGCGAGC TGTCGAGCGC CACGCAGCGG
TTGTCCGTCG ACATGGCGAC GGAGCTGCAT GCCGTCACCC AGGGCATCGA GAACGGCCAC
GAAACGCTGC AGCGCGTCGC GACCATCGAT ATGTCGCAGA ACCTGATGGC CAAGGACCGT
CTCGAGCTGC TGATGAACGC CCTGATCGAG CGCGGCGGCA ACCTGACCGA AGTCGTCAAT
GAAGCGATGA AAGAAGCCGA GGTGATCTCG GCCGACGTCG CCGGCATGAT CACGGGCATC
CAGTTCCAGG ATCGCACGCG GCAACGGCTC GAACATGTGG TCGACACGTT GCGCGTTGTC
GACGAGGCGC TCGACGAGCT GAAGACGACG ACGGCCGATG TCCTGGATGA ACCGGTCGTG
GAGACGACAA TCGACAATGA ATGGGTCAAG ACGCTGCTCG ATCGGTTCAC GCTCGGCGAA
TTGAGGTCGC GCTTCGTCGC GCAGATTCTC GAAGGCAAGC AGCCGGCCGA TCCGAGCGAA
ACGGAGGCCA GCCCTTCGCA GACGGGGACC ATTGAACTGT TTTAG
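
The GC content field of this record can in principle be recomputed from the gene sequence. The sketch below assumes GC content means the percentage of G and C bases over the whole sequence, and that the spaces between 10-base blocks are display formatting only. The function name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence, used only as sample input.
print(gc_percent("ATGTTCGGTT GGAAACAGAG"))
```

Passing the complete gene sequence instead of the sample blocks gives a value that can be compared with the GC content reported in the record.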
 
Protein sequence
MFGWKQSNAA AAKPPSPEPS PALAPVAPSQ QDELMRRWIA FAGMQQRVIR TLVSEIQQTS 
AVVETEADSL SSRFQRLAVC AGQQTERVES LSKLAMGIEV DGEAIAIDRI AGLLEETLSD
VVEKILLLSK DAMSMVYALS ELNGNVNRVD SCMEELNKIN RVTNMLALNA RIEAERAGTA
GAAFRVVAGE VRELSSATQR LSVDMATELH AVTQGIENGH ETLQRVATID MSQNLMAKDR
LELLMNALIE RGGNLTEVVN EAMKEAEVIS ADVAGMITGI QFQDRTRQRL EHVVDTLRVV
DEALDELKTT TADVLDEPVV ETTIDNEWVK TLLDRFTLGE LRSRFVAQIL EGKQPADPSE
TEASPSQTGT IELF
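
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that step, not the database's own method. It assumes that translation table 11 uses the same amino acid assignments as the standard genetic code (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' is a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two 10-base blocks of the gene sequence; trailing partial codon is ignored.
print(translate("ATGTTCGGTT GGAAACAGAG"))  # MFGWKQ
```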