Gene BBta_1922 details

Gene Information

Locus tag: BBta_1922
Symbol:
ID: 5151360
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 1983414
End bp: 1985021
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 63%
IMG OID: 640556866
Product: putative ABC transporter, periplasmic solute-binding protein
Protein accession: YP_001238022
Protein GI: 148253437
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
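
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes inclusive 1-based coordinates and that the terminal stop codon is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(1983414, 1985021)
    print(length)                  # 1608
    print(protein_length(length))  # 535
```

Both values agree with the Gene Length and Protein Length fields of this record.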


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.92103
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAACG GGATCACCAA CTGGACTTCG CGTGACGATG TGCTGGTCGA GGCGGCGATC 
CGGCGCGGCG CGTCGCGCCG GGATCTCCTG AAGATGATGT TAGCCAACGG TGTCGCGCTC
GCTGCAGGCA GCACCATCCT CGGCCGCGCC GAGCGCGCTG ACGCGGCAAC GCCGAAGAAG
GGCGGATCAA TCAAGGCGGC GGGATGGTCG ACCTCGACCG CCGACACGCT CGATCCGGCC
AAGGCGTCGT TCTCGACCGA CTATGTCCGC TGCTGCTCGT TCTACAACCG GCTGACGACA
CTCGACAAGA ACGGCGCGGC GCAGATGGAG CTCGCTGAAG CCGTGGAGTC GACGGATGCG
CAGACCTGGA CGGTCAAGCT CAAGAAGGGC GTCACCTTCC ACGACGGCAA GCCGTTGACG
GCTGACGACG TCGTGTTCTC GCTGAAGCGG CATCTCGACA AGGCGACCGG CTCCAAGGTG
GCCAAGATCG CGGCGCAGAT GACCGGCTTC AAGGCGGTCG ACAAGACCAC CGTCGAGATC
ACCCTGGCGA GTCCGAACGC TGACCTGCCG ATGATCCTCG CGCTGCATCA CTTCATGATC
GTGGCCGACG GCACCACCGA CTTCTCCAAG GCGAACGGTA CCGGCGCCTT CGTCCGCGAG
GCTTTCGAGC CGGGCGTGCG CTCGATCGGC GTGCGCAACA AGAACTATTT CAAGGACGGT
CCCTATCTCG ACCAGATCGA GTTCATCGCG ATCACGGAAG AGAATGCGCG CGTCAACGCG
CTGCTCTCCG GCGATATTCA GCTCGCGGCC AGCATCAATC CGCGCTCGCT CCGCCTGATC
GAGGGCAAGG AGGGGATCGC GCTGTCGAAA TCGACGACCG GCAACTACAC CGATCTCAAC
ATGCGGCTCG ATATGGCCCC GGGCAACAAG AAAGATTTCA TCGATGGCAT GAAGCACATC
GTCAACCGCG AGCAGATCGT CAAATCGGTC CTGCGCGGCT TCGGGGTCGT CGGCAACGAC
CAGCCGGTCT CGCCGGCCAA CTTCTTCCAC AATCCCGACC TCAAGCCGAA GCCGTTCGAT
CCGGACAAGG CGAAATTCCT GTTCCAGAAG GCCGGCGTGC TCGGTACGCC GATCCCGGTG
GTCGCATCGG AAGCTGCGAA CTCCGCGATC GAGATGGCGA TGGTGATCCA GGCCTCGGCC
GCGGCGGTCG GACTGACGCT CGACGTGCAG CGCGTTCCGT CGGACGGCTA TTGGGACAAT
TACTGGCTGA AGGCGCCGGT CCATTTCGGC AACATCAATC CGCGGCCGAC GCCGGACATC
CTGTTTTCGC TGCTCTACGC CTCCGAGGCG CCTTGGAACG AGAGCCGCTA CAAGTCCGAG
ACGTTCGACA AGATGATGCT CGAGGCGCGC GGGATGCTCG ACCAGGCCAA GCGCAAGCAG
ATGTATGCCG AGATGCAGGT CAAGATCGCG GAGGAGGCGG GCACCATCAT TCCGGCCTAT
ATCTCCAATA TTGATGCCAC CTCGTCCAAG CTGAAGGGCT TGGAGCCGAG CCCGCTCGGC
GGCATGATGG GATATGCCTT TGCTGAATAT GTCTGGCTCG ATTCGTGA
 
Protein sequence
MSNGITNWTS RDDVLVEAAI RRGASRRDLL KMMLANGVAL AAGSTILGRA ERADAATPKK 
GGSIKAAGWS TSTADTLDPA KASFSTDYVR CCSFYNRLTT LDKNGAAQME LAEAVESTDA
QTWTVKLKKG VTFHDGKPLT ADDVVFSLKR HLDKATGSKV AKIAAQMTGF KAVDKTTVEI
TLASPNADLP MILALHHFMI VADGTTDFSK ANGTGAFVRE AFEPGVRSIG VRNKNYFKDG
PYLDQIEFIA ITEENARVNA LLSGDIQLAA SINPRSLRLI EGKEGIALSK STTGNYTDLN
MRLDMAPGNK KDFIDGMKHI VNREQIVKSV LRGFGVVGND QPVSPANFFH NPDLKPKPFD
PDKAKFLFQK AGVLGTPIPV VASEAANSAI EMAMVIQASA AAVGLTLDVQ RVPSDGYWDN
YWLKAPVHFG NINPRPTPDI LFSLLYASEA PWNESRYKSE TFDKMMLEAR GMLDQAKRKQ
MYAEMQVKIA EEAGTIIPAY ISNIDATSSK LKGLEPSPLG GMMGYAFAEY VWLDS
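
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: the helper names are hypothetical, the codon-to-residue mapping is the standard one (which translation table 11 shares for codon assignments), and only the first 30 bases of the gene are used as sample input. Substituting the full sequence would be needed to compare against the GC content reported for the whole gene.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    sample = "ATGAGCAACG GGATCACCAA CTGGACTTCG"
    print(translate(sample))  # MSNGITNWTS
    print(round(gc_fraction(sample), 3))
```

The translated sample matches the first ten residues of the protein sequence listed above.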