Gene BBta_5604 details

Gene Information

Locus tag: BBta_5604
Symbol:
ID: 5155423
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 5830803
End bp: 5831963
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 66%
IMG OID: 640560347
Product: putative extracellular ligand-binding receptor
Protein accession: YP_001241469
Protein GI: 148256884
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
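
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 5830803
end_bp = 5831963

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1161
print(protein_length)  # 386
```

Under these assumptions the computed values agree with the Gene Length (1161 bp) and Protein Length (386 aa) fields.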


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.236032
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTGGC TTTCGGAACT CCGTAGTTCT CTCACCCTCG TGCTATTGGC TGGCGCGCTG 
CATGCCGGCG CCGCGCGTGC CGACGAGCTC GGCGTCACCA GCGACTCCAT CCTGTTCGGA
CAGGTGGCCG CGCTGGAGGG GCCGTCGGCC GCGTTAGGTC AGGCGGCGCG GCAGGGCCTG
CTGGCCGCCT TCAACGAGCT CAACGCAAAA GGCGGCGTCT ATGGCAGACG ACTCAAGCTC
GCCAGCCGCA ACGACGGCTA CGATCCCGAT CGATCGGTGG TGGAGACCAT CAAGTTGATC
TACGAGGACA AGGTGTTCGC GCTGATCGGC GCCGTGGGCA CACCGACCTC GATCGCGACA
GCGCCGATCG CCGCCTCCAA CGACGTGCCC TTCATCGGCC CGGTCTCCGG CGCAGAGTTC
CTGCGGACAC CGGACTTCCA GAACATCGTG AACATCCGCG CCAGCTACGC CGCGGAGGCC
GAAGCCTCGA TCAAGCATCT CACGGAAGAG CTCCGCCTGA CGCGGATCGC GATCTTCTAT
CAGGATGACG CGTTCGGGCG CGACGTGCTG GCGGGCGTGA AGACCCATCT CGACCGCCGG
GGCCTGGAGC TTGCGGCCGA AGGCACCTTC GAGCGAAACA CGCGCGCCGT CGGCGCGGCC
TTGAAGGTGA TCCGTCGCGC AGAGCCTGAG GCGGTGATCC TCGTCGGCAC CTACGGGCCG
TGCGCCGAGT TCATCAAGAT GTCGCATCGC AGTGGCTTCA ATCCGACCTT CACGGCGGTC
TCCTTCGTCG GCGCCAATGC ACTGGCGAAA GAGCTCGGCG CCGAGGGCCG CGGCGTCATC
GTGTCCGAGG TCGTTCCGTT CCCGTGGGAC ACCGACCGCC GCGTCGTCTC AGATTATCAG
GCCGCGATGA AATCCCTGGA TCCGAGCCAG ACGCCCGACT TCATCGGGCT GGAGGGCTAC
ATCACCGGCC GTCTGGTCGC GCGAGCGCTG GCGATGACCG GACCCAATCC GACCCGCGCC
GATCTGCTCC GGATCATCAA CGAGGTCGGG CAGTTCGATA TTGGCGGCCT GGTCGTCAAC
TTCGGCAGCG CGCGCAAAGA CAATCCGCCG CAGGTCACGC TGACGGTGAT CCAGAGCGAC
GGCAGTTTCA AGCCGATCTA G
 
Protein sequence
MKWLSELRSS LTLVLLAGAL HAGAARADEL GVTSDSILFG QVAALEGPSA ALGQAARQGL 
LAAFNELNAK GGVYGRRLKL ASRNDGYDPD RSVVETIKLI YEDKVFALIG AVGTPTSIAT
APIAASNDVP FIGPVSGAEF LRTPDFQNIV NIRASYAAEA EASIKHLTEE LRLTRIAIFY
QDDAFGRDVL AGVKTHLDRR GLELAAEGTF ERNTRAVGAA LKVIRRAEPE AVILVGTYGP
CAEFIKMSHR SGFNPTFTAV SFVGANALAK ELGAEGRGVI VSEVVPFPWD TDRRVVSDYQ
AAMKSLDPSQ TPDFIGLEGY ITGRLVARAL AMTGPNPTRA DLLRIINEVG QFDIGGLVVN
FGSARKDNPP QVTLTVIQSD GSFKPI
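
The relationship between the two sequences above, and the GC content field, can be checked with a short script. This is a minimal sketch rather than the database's own method: the helper names `clean`, `gc_content`, and `translate` are hypothetical, and the codon table is written out by hand using the codon assignments of NCBI translation table 11, which match the standard code. Table 11 differs in its set of permitted start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, in the blocked layout used above.
print(translate("ATGAAGTGGC TTTCGGAACT CCGTAGTTCT"))  # MKWLSELRSS
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_content` on the same input can be compared against the GC content field in the gene information.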