Gene BBta_4854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_4854 
Symbol 
ID5154781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp5094317 
End bp5095981 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content66% 
IMG OID640559652 
Productputative ABC transporter (substrate binding protein) 
Protein accessionYP_001240783 
Protein GI148256198 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.441686 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGATG GTTTGCGAAC CCCGGCTCCG TCCCGGCGTG CCTTTCTCGC CGGCACTGCG 
TCGCTGGCCA CGAGCGTGGC GCTTGGCAGC GGCGTGGGAG CCGTGCAGGC GCAGGGGACC
GCGCCCGTCA CCGGCGGACG GTTGGTCTAT CTCGAGCAAC AGGCTCACAC CAATCTCTAT
CCGCCGGCCG GCGGTTTCTA TCCGAACGGC GGCGTGCTCA ATCAGATCAC GGACAAGCTG
ACCTACCAGA ATCCGAAGAC GCTGAAGATC GAGCCGTGGC TCGCCGAGAG CTGGAGCTTC
AACGCCGACC GGACGGAGTA TGTCTTCAAG CTGCGTAAGG GCGTGACGTT CTCCGACGGC
ACCCCGCTCG ATGCGGCGGC CGTGGCCAAG AATTTCGACG TCTACGGCAA GGGCAGCAAG
GAGCTGCGTC ATCCGGTCTC CGAGGTGATC AACAACTATG CCGGAAGCGA GGTCGTCGAT
CCGCTCACCG TGCGGTTCCG CTTCAGCAAA CCGTCACCGG GCTTCCTGCA GGGGACCTCA
GTGATTGGCT CCGGGATCGT ATCGCCGGCG ACGCTGGATC GTCGTTTCGA TGAGCTCGGC
GACGCCACGA AGATCATCGG CAGCGGCCCG TTCGTGGTGG CAAGCGAGCA ACTCGGTCGC
GAGTTGGTGC TGGCCGCGCG CAAGGACTAC GCCTGGGCGC CGGCCAGCTC GGCCAATCAG
GGCCGACCCT ATCTTGACGA GATCCGCTAT ATCGTGACCG GCGAGGACAG CGTCCGCATC
GGCGCGCTGC TCGCCGGCCA GGCCGACGTC ATCCGCCAAG TGCAGGCTTA TGACGAAGCC
CAAGTCGAGA AGGCTGGCTA TCGGATCTAC GCGCCGTCGA CGCGCGGCGT GAACAACAGC
GTCGTGTTCC GCCCCGACAA TCCGCTGGTC GCCGACATCC GCGTGCGCCG CGCGCTGCTG
CATGCGACCA ACGCCAAGGA GATCGTGGAG TCGCTGTTCT CGGAGCACTA TCCGCAGGCG
ACCTCGGTCA TCGCCTCGAC GGCGCAGGGC TATGTCAACC TCTCGGACAA GCTGACCTTC
GATCCGGCCA AGGCGCAGGC GCTGCTCGAC GAGGCCGGTT GGAAGCCTGG CCGCAACGGC
GTGCGCCAGC GCGGTGCGCA GGAGCTCGCT CTGACGGCCT ATGAATCGCT GCCGCAGCCG
CAGAACCGCG CGACCTTGCA GCTGATCGCC CAGCAATGGG CGCGGGTCGG CGTGAAGCTC
ACGGTGCTCG CCGGCGATTC CGGCAGTCGC GCCACCGACG ATCTCGATCC GCTCAAGACG
CCGGTGTCGC CGGCCATGGT CGGGCGCGCC GATCCCGACG TCATCAAGAG CCAGTATCAC
TGGGAGAACC GCGACATGCT GCGGCAGAAG CCGGGCCGCG GCAAGGTCAA CGAATTCGTC
GACGCGAAGC TCAATGAGCT GCTCGAAGCC ATCGCAGCCG AGGTGAATGA GGAGAAGCGG
CTCGACTACG TGGTGCAGGT GCAGACCTAT CTGCTCGACC AGGCCTATGT CATCCCGATC
TTCGAGGAGC CGCAGGCGTT CGCGGCGGCG CCCTATGTCC AGGGGCTCGG CTTCGAGGCG
GTCGGGCGGC CGTCGTTCTA CAACACCTGG CTCGCCAAAC GCTGA
 
Protein sequence
MSDGLRTPAP SRRAFLAGTA SLATSVALGS GVGAVQAQGT APVTGGRLVY LEQQAHTNLY 
PPAGGFYPNG GVLNQITDKL TYQNPKTLKI EPWLAESWSF NADRTEYVFK LRKGVTFSDG
TPLDAAAVAK NFDVYGKGSK ELRHPVSEVI NNYAGSEVVD PLTVRFRFSK PSPGFLQGTS
VIGSGIVSPA TLDRRFDELG DATKIIGSGP FVVASEQLGR ELVLAARKDY AWAPASSANQ
GRPYLDEIRY IVTGEDSVRI GALLAGQADV IRQVQAYDEA QVEKAGYRIY APSTRGVNNS
VVFRPDNPLV ADIRVRRALL HATNAKEIVE SLFSEHYPQA TSVIASTAQG YVNLSDKLTF
DPAKAQALLD EAGWKPGRNG VRQRGAQELA LTAYESLPQP QNRATLQLIA QQWARVGVKL
TVLAGDSGSR ATDDLDPLKT PVSPAMVGRA DPDVIKSQYH WENRDMLRQK PGRGKVNEFV
DAKLNELLEA IAAEVNEEKR LDYVVQVQTY LLDQAYVIPI FEEPQAFAAA PYVQGLGFEA
VGRPSFYNTW LAKR