Gene BBta_2120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_2120 
Symbol 
ID5154389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp2196334 
End bp2197350 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content65% 
IMG OID640557057 
Productputative TRAP-type C4-dicarboxylate transport system, periplasmic component 
Protein accessionYP_001238213 
Protein GI148253628 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID[TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTCAG CCACATCCAT GTCCCGTCGC CGCTTCGTCG GCTCCGCGCT CGCAGGGGCC 
TCGGCTCTGG CGTTCGGGTC CGCGCAGGCG CAATCGGCCA AATATCGCCT GCGCTACGGC
ACGGCGTTTC CCGCCACCCA TCCCGGCGTC ATCAGAATCA TCGAGGCGTC CGAGCTGATC
AAGAAGCAGA CAAACGGCCT GGTCGATCTG CAGGTCTATC CGAACAGCCA GCTCGGCAGC
GAGCCCGACA TGTTCTCGCA AGTCCGCTCT GGCGCGCTCG ACTTCATGTC GACGTCAGGC
GTGAACCAGA CGGTGGTGCC GATCGGCGGC ATCAATGCGG TCGCCTTCGC GTTCGAGAGC
TACGACCAGG TGTGGTCGGC GATGGATGGC GATCTCGGCA ACCATGTGCG TGGCGAGTTT
GCCAAGGTCG GCCTGCACGT GCTGCCGAAA TGCCTCGACA ACGGCTACCG CAACATCACC
TCCGGCGCCA AGCCGATCAC GTCGCCGGAC GACCTTAAGG GCTTCAAGAT CCGCGTGCCC
GGCAATCCGC TGTGGGTGAC CTTGTTCAAG ACGCTGGGCG CCGCACCGAC GCCGATCAAT
TTCGGCGAGC TCTATGCCGC CTTGCAGACC CGCATCGTCG ACGGCCAGGA GAATCCGCTG
GCGCTGATCC AGAGCGCCAA GCTCTACGAG GTGCAGAAGT TCATCGCGCT GTCCGGCCAC
ATCTGGGACG GCCATCACAT CTTCGCCAAT GCCACGCGCT GGAGCGGTTT GCCGGCCGAC
GTGCGCGACG CCATCACCGC GGCGCTGTCG GATGCGGCGG TGAAGGAGCG GCAGGACATC
CAGAGCTTCA ACGAGAAGGC GCAGGCCGAG ATGCAGGCCG CCGGCATCGC CTTCAACAAG
GTCGATACCA AGCCGTTCCG CGACGCGCTG CGCACCGCCG GCTTCTATTC CGAGTGGAAG
ACCAAGTTCG GTGCCGAGGC CTGGAGCCTG CTCGAGAAGT CGGTCGGCCA GCTCTGA
 
Protein sequence
MSSATSMSRR RFVGSALAGA SALAFGSAQA QSAKYRLRYG TAFPATHPGV IRIIEASELI 
KKQTNGLVDL QVYPNSQLGS EPDMFSQVRS GALDFMSTSG VNQTVVPIGG INAVAFAFES
YDQVWSAMDG DLGNHVRGEF AKVGLHVLPK CLDNGYRNIT SGAKPITSPD DLKGFKIRVP
GNPLWVTLFK TLGAAPTPIN FGELYAALQT RIVDGQENPL ALIQSAKLYE VQKFIALSGH
IWDGHHIFAN ATRWSGLPAD VRDAITAALS DAAVKERQDI QSFNEKAQAE MQAAGIAFNK
VDTKPFRDAL RTAGFYSEWK TKFGAEAWSL LEKSVGQL