Gene BBta_2203 details


Gene Information

Locus tag: BBta_2203
Symbol:
ID: 5150896
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 2275883
End bp: 2277088
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 65%
IMG OID: 640557133
Product: putative regulatory protein (nitrile hydratase activator like)
Protein accession: YP_001238289
Protein GI: 148253704
COG category: [R] General function prediction only
COG ID: [COG0523] Putative GTPases (G3E family)
TIGRFAM ID:
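
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates (the convention under which End minus Start plus 1 gives the listed 1206 bp) and that the gene length includes one stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2275883, 2277088)
print(length, protein_length(length))  # prints: 1206 401
```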


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.514388
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCAAAC TCCCCGTCAC AGTCTTGTCC GGCTTCCTTG GGGCTGGGAA GACTACTTTG 
CTCAACCACA TCCTGAACAA CCGCCAGGGT CTGAAAGTGG CGGTGATCGT CAACGACATG
AGCGAGGTGA ACATCGATGC GGACCTCGTT CGCGATGGCG GTGCCAATCT TTCGCGGACG
GATGAGCAGT TGGTCGAAAT GACCAATGGC TGTATCTGCT GCACCCTGCG CGACGATCTG
CTGAAGGAGG TTCGCGCGCT CGCCGAAAGC GGCCGGTTCG ATTACCTCGT GATCGAATCG
ACCGGGATCT CCGAGCCGCT CCCGGTCGCC GCGACGTTCG ACTTCCGCGA CGAGCACGGC
GCGAGCCTGT CGGACGTCGC ACGCCTCGAC ACGATGGTGA CGGTCGTGGA CGCGGTGAAT
CTGCTGAAGG ATTATTCCTC GTCGGATTTC CTCGCGCAGC GCGGCGAGGC ACTGGATGGC
GACCAGCGCG CGCTGGTCGA TCTCCTGGTC GAGCAGATCG AGTTCGCCGA CGTGGTCGTG
CTGAACAAGG TCGACGACGC CAGCGAGGAT CAACGCGAGG CGGCGCGCAA GATCATTCGC
TCGCTCAATC CCGATGCCGA TTTGATCGAG GCAAGCCATA GCTGCGTGCC GCTGGAGCGC
GTGCTCGCCA CCGGCCGTTT CGACTTCGCG CGCGCTCAGC AGCATCCGCT CTGGTATAAT
GAGCTCTACG GCTTTGCCGA GCACACGCCG GAGACGGAGA CCTATGGCGT GACCAGCTTC
GTCTATCGCG CCCGGCGTCC GTTCGTTCCG GTTCGGTTCG ATCAGTTCCT GCGCGAGCAG
TGGCCTGGGG TCATCCGCGC CAAGGGGCAC TTCTGGCTGG CCACGCGGCC GCAATGGCTC
GGCGAGATCA GCCAGGCCGG AGCCATCGTG CGCACCAGCG CGCTCGGCTT CTGGTGGGCG
GCGGTGCCGC AGAAATTATG GCCGGAGGAC CCCGCTTGGC GCGCACGCAT GCTCGAGCGC
TGGGACGACA TCTACGGCGA CCGCCGTCAG GAGATTGTCT TCATCGGCAG CAACATGGAC
GAGGCTGGGC TGCGTGCGCG CCTGGACGCC TGCCTGCTGG CCGGCAAGCC GGCGATGGAT
GTCGCCGCCT GGGCCAGATT GTCCGATCCG TTTCCCGTCT GGCGACGGGC CAATGAGGCA
GCCTGA
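
The GC content listed under Gene Information is the fraction of G and C bases in the gene sequence. The following is a minimal sketch of that calculation, assuming the sequence is pasted as plain text with the spacing and line breaks used on this page. `gc_fraction` is a hypothetical helper name, and only the first row of the sequence is used to keep the example short.

```python
def gc_fraction(raw_sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for display grouping."""
    seq = "".join(raw_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence as displayed (60 bases in groups of 10).
first_row = "ATGCCCAAAC TCCCCGTCAC AGTCTTGTCC GGCTTCCTTG GGGCTGGGAA GACTACTTTG"
print(f"{gc_fraction(first_row):.2f}")
```

Passing the full 1206-base sequence instead of `first_row` gives the value for the whole gene, which can be compared with the rounded percentage in the record.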
 
Protein sequence
MPKLPVTVLS GFLGAGKTTL LNHILNNRQG LKVAVIVNDM SEVNIDADLV RDGGANLSRT 
DEQLVEMTNG CICCTLRDDL LKEVRALAES GRFDYLVIES TGISEPLPVA ATFDFRDEHG
ASLSDVARLD TMVTVVDAVN LLKDYSSSDF LAQRGEALDG DQRALVDLLV EQIEFADVVV
LNKVDDASED QREAARKIIR SLNPDADLIE ASHSCVPLER VLATGRFDFA RAQQHPLWYN
ELYGFAEHTP ETETYGVTSF VYRARRPFVP VRFDQFLREQ WPGVIRAKGH FWLATRPQWL
GEISQAGAIV RTSALGFWWA AVPQKLWPED PAWRARMLER WDDIYGDRRQ EIVFIGSNMD
EAGLRARLDA CLLAGKPAMD VAAWARLSDP FPVWRRANEA A
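
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The sketch below is an illustration, not the database's own pipeline. It builds the codon table from the amino-acid string used in the NCBI genetic code listings; table 11 assigns the same amino acids as the standard code and differs in its permitted start codons, which this sketch does not handle beyond ATG. `translate` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(raw_sequence.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence shown above.
print(translate("ATGCCCAAAC TCCCCGTCAC AGTCTTGTCC"))  # prints: MPKLPVTVLS
```

The printed residues match the first ten residues of the protein sequence in this record.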