Gene BBta_p0233 details


Gene Information

Locus tag: BBta_p0233
Symbol:
ID: 5148517
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009475
Strand:
Start bp: 171579
End bp: 172844
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 60%
IMG OID: 640539113
Product: TPR repeat-containing protein
Protein accession: YP_001220546
Protein GI: 148241045
COG category: [R] General function prediction only
COG ID: [COG4785] Lipoprotein NlpI, contains TPR repeats
TIGRFAM ID:
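
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(171579, 172844)
print(length)                           # 1266
print(expected_protein_length(length))  # 421
```

Both values agree with the Gene Length and Protein Length fields of the record.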


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.683027
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCGCT GTATTCGAAC ACTTTCTGTT GTTTGCGCCG CGACGCTCGT CTTCGCAATG 
ACAGCAGCAG CTTCTGCAAC CGATACTTCG GAGGTCGCGC GAGCCGCCTT TCAGGCCAAA
GATCGCGGAC AGCTCCTGAT GGCGATCGGC TTGTTCAACC AGGCCATGAA GGACGAGCAA
CTTTCGAAGA AGCAACGCGG CTTGCTGCTG TTCGGCCGTG GCACCGCCTA TCAGCAGCTC
GGGATAACAG AGGCCGCTCT CACCGATCTG GACGGGGTGG TCGCGCTGCT ACCTGATTTT
CCTGGCGGCT ACGCCTATCG CGCCGTGATC TGGATCGCTG AGCGGCGATA CGGAGAGGCG
CTCATGGATT TGCAGCAAGC CCATCAGCTT GCTCCGAACG ATGCGAGCGT GCTTCTGAAT
CTCGGCAACC TCTACGCTCA AACGGGCAGG CTTGAGCTTG CAATTGAGAA CTATGGCCAG
GCGATCGGGC TCCGTCCGGA TTTCGACAAG GCCTACTTCG ATCGGGCGGG CGCTTACATG
GTTAAGCACG ATTTCGCACG CGCGATGGCG GATTTCGACA AGGCGATCGA GCTTCGCCCC
ACATTTGCTG ATGCCATCGC CAATCGCGGG GCGCTGCATC TGGCGAACGG AAATGTCGAA
GCAGCTCTTT CCGACCTGAA CGCCGCCCTT GAGCTCGCGC CGCGCAACGC GCGATATTAT
GACGCCCGTG CCAATGCCTA TCTGGTTGAG GCCCGCTACG GCGATGCGCT TGCCGACTTC
GATGAAGCCT TGCAGATCGA TCCTGGCAAT CCCGCGCTGT ATTTCGGACG TGGTCTCGCC
AACCTGTTCC TTGACAATAC GGCGGCCGCC ATCGACGACC TCCAGATTGC GGTGCGGCTG
CGGCCCACCG ATGCCAACGC TGCGATCTGG CTGCACATTT CTCGCCTCCA TGCCAACACC
GTCGACAAAG ATGAATTCGC CACCAATGCC GCCAGAGTGA GCCGGGACGT GTGGCCTGGT
GCGGTGCTCG ATCTGTACCT GGGAGCACTG ACGCCGACCG AGATGCTCGA AAAGGCCCAA
GAAGGTGTTG AGCAAGACTC CGAGCGACGG CTTTGCGAGG CTCAGTTTTA TGTCGCCGAT
TATGGTATCC ACCGAGGCGC CACAAACGAG GCTTTGGACA TCATGAAGGG AGTTGTTTCG
CGTTGCCGCT CTTTTGCTCT TGTCTACGGC TCCGCACGGG CGGAGATCAG CTTGGCGCAG
CACTGA
 
Protein sequence
MDRCIRTLSV VCAATLVFAM TAAASATDTS EVARAAFQAK DRGQLLMAIG LFNQAMKDEQ 
LSKKQRGLLL FGRGTAYQQL GITEAALTDL DGVVALLPDF PGGYAYRAVI WIAERRYGEA
LMDLQQAHQL APNDASVLLN LGNLYAQTGR LELAIENYGQ AIGLRPDFDK AYFDRAGAYM
VKHDFARAMA DFDKAIELRP TFADAIANRG ALHLANGNVE AALSDLNAAL ELAPRNARYY
DARANAYLVE ARYGDALADF DEALQIDPGN PALYFGRGLA NLFLDNTAAA IDDLQIAVRL
RPTDANAAIW LHISRLHANT VDKDEFATNA ARVSRDVWPG AVLDLYLGAL TPTEMLEKAQ
EGVEQDSERR LCEAQFYVAD YGIHRGATNE ALDIMKGVVS RCRSFALVYG SARAEISLAQ
H
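
The sequences above are laid out in space-separated blocks of ten characters. The sketch below shows one way to join such blocks, compute a GC fraction, and translate a coding sequence. It is an illustration under stated assumptions rather than the database's own method: the helper names (`parse_blocks`, `gc_fraction`, `translate`) are hypothetical, the codon table is the standard amino-acid assignment that translation table 11 shares with table 1, and alternative start codons are not handled (this gene starts with ATG, so that does not matter here).

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout, third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def parse_blocks(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = parse_blocks(
    "ATGGATCGCT GTATTCGAAC ACTTTCTGTT GTTTGCGCCG CGACGCTCGT CTTCGCAATG"
)
print(translate(first_line))  # MDRCIRTLSVVCAATLVFAM
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.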