Gene BBta_3698 details


Gene Information

Locus tag: BBta_3698
Symbol:
ID: 5155228
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 3860528
End bp: 3861733
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 68%
IMG OID: 640558537
Product: putative peptidase
Protein accession: YP_001239683
Protein GI: 148255098
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2195] Di- and tripeptidases
TIGRFAM ID:
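
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are inclusive and that the coding sequence ends with a stop codon that contributes no residue to the protein; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3860528
end_bp = 3861733
gene_length_bp = 1206
protein_length_aa = 401

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp. Assumption: the last codon is a stop
# codon, which does not add an amino acid to the protein.
codons = span // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions the three fields agree: the 1206 bp span corresponds to 402 codons, one of which is the stop codon.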


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.704745
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGTCG ACACCGCCGC CGCCACCGAT CGCCTGATGC GTTTTCTCGC CGTGCCGGGG 
GTGACCGGAC AGGAGAAGGC GATCGCGCGC GAGATCATGG CGGCGCTGAA GCAGGCCGGC
GTGCCGGCCA AGGCGATGCG CTTCGACGAT GCCAATACGC GCATTGCGCT GCCGACCGAG
ACCGGCAATC TCATCGTGGA GCTGCCCGGC CGCGGCACGA TGCGCGATCA GCCGCGCCCG
ATGTTCATGA CCCACATGGA CACGGTGCCG CTCTGCGCCG GGGCGCAGCC GAAGATCTCC
GGCCGGAAGA TCGTCAACCA GGCGAAGACC GCGCTCGGCG GCGACAACCG CTGCGGCTGC
GGCATCCTGG TGTCGCTCGC GGCCGAGCTG ATCCGGCAAA AGCTCGATCA TCCGCCGATC
ACGCTGCTGT TCTGCGTACG CGAGGAGAGC GGCCTGCACG GCGCGCGCCA CGTCGACCTC
AACGCGCTCG GCGCGCCCGC AATGGCCTTC AATTTCGACG GCAGCTCCGC CTCCAGCATC
ACCATTGCCG CTGTGGGCGC CGACCATTGG GAGGCGGAGA TCTTCGGCCG CGCCTCCCAT
GCCGGCGTCG CGCCAGAGCG GGGCATCAGC GCCACCATGA TCCTGGCGCT CGCGCTCGCC
GACGTCAGAG CCGGCGGCTG GTTCGGCAAG GTGGTGAAGG GCAAGGGCAA GGACGCCAGG
CAGGGCACCA GCAATGTCGG CCCGGTCACC GGCGGCGACG GCCGCCCTGC GGGCGACGCC
ACCAATGTCG TCACCGACTA CGTCCATGTG CGCGGCGAAT GCCGCAGCCA TGACGCCAAA
TTCGTTCGCG AGATCAGCAA CGCCTACAAA GCCGCGTTCG AGAAGGCCGC CAAGCAGGTC
ACCAACAGCG ATGGCAAGTC CGGCCGCGTC AAGTTCAAGG CAGTGACGGA GTTTCCGCCG
TTCCGGATCA AGGAGACCCT GCCCGTCGTC AAGCGCGCCA CCGCCGCCGT CACCGACATC
GGCGCCACGC CAACCTTGCG CGCCGGCAAT GGCGGCCTCG ACGCCAATTG GATGGTCCGC
CACGGCATCC CCACCGTCAC CTTCGGCACC GGCCAGAACG AGCCGCACAC GATCGACGAA
TGGATCAGCC TGGACGAATA TGACCGCGCC TGCGCGCTGG CGCTGCGGCT GGCAACGATG
GGATGA
 
Protein sequence
MSVDTAAATD RLMRFLAVPG VTGQEKAIAR EIMAALKQAG VPAKAMRFDD ANTRIALPTE 
TGNLIVELPG RGTMRDQPRP MFMTHMDTVP LCAGAQPKIS GRKIVNQAKT ALGGDNRCGC
GILVSLAAEL IRQKLDHPPI TLLFCVREES GLHGARHVDL NALGAPAMAF NFDGSSASSI
TIAAVGADHW EAEIFGRASH AGVAPERGIS ATMILALALA DVRAGGWFGK VVKGKGKDAR
QGTSNVGPVT GGDGRPAGDA TNVVTDYVHV RGECRSHDAK FVREISNAYK AAFEKAAKQV
TNSDGKSGRV KFKAVTEFPP FRIKETLPVV KRATAAVTDI GATPTLRAGN GGLDANWMVR
HGIPTVTFGT GQNEPHTIDE WISLDEYDRA CALALRLATM G