Gene BBta_4914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_4914 
SymboldegP 
ID5156174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp5155929 
End bp5157053 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content67% 
IMG OID640559712 
Productserine protease 
Protein accessionYP_001240842 
Protein GI148256257 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.471766 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGAT GGAGCTTTGC CGCGCGCCTG TTGGCGGTCG CGGCGACGGC GCTGCTGCTC 
ATGATGGCGT GGCAGACTTT CCCGCTGATC CAGGCGGAGA TCCTGGGCCT GCGCGCCAAG
CCCCGCGAGA TCACGGCGCG CGGCGACCTC GCCGCCGACG AAAAGAGCAC GATTGCGCTG
TTCGAAAGCC GCAGCGGCTC GGTGGTCTTC ATCACGACCG TTCAACAATC AGTCAATGCC
TGGACAGGCG ATGCGCAGCA GGAGCGCTCC GGCACCGGCT CCGGCTTCGT CTGGGACGAT
CTTGGCCATG TCGTCACCAA TTATCACGTC ATCGAGGGCG CGACTGAAGC ACTGGTCAGC
CTGACCGATG GCCGCTCGTT CCGCGCGGCC CTGGTCGGCG CCAACCCGGA GAACGATCTC
GCGGTGCTGC TGATCGGCGT CGGCACCGAC CGGCCGAAGC CGTTGCCGAT CGGGACCAGC
GCCGATCTCA AAGTGGGGCA GAAAGTGTTC GCGATCGGCA ATCCGTTCGG CCTCAGCAGT
ACGCTGACCA CGGGCATCGT CTCGGCGCTC AACCGCAACC TGCAGGTCAC GCAGGAGCGC
ACCCTCAACG GCTTGATCCA GACCGATGCC GCGATCAATC CCGGCAATTC CGGCGGGCCG
CTGCTCGACA GCGCCGGACG GCTGATCGGG GTCAATACCG CGATCTACAG CCCGTCCGGG
GCGTCGGCCG GGATCGGCTT CGCCGTGCCG GTCGACAAGG TCAACCGCAT CGTGCCGCGG
CTGATCGCGA GCGGCCGCTA TGTCAGCCCG AGTCTCGGGA TCCGGACCGA TCCAAAAGCC
AATGAGGCGC TGTCGGCCCG CCTCAATATG TCGGGCGTGT TCGTGCTCGA TGTCGAGCCG
GATTCGGCGG CGGAGAAGGC GGGGCTGATC CCGGCGCGCC TGACCCGCGA CGGCGGCTTC
GCGCTTGGCG ACGTGCTGCT GGCCATCGAC GGACAGGTGG TGGATTCGCC CGACGACATG
ACACGGGCGT TGGAGACCAA GACTCCCGGC GACCGCGTCG TGCTGCGGGT CAGGCGCGCC
GGCAAGACGA TCGAGGTCCG GGTGACGCTC GACGTCGCGC GGTGA
 
Protein sequence
MSRWSFAARL LAVAATALLL MMAWQTFPLI QAEILGLRAK PREITARGDL AADEKSTIAL 
FESRSGSVVF ITTVQQSVNA WTGDAQQERS GTGSGFVWDD LGHVVTNYHV IEGATEALVS
LTDGRSFRAA LVGANPENDL AVLLIGVGTD RPKPLPIGTS ADLKVGQKVF AIGNPFGLSS
TLTTGIVSAL NRNLQVTQER TLNGLIQTDA AINPGNSGGP LLDSAGRLIG VNTAIYSPSG
ASAGIGFAVP VDKVNRIVPR LIASGRYVSP SLGIRTDPKA NEALSARLNM SGVFVLDVEP
DSAAEKAGLI PARLTRDGGF ALGDVLLAID GQVVDSPDDM TRALETKTPG DRVVLRVRRA
GKTIEVRVTL DVAR