Gene BBta_5174 details


Gene Information

Locus tag: BBta_5174
Symbol:
ID: 5153914
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 5401605
End bp: 5403209
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 67%
IMG OID: 640559948
Product: putative peptidase S10, serine carboxypeptidase
Protein accession: YP_001241073
Protein GI: 148256488
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2939] Carboxypeptidase C (cathepsin A)
TIGRFAM ID:
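
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its convention) and that the CDS ends in a single stop codon. The helper names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(5401605, 5403209)
print(length_bp, protein_length(length_bp))  # 1605 534
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields reported above.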


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.255269
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGGTT TGAAGTCGAT GGTCGTGCGG CCGAGTCTTT CGGTTTCGGG GCTGGCGGTC 
TCGGCTCTGG CGGTGTTGGC TCTGGCGCTC TTGGTTGCTG GCCCGGCCCT GCATGCCCAG
GATGCGCCGG CAGCCGCGCA GGAGCACGCG GCGTCCGGCA GCGGCCCGCG CAGCGGTTCC
GCCGGCAACC GCGCCGCCAA CGCGCCGGCC TCGCCGTCGG CTGCCGAGCA GCACAAGCTG
CCGGCGGATT CCACGACCAG CCAGACGATC GAACTGCCCG GCCGCACGCT GTCCTTCAGC
GCGACCGCGG GATCGATCCG CCTGTTCGAC GACAAGAGCG AGCCGCAGGC CGACATCGCC
TACACCTCGT ATCAGCTTGA GGGCACCGAG CGCGCCAGCC GCCCGGTCAC GTTCTTCTTC
AATGGCGGGC CCGGCGCGTC CTCGGCCTAT CTGCAGCTCG GAAACGCCGG GCCGTGGCGG
ATCTCGATCG ATGCGGCCAG CATCACGCCC TCATCGAAGC CTGATCTGAT GCCCAATGCC
GAGACCTGGC TCGATTTCAC CGATCTTGTC TTCATCGATC CTGTCGGGAC CGGCTACAGC
CGCTTCGTCG CGTCCGGCGA CGAGGTCCGC AAGCGCTTCT TCTCGGTCGA TGGCGATGTG
CGCTCGCTGG CGCTGGTGGT GCGCCGCTGG CTCGAGAAGC ACGACCGGCT GGCATCGCCG
AAATTCATCG CGGCCGAGAG TTATGGCGGC ATTCGCGGTC CGAAGATCGT GAACAACCTG
CAGCACGATC AGGGAATTGG CATCAATGGT CTGATCCTGC TGTCGCCGCT GATGGATTTC
CGCGACTATT CCGGCTCGAG CCTGATGCAA TATGTCGCCA GCCTGCCAAG CTATGCCGCC
GTGGCGAAGC AGCTCAAGGG GCCGGTGACC CGCGCCGATC TCGCCGACGT CGAGCGTTAC
GCCAGCGGCG ATTTCCTGCT CGATCTCGTC AAGGGCGAGG CCGATCTGGA CGCGACCAAC
CGGCTCGCGG ACAAAGTTGC AAGCCTGACC GGGATCGACC CGGCGGTCAG CCGCAGGCTG
GCCGGCCGCT TCGACGTCTC CGAGTTTCGC CGCGAGTTCG ATCGCAAGAA CGGCCGCATC
GCCGGGCGCT ATGACGGCTC GGTGCTGGGC ATCGATCCCT ATCCGGATTC GAGCCGGTCA
CGCAGCGCCG ATCCCTCCGG TGACGTGCTG ACGGCGCCGC TCACCAGCGC GGCTGTCGAA
TTGACGACCC GCAAGCTGAA CTGGCGGCCC GACGGATCCT ACGAACTTCT GAGCGGGGCG
GTGAACCGGT CGTGGGATTT CGGCCGCGGC ATCAGCCCGC CGGAGTCGAC CAGCGAACTT
CGGCAGATCC TCGCGCTCGA TCCCAAGCTC AAGCTCCTGG TCGGACATGG TCTGTTCGAC
CTCGCGACGC CGTATTACGG CTCCAAGATC CTGCTCGATC AATTGCCCGC CTTCGCGCGG
CCGCCGCGGG TGAAGCTCGT GGTCTATCCG GGCGGCCACA TGTTCTATTC GCGCGACGAT
GCACGGCAGG CGTTCCGCGG CGAGGCCGAG GCGCTGATGA AGTAG
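
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage that could be compared with the GC content field of this record. The function names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as displayed in this record.
first_line = "ATGGCGGGTT TGAAGTCGAT GGTCGTGCGG CCGAGTCTTT CGGTTTCGGG GCTGGCGGTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Passing the full 1605 bp sequence to `gc_percent` would give the figure to compare against the reported value.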
 
Protein sequence
MAGLKSMVVR PSLSVSGLAV SALAVLALAL LVAGPALHAQ DAPAAAQEHA ASGSGPRSGS 
AGNRAANAPA SPSAAEQHKL PADSTTSQTI ELPGRTLSFS ATAGSIRLFD DKSEPQADIA
YTSYQLEGTE RASRPVTFFF NGGPGASSAY LQLGNAGPWR ISIDAASITP SSKPDLMPNA
ETWLDFTDLV FIDPVGTGYS RFVASGDEVR KRFFSVDGDV RSLALVVRRW LEKHDRLASP
KFIAAESYGG IRGPKIVNNL QHDQGIGING LILLSPLMDF RDYSGSSLMQ YVASLPSYAA
VAKQLKGPVT RADLADVERY ASGDFLLDLV KGEADLDATN RLADKVASLT GIDPAVSRRL
AGRFDVSEFR REFDRKNGRI AGRYDGSVLG IDPYPDSSRS RSADPSGDVL TAPLTSAAVE
LTTRKLNWRP DGSYELLSGA VNRSWDFGRG ISPPESTSEL RQILALDPKL KLLVGHGLFD
LATPYYGSKI LLDQLPAFAR PPRVKLVVYP GGHMFYSRDD ARQAFRGEAE ALMK
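
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds a codon table from the compact NCBI layout (bases ordered T, C, A, G), on the understanding that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may initiate translation. The `translate` helper is hypothetical. It does not handle alternative start codons, which is sufficient here because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGGCGGGTT TGAAGTCGAT GGTCGTGCGG"))  # MAGLKSMVVR
print(translate("GCGCTGATGA AGTAG"))                  # ALMK
```

The two fragments reproduce the first ten and last four residues of the protein sequence shown above.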