Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_5174 |
Symbol | |
ID | 5153914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | - |
Start bp | 5401605 |
End bp | 5403209 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640559948 |
Product | putative peptidase S10, serine carboxypeptidase |
Protein accession | YP_001241073 |
Protein GI | 148256488 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.255269 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGGTT TGAAGTCGAT GGTCGTGCGG CCGAGTCTTT CGGTTTCGGG GCTGGCGGTC TCGGCTCTGG CGGTGTTGGC TCTGGCGCTC TTGGTTGCTG GCCCGGCCCT GCATGCCCAG GATGCGCCGG CAGCCGCGCA GGAGCACGCG GCGTCCGGCA GCGGCCCGCG CAGCGGTTCC GCCGGCAACC GCGCCGCCAA CGCGCCGGCC TCGCCGTCGG CTGCCGAGCA GCACAAGCTG CCGGCGGATT CCACGACCAG CCAGACGATC GAACTGCCCG GCCGCACGCT GTCCTTCAGC GCGACCGCGG GATCGATCCG CCTGTTCGAC GACAAGAGCG AGCCGCAGGC CGACATCGCC TACACCTCGT ATCAGCTTGA GGGCACCGAG CGCGCCAGCC GCCCGGTCAC GTTCTTCTTC AATGGCGGGC CCGGCGCGTC CTCGGCCTAT CTGCAGCTCG GAAACGCCGG GCCGTGGCGG ATCTCGATCG ATGCGGCCAG CATCACGCCC TCATCGAAGC CTGATCTGAT GCCCAATGCC GAGACCTGGC TCGATTTCAC CGATCTTGTC TTCATCGATC CTGTCGGGAC CGGCTACAGC CGCTTCGTCG CGTCCGGCGA CGAGGTCCGC AAGCGCTTCT TCTCGGTCGA TGGCGATGTG CGCTCGCTGG CGCTGGTGGT GCGCCGCTGG CTCGAGAAGC ACGACCGGCT GGCATCGCCG AAATTCATCG CGGCCGAGAG TTATGGCGGC ATTCGCGGTC CGAAGATCGT GAACAACCTG CAGCACGATC AGGGAATTGG CATCAATGGT CTGATCCTGC TGTCGCCGCT GATGGATTTC CGCGACTATT CCGGCTCGAG CCTGATGCAA TATGTCGCCA GCCTGCCAAG CTATGCCGCC GTGGCGAAGC AGCTCAAGGG GCCGGTGACC CGCGCCGATC TCGCCGACGT CGAGCGTTAC GCCAGCGGCG ATTTCCTGCT CGATCTCGTC AAGGGCGAGG CCGATCTGGA CGCGACCAAC CGGCTCGCGG ACAAAGTTGC AAGCCTGACC GGGATCGACC CGGCGGTCAG CCGCAGGCTG GCCGGCCGCT TCGACGTCTC CGAGTTTCGC CGCGAGTTCG ATCGCAAGAA CGGCCGCATC GCCGGGCGCT ATGACGGCTC GGTGCTGGGC ATCGATCCCT ATCCGGATTC GAGCCGGTCA CGCAGCGCCG ATCCCTCCGG TGACGTGCTG ACGGCGCCGC TCACCAGCGC GGCTGTCGAA TTGACGACCC GCAAGCTGAA CTGGCGGCCC GACGGATCCT ACGAACTTCT GAGCGGGGCG GTGAACCGGT CGTGGGATTT CGGCCGCGGC ATCAGCCCGC CGGAGTCGAC CAGCGAACTT CGGCAGATCC TCGCGCTCGA TCCCAAGCTC AAGCTCCTGG TCGGACATGG TCTGTTCGAC CTCGCGACGC CGTATTACGG CTCCAAGATC CTGCTCGATC AATTGCCCGC CTTCGCGCGG CCGCCGCGGG TGAAGCTCGT GGTCTATCCG GGCGGCCACA TGTTCTATTC GCGCGACGAT GCACGGCAGG CGTTCCGCGG CGAGGCCGAG GCGCTGATGA AGTAG
|
Protein sequence | MAGLKSMVVR PSLSVSGLAV SALAVLALAL LVAGPALHAQ DAPAAAQEHA ASGSGPRSGS AGNRAANAPA SPSAAEQHKL PADSTTSQTI ELPGRTLSFS ATAGSIRLFD DKSEPQADIA YTSYQLEGTE RASRPVTFFF NGGPGASSAY LQLGNAGPWR ISIDAASITP SSKPDLMPNA ETWLDFTDLV FIDPVGTGYS RFVASGDEVR KRFFSVDGDV RSLALVVRRW LEKHDRLASP KFIAAESYGG IRGPKIVNNL QHDQGIGING LILLSPLMDF RDYSGSSLMQ YVASLPSYAA VAKQLKGPVT RADLADVERY ASGDFLLDLV KGEADLDATN RLADKVASLT GIDPAVSRRL AGRFDVSEFR REFDRKNGRI AGRYDGSVLG IDPYPDSSRS RSADPSGDVL TAPLTSAAVE LTTRKLNWRP DGSYELLSGA VNRSWDFGRG ISPPESTSEL RQILALDPKL KLLVGHGLFD LATPYYGSKI LLDQLPAFAR PPRVKLVVYP GGHMFYSRDD ARQAFRGEAE ALMK
|
| |