Gene Information |
Locus tag | Bcav_1116 |
Symbol | |
ID | 7859970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beutenbergia cavernae DSM 12333 |
Kingdom | Bacteria |
Replicon accession | NC_012669 |
Strand | + |
Start bp | 1252926 |
End bp | 1254833 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643865200 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002881138 |
Protein GI | 229819612 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
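The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not contribute an amino acid.
    return codons - 1 if includes_stop else codons


# Values taken from the record for Bcav_1116.
length = gene_length(1252926, 1254833)
print(length)                  # 1908
print(protein_length(length))  # 635
```

Under these assumptions the computed values agree with the Gene Length (1908 bp) and Protein Length (635 aa) rows.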
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.709816 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGATCA GGCGATTTGC CGCCTTCGGT GCAGCAGTGG CTGCCACCGC GCTCGCACTG TCCGCTTGCC AGTCGCCCGG CAGTGGCAGC GACGACAACG GCGGCGACGA TGAGGGCCTT TCCCAAGACA CAGCGGTCAC GTTCGGATGG AACCAGTCGT TCTACGAGTA CAACGGCGAC TCGTCGACGG GCAACGCGAC CGCCAACGCG ATCGTGCTCT ACCTGATGAA CAGCGCCTTC CGGTACTTCG ACGAGGACCT GAACATCGCT CAGGACACCT CGTTCGGCAC GTACGAGAAG ACGAGCGACG ACCCGCTGAC CGTCGAGTAC ACGGTGAACG ACGACGTGGC CTGGTCCGAC GGGACGCCCG TCGACGCTGC GGACATGCTC CTCTCGTGGG CAGCGACCAG CGGTCACTTC AACACCCTCG CGGACGAAGA GGTCGAGACG GACGAAGAGG GCAACGTCAC CAACCAGGGT GACAACGTCT ACTTCAGCGG CACCTCGGTC GCCTCGAGCT ACATCATCCC GACGCCGGAG ATCAGCGAGG ATGGCAAGAC GATCACCCTC GTCTACGACC GCCCCTTCGC CGACTGGGAG CAGGGCTTCG GCATCGGCGT CCCCGCCCAC GTCGTCGCGA TGCACGCGCT GGGGATCGAC GACGCGACGG AGGCCAAGGA GGCCCTCATC GACGCGATCA ACAACGACGA CACCGAGGCG CTGGCGGCCA TCTCGCAGTT CTGGAACGAG GGCTTCCAGT TCGGTGACTC GCTGCCGGAC GACGAGTCGC TCTACCTCTC GAACGGTGCG TACCTCCTCA CGGACTTCGT TCGCGACCAG TACGTGACGC TCGAGCGCAA CCCGGACTAC AACGGTGACC GTCCGGCGAA CGTCGAGCGC GTCACGATCC GCTACAACGG CGACCCGATG GCGTCCCTCC AGGCGCTGCA GAACGGTGAG GTCGACGTCA TCGAGCCGCA GGCGACGACG GACATCCTCC AGGCCGCCGA GGCGCTGGGT GAGGACTTCA CGGTGCTGAC CGGCGACGCC GCGACGTACG AGCACATCGA CCTCGCGCAG AACAACGGCG GTCCGTTCGA CCCGGCTGCC TACGGCGGCG ACGCGGCCAT CGCGCGCGAC GTCCGTGAGG CGTTCCTGCT CCTCATCCCG CGTGAGGACA TCGTCGACAC GCTGATCCGT CCGCTCAACC CGGAGGCGAA CGTTCGCCAG ACGAACCTCG CCGTGCCGAG CGCGCCGAAC TACGACGCGA TCGTGGCGGC GAACGGGTAC GAGGAGGCGT TCCCGGCGGT CGTCGACCAG TCGTCGATCG ACCGGGCGAC CCAGCTCCTG ACCGACGCCG GCGTCACGAC GCCGATCGAC GTCCGGATCA TGACGGACTC GCTGAACACC CGCCGCCAGA ACCAGCTGCA GATCATCTCT GACACGGTCA ACGCGTCGGG TCTGTTCAGC ATCGTCGACG CGTCCAACGC CGACTGGGGC GCGCTGCTGT CCGACACCTC GCAGTACGAC GCTGCGATCT TCGGGTGGCA GTCGACCGGC ACGGGTGCGA CGAACTCGGA CGCCAACTAC CGTCCGGGCG CGATCAACAA CTTCTACGGG TACGACAACC CTGAGGTGAC CGCGCTGCTC GACGAGATCG CCGTGACGAC CGACCAGGAC ACCGTGACGG AGCTGCAGGG CGAGATGGAG GCGTTCCTGG TTCAGGACGC GTTCGGCCTG CCGATCTACC AGCACCCGGG TGTCGACGTC TACCGGAACT CGATCGAGGG CATCAACCCG 
ATCGCCCTGA GCCCGACCGT GTTCTGGAAC TACTGGGAGT GGGACGTCAC CGGTGACACC GCCGGCGAGC CCACGTCCGA GGAGGCGACG GAGGAAGCGA CCGAGTAG
|
Protein sequence | MKIRRFAAFG AAVAATALAL SACQSPGSGS DDNGGDDEGL SQDTAVTFGW NQSFYEYNGD SSTGNATANA IVLYLMNSAF RYFDEDLNIA QDTSFGTYEK TSDDPLTVEY TVNDDVAWSD GTPVDAADML LSWAATSGHF NTLADEEVET DEEGNVTNQG DNVYFSGTSV ASSYIIPTPE ISEDGKTITL VYDRPFADWE QGFGIGVPAH VVAMHALGID DATEAKEALI DAINNDDTEA LAAISQFWNE GFQFGDSLPD DESLYLSNGA YLLTDFVRDQ YVTLERNPDY NGDRPANVER VTIRYNGDPM ASLQALQNGE VDVIEPQATT DILQAAEALG EDFTVLTGDA ATYEHIDLAQ NNGGPFDPAA YGGDAAIARD VREAFLLLIP REDIVDTLIR PLNPEANVRQ TNLAVPSAPN YDAIVAANGY EEAFPAVVDQ SSIDRATQLL TDAGVTTPID VRIMTDSLNT RRQNQLQIIS DTVNASGLFS IVDASNADWG ALLSDTSQYD AAIFGWQSTG TGATNSDANY RPGAINNFYG YDNPEVTALL DEIAVTTDQD TVTELQGEME AFLVQDAFGL PIYQHPGVDV YRNSIEGINP IALSPTVFWN YWEWDVTGDT AGEPTSEEAT EEATE
|
| |
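The gene sequence begins with GTG while the protein begins with M, which follows from translation table 11 (the bacterial, archaeal and plant plastid code), where GTG is an accepted initiation codon. The following is a minimal sketch of such a translation, not the pipeline the database used: the helper names are hypothetical, the start-codon set is the one listed for NCBI table 11, and the first codon is assumed to be read as methionine whenever it is in that set.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling through TCAG).
# Table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate an unspliced CDS, stopping at the first in-frame stop codon."""
    dna = clean(seq)
    if len(dna) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in the record.
print(translate_cds("GTGAAGATCA GGCGATTTGC CGCCTTCGGT"))  # MKIRRFAAFG
```

The first 30 nucleotides translate to MKIRRFAAFG, matching the first block of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_percent` would allow the Protein sequence and GC content rows to be checked in the same way.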