Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3621 |
Symbol | |
ID | 9157800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 3735400 |
End bp | 3737490 |
Gene Length | 2091 bp |
Protein Length | 696 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | Oligopeptidase B |
Protein accession | YP_003648538 |
Protein GI | 296141295 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGACG CCGTCCAGCC CGTCGCCAAG AAGGTTCCCA GTGAACGGAC CTTCCACGGC GACACCGTGA CCGACGACTA CGCCTGGCTC GCCGACACCT CGAACGAGGA GGTGCTCGAC TACCTCCACC GGCACAATGC CTACACCGAG GCTGCGACGG CGACGCAGGA GCCGCTGCGC CAGAAGATCT TCAATGAGAT CAAAGCGCAC ACCCAAGAGA CCGATATGTC GGTCCCGCAA CGTCGCGGTG GCTATTGGTA CTACGCCCGC ACCAAGGAAG GCGCCCAGTA CGGCATTCAC TGTCGTGCAC CGATCACCGG ACCCGACGAC TGGTCGCCGC CGATCCTCTC CGATCAGCCG CTCCCCGGCG AGCAGGTGGT GATCGACCTC AACGTGGAGG CCGAGGGGCA TGAGTACATC GCCCTCGGTG CCGCGTCCGT CTCCACCGAC GGCGAGTTGC TGGCGTACTC GCTGGACACT TCCGGTGACG AGCGATTCAC CTTGCGCGTG CGCAACATCG GCACCGGTGA GGTACTACCC GATGTGGTCG AGGGCGTCTT CTACGGCGCC ACGTGGGCAC CCGATTCGCG GCACCTCTTC TACACCACCG TTGACGATGC CTGGCGCGCC GACAGTATCT GGCGCCACGA GATCGGTTCC GGTGCAGAGG ATGTACGCGT CTTCCACGAG ACCGATCAAC GGTTCGGTGT GGGCGTCGGG CTCACCCGCA GCGAGCGCTA TCTGATGATC GCCGCATCGT CGACGTTGAG CTCCGAGACC TGGGTCCTCG AGGCGACCGA TCCGACCGGC GAGTTCCGCG TGCTCATCCC GCGACAAGAA GACGTCGAGT ACTCCGCGGA GCATGCCGTG CTCGACGGCG AAGACCGGTT CCTCCTGCTG CACAACCGCA CCGGCATCAA CTTCGAGCTG GTCTCCGCGC CGGTCGACGC ACCCGAGGAC TGGACCGTCG TCGTGCCGCA CCGAGACGAC GTGCGCCTCG AGTACGTCGA CGCCTACGCG CGCACCCTGG CGCTCGGCTA CCGCCGCGCA GGCCTGCCCC GCCTCGCGCT CGCCGAAGCC ACGACAGCAC CGTCGTTCAC CGAATGGGAT CCGGGCGAAC CGCTCGCCAA CGTGGGCCCC GCGGCGAACC CCGAATGGGA CGCGCCGCGC CTGCGCCTGG CGTACGAATC GTTCGTGACG CCGGGCACGG TCTTAGAACT CGACGCCGCG ACCGGGGCGT CGACGGTGCT CAAGCGCGTC AACGTGCCGG GCTACGACTC CGCGCTGTAC ACCGCGGAAC AGATCTGGGT GAGTGCGCGC GACGGCGCCG AGGTCCCGGT CTCGGTGGTG CATCGCAAGG ACATCCCCGC GGGCGCCGCT CCGACCCTGC TCTACGGCTA CGGCTCCTAC GAGGCCACCC TGGATCCGTG GTTCTCCGTG GCCCGGCTCT CGCTCATGGA TCGCGGCGTC GTGTTCGCCG TCGCCCACAT CCGCGGCGGG GGCGAAATGG GGCGCGCCTG GTACGAGCAC GGCAAGCAGC TGGAGAAGAC CAACACCTTC ACGGATTTCG TGGATGTGGC TCGGCATCTC GTCGACGCGG GGCGCGCGGC CCCCTCGAAG CTGGTCGCCA TGGGCGGCAG CGCGGGCGGT CTTCTGGTGG GTGCCGTCGC GAACCTCGCG CCGGAATTGT TCTGCGGCAT CGTCGCCGAC GTTCCCTTCG TGGACCCGCT CACCTCGATC CTCGATCCGT CGCTCCCGCT GACCGTGGGG GAGTGGGACG AATGGGGTAA TCCGCTGGAG AGCGCCGAGG TGTACCGCTA CATGAAGGCC TACTCGCCGT ACGAGAACGT CGAGGCCAAG GCCTATCCCG CGCTGCTGGT GACCACCTCG CTCAATGACA CCCGCGTACT CCCCACCGAG CCGGCGAAAT GGGTCGCGAA ACTCCTCGAT CACACCACCT CCGGTGAGCA GATCCTGCTC AAGACCGAGA TGGTCGCCGG CCATGCGGGC GTCAGCGGGA GGTACGCCAA GTGGCGGGAG ACCGCGTTCG AGTACGCCTG GGTCCTCGAC AGGCTGGGTG CGGCCCAGTA G
|
Protein sequence | MTDAVQPVAK KVPSERTFHG DTVTDDYAWL ADTSNEEVLD YLHRHNAYTE AATATQEPLR QKIFNEIKAH TQETDMSVPQ RRGGYWYYAR TKEGAQYGIH CRAPITGPDD WSPPILSDQP LPGEQVVIDL NVEAEGHEYI ALGAASVSTD GELLAYSLDT SGDERFTLRV RNIGTGEVLP DVVEGVFYGA TWAPDSRHLF YTTVDDAWRA DSIWRHEIGS GAEDVRVFHE TDQRFGVGVG LTRSERYLMI AASSTLSSET WVLEATDPTG EFRVLIPRQE DVEYSAEHAV LDGEDRFLLL HNRTGINFEL VSAPVDAPED WTVVVPHRDD VRLEYVDAYA RTLALGYRRA GLPRLALAEA TTAPSFTEWD PGEPLANVGP AANPEWDAPR LRLAYESFVT PGTVLELDAA TGASTVLKRV NVPGYDSALY TAEQIWVSAR DGAEVPVSVV HRKDIPAGAA PTLLYGYGSY EATLDPWFSV ARLSLMDRGV VFAVAHIRGG GEMGRAWYEH GKQLEKTNTF TDFVDVARHL VDAGRAAPSK LVAMGGSAGG LLVGAVANLA PELFCGIVAD VPFVDPLTSI LDPSLPLTVG EWDEWGNPLE SAEVYRYMKA YSPYENVEAK AYPALLVTTS LNDTRVLPTE PAKWVAKLLD HTTSGEQILL KTEMVAGHAG VSGRYAKWRE TAFEYAWVLD RLGAAQ
|
| |