Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_3690 |
Symbol | |
ID | 8449309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 4047598 |
End bp | 4049379 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645042754 |
Product | glycoside hydrolase 15-related |
Protein accession | YP_003202990 |
Protein GI | 258653834 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00190476 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0841535 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGGGT GCCCCGAACC GGATGCCGGT ACGGACCAGT TCGTACCGCA GGTGCTGCGC GAATACGCCC TGCTGGCCGA CGGAGAACGC GGCGCCATGC TCGGCCCCCG CGGCGACATC GTATGGATGT GCGCCCCGCG GTGGGACAGT GACGCCGTCT TCTCCGCGCT GCTCGACGGC CCTGGCGGTT ACTCGATCAC CCCGGTGGAC CGGTTCGTCT GGGGCGGCTT CTACGAAGAG GGCAGCATGA TCTGGCGCAG CCGCTGGGTC ACAAATCAGG GCATCATCGA GTGCCGAGAG GCGCTCGCGT TTCCCGGTGA CCCTCATCGA GTGGTGCTGC TGCGTCGCGT CCTGGCCGTC GACGGGGGCG GCCAGGTCCA AATCACGCTC GCCCCGCGGG CAGGATTCGG TCGGCACGGG CTCACCCGGC TGCACGGCGC GGACGGCACC TGGACCGGCC GGAGCGGCCC GTTGCACGTG CGATGGACCG GCGCCCCCGC CGACACCCGA CCGGTGGACC GGCGCCACGC GCTGACCGGG CAGCTGACGG TGCCCACCGG CGCACACCAC GACCTCATCC TGGAGATCAG CGACCAGGCC CTCCCGGACC GGCCCCCCGC CCCCGAGGCT ATGTGGGAGG CCACCGAAAC CGCCTGGCAC ACAGCAGTTC CCGACCTCGT CAGCTGCCTC GAACCGAAGG ATGCCCGCCG CTCCTACGTC GTGCTCCGCG GGTTGACCTC GGCCAGCGGC GGCATGGTGG CCGCCGCCAC CACCAGCCTG CCCGAACGCG CCGAGGCCGG CCGGAACTAC GACTACCGGT ACGTTTGGAT CCGCGACCAG TGCTACGCCG GGCAGGCCGC CGCCACGGCC GGGACGCCGC CCCTGCTCGA CGACGCCGTC CGTTTCGTCA GCGCCCGGAT CCTGGACCAC GGGCCCGACC TGAGACCCGC CTACACCACC GGCGGCGCAC CGGTGCCGGA CCAGCGGACC CTGAACCTGC CCGGATACCC CGGCGGCAAG AACCTGATCG GGAACTGGGC CAACCAACAA TTCCAACTCG ACGCGTTCGG CGAATCGCTG CTGCTGCTCG CCGCGGCCGG CCGGGCCGAC CGGCTGGACA CCGACCACTG GAAAGCCGCC ATCGTCGCCG CCGACGCCAT CAGCGCACGG TGGACCGAAC CCGACGCCGG GATCTGGGAG ATCGACAACC AACCCTGGAC GCACAGCCGG CTCACCTGCG CCGCCGGGCT GCGGGCCATC GCTCGGATCC CGCACGCCGG GCCCGCCGCG GTCGACTGGC TCGCCCTGGC CGACCGGATC ACCGCCGACA CCGCCACCAA CGCGGTGCAT CCCAGCGGGC GGTGGCAACG CTCCCCCCGG GACCCGGCGC TCGACGCCGC GCTGCTGCTG CCGCCGCTCC GCGGTGGCAT CGACCCGGCT GATCCCCGCA CCATCCGCAC CCTCGACGGG TACCTGACCG ACCTCACCCG CGACGGGTAC GCCTATCGAT TCCGGCACGA CGACCGGCCC CTGGCCGACG CCGAAGGTTC CTTCACTCTG TGCGGATTCC TCGTCGCCCT GGCGCTGCAC CAACAGCACC GGCCCGTCGA GGCGGCCCGC TGGTACGAAC GGACCCGCGC GTGCGCCGGA CCGGCCGAGC TGTACTCCGA GGAGTTCGAC GTCCACCAAC ACCAACTGCG GGGCAACCTC CCGCAGGCCT TCGTACACGC CCTGCACCTG GAAGCCGCCG CCCGCCTGGC CGACCCGCCG GACCACCCCT GA
|
Protein sequence | MSGCPEPDAG TDQFVPQVLR EYALLADGER GAMLGPRGDI VWMCAPRWDS DAVFSALLDG PGGYSITPVD RFVWGGFYEE GSMIWRSRWV TNQGIIECRE ALAFPGDPHR VVLLRRVLAV DGGGQVQITL APRAGFGRHG LTRLHGADGT WTGRSGPLHV RWTGAPADTR PVDRRHALTG QLTVPTGAHH DLILEISDQA LPDRPPAPEA MWEATETAWH TAVPDLVSCL EPKDARRSYV VLRGLTSASG GMVAAATTSL PERAEAGRNY DYRYVWIRDQ CYAGQAAATA GTPPLLDDAV RFVSARILDH GPDLRPAYTT GGAPVPDQRT LNLPGYPGGK NLIGNWANQQ FQLDAFGESL LLLAAAGRAD RLDTDHWKAA IVAADAISAR WTEPDAGIWE IDNQPWTHSR LTCAAGLRAI ARIPHAGPAA VDWLALADRI TADTATNAVH PSGRWQRSPR DPALDAALLL PPLRGGIDPA DPRTIRTLDG YLTDLTRDGY AYRFRHDDRP LADAEGSFTL CGFLVALALH QQHRPVEAAR WYERTRACAG PAELYSEEFD VHQHQLRGNL PQAFVHALHL EAAARLADPP DHP
|
| |