Gene Information |
Locus tag | Namu_1776 |
Symbol | |
ID | 8447378 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 1947622 |
End bp | 1949778 |
Gene Length | 2157 bp |
Protein Length | 718 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645040902 |
Product | glycoside hydrolase clan GH-D |
Protein accession | YP_003201155 |
Protein GI | 258651999 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
|
|
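The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon; the record does not state either convention explicitly. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(1947622, 1949778)
print(length_bp)                  # 2157
print(protein_length(length_bp))  # 718
```

Under those assumptions, both results agree with the Gene Length (2157 bp) and Protein Length (718 aa) fields of the record.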
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.191926 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0421989 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGAGC TGGTCCATGA CGATCTGGCG CACCTCTGTG CCGGCGGTGT CAGCGTCGTC CTGGATACAG CGGGAGGCGG TCTCCCGACG GTCCTGTACT GGGGACCCGC TTTGAGTGAA CTGTCGTCGG CAGATCTGGT CGAGCTCCGG CGAGCAACTC AGGCTCCGTT CGGCGATAGC CGCATCGACG TCCCCGAACG GGTGAGTGTG CTTCCCACCC CCGCCGAAGG GTGGGTCGGC CGCCCGGGAA TCGCCGGAAG CTCGAATGGA CAGGCATTCT CACCGCTGTT CCGACTTCTT GTGACCGAGA TTGTCGAACG GAGCGAGACG GTGGCGACCG GTTTTCGCTA TCACGCCGCC GACCCGACGG CCGAGCTGGA CCTGGCCCTG GACGTCGAGT TGACCCACTC CGGACTGGTG CGGCTGCGGG CCGCGTTGAT CAATCAGGGG TCATCGACCT ATCAGCTGGA CGGCCTGGTG CTGGCACTGC CGGTGCCGAC CCGGGCCGAT GAGCTGTTCG ACTTCACCGG TCGGCATACC CGGGAGCGGT CGCCGCAGCG CTCACCCTTC CAGGTCGGTA CCCACGCTCG CGAGTCCCGG CAGGGACGGC CCGGATTGGA CTCTCCGTTC CTGCTGGCCG CGGGGACGAC CGGCTTCGGT TGGGAGTCCG GTCAGCTCTG GGGATTGCAC GTCGGCTGGT CCGGCAACCA GGTCAGCTAT GCCGAGCGGA TGTACAACGG GCTGAAGATG ATTGGCGGCG GAGAGCTGAT CCTGCCGGGC GAGGTACGGC TTGAACGTGG CGAAGGTTAC GAATCACCCT GGCTCTACGG TGTCTTCGGC AGCGGTCTGA ACGAGGTCAG TCGGCGGTTC CATCGGTTCC TGCGCTCGCG GCCCAGGCAT CCGCGGACCG AGCGGCCCGT GCTGGTCAAC ACCTGGGAAG CGGCCTACTT CGACCACGAT CTGCCGCGGC TGCTGGAATT GGCCGAGCGC GCTGCCCACG TCGGGGTGGA GCGATTCGTG CTCGACGATG GCTGGTTCCT CGGCCGTCGG CATGACTCGG CCGGTCTGGG CGACTGGCAG GTCGACCCCA CCGTCTGGCC CAACGGCCTC AAACCGCTGA TTCATCGTGT CGAAGAGCTA GGCATGCAGT TCGGCCTGTG GGTCGAGCCG GAGATGATCA ACCTTGATTC CGAGCTCGCT CGTGAACATC CGGAGTGGAT CTTCCGCGCC GGTGGACGGG AGGGCATCGC CACTCGGCAA CAGCATGTGC TTGATCTGGG TCACCCGGAG GCCTACGCAC ATATCGCGGG GTGCCTGCAC GCACTGTTGA ACGAGTACAA CATCGGCTAC CTGAAGTGGG ACCACAACCG GATGGTGGTC GAGGCGGGTC ATTGGCCCAC CGGCGTTCCG GGCGTGCATC GCCATACCTT GGCGGTCTAT CGGTTGATGG ACGAACTCCG AGCGGCTCAT CCCGGTCTGG AGATCGAGTC GTGCGCGGGC GGCGGCGGCC GGATCGACCT GGAGATCCTC AACCGGACCG ACCGGGTCTG GCCCAGCGAC TGCATCGATG CCCTTGAGCG CCAACAGATC CAGCGGTACA CCCAGCTCCT CCTCCCGCCG GAGCTGGTGG GCACGCACTT GGGTGACGCG GAGGCCCACT CGACCCGGCG TCGGCACCAC CTCGGATTCC GGGCGGCGGC CGCCATCTGG GGGCACATGG GCATCGAGTG GGACCTGACC TCGACCAGTC CCGGCAAGCT GGACCAAGTG CGTCGCTGGG TCGAGTTGCA CAAGCAGTTG 
CGCCCTCTGC TCCATTCCGG GGACGTTGTC GTCGGCGACC ATCCCGATCC GGCCGTATGG ATCAACGGTG TGGTCGCGGT GGACCAGTCC GACGCCGTCT TTGGCATCAC CACCGTCGGT CGATCGGTCA CCTTCCCGCC GGGCCGGGTC AGTCTTCCCG GCCTGGATCC GGTCAAACGC TATCGGGTGC AGCCCCTTCC ACCGTCCGAC CACTACCCCG GCACCAACCA GTATCCCGGT TGGTGGGACG AAGGCGTCGT CCTGTCGGGT CGAACCCTGC GCGAAGTCGG CGTGCAGATC CCCGCGATGT TCCCCGAGTA CACCCATATC CTCCGCGCCC GCGCCGTGTC GGCGTGA
|
Protein sequence | MPELVHDDLA HLCAGGVSVV LDTAGGGLPT VLYWGPALSE LSSADLVELR RATQAPFGDS RIDVPERVSV LPTPAEGWVG RPGIAGSSNG QAFSPLFRLL VTEIVERSET VATGFRYHAA DPTAELDLAL DVELTHSGLV RLRAALINQG SSTYQLDGLV LALPVPTRAD ELFDFTGRHT RERSPQRSPF QVGTHARESR QGRPGLDSPF LLAAGTTGFG WESGQLWGLH VGWSGNQVSY AERMYNGLKM IGGGELILPG EVRLERGEGY ESPWLYGVFG SGLNEVSRRF HRFLRSRPRH PRTERPVLVN TWEAAYFDHD LPRLLELAER AAHVGVERFV LDDGWFLGRR HDSAGLGDWQ VDPTVWPNGL KPLIHRVEEL GMQFGLWVEP EMINLDSELA REHPEWIFRA GGREGIATRQ QHVLDLGHPE AYAHIAGCLH ALLNEYNIGY LKWDHNRMVV EAGHWPTGVP GVHRHTLAVY RLMDELRAAH PGLEIESCAG GGGRIDLEIL NRTDRVWPSD CIDALERQQI QRYTQLLLPP ELVGTHLGDA EAHSTRRRHH LGFRAAAAIW GHMGIEWDLT STSPGKLDQV RRWVELHKQL RPLLHSGDVV VGDHPDPAVW INGVVAVDQS DAVFGITTVG RSVTFPPGRV SLPGLDPVKR YRVQPLPPSD HYPGTNQYPG WWDEGVVLSG RTLREVGVQI PAMFPEYTHI LRARAVSA
|
| |
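The listed gene sequence begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. A minimal sketch of how the sequence fields could be checked against each other follows. It assumes the space-separated 10-base blocks are purely cosmetic, and it uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in its permitted start codons, which this sketch does not handle). All names are hypothetical.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in the record.
prefix = "ATGCCCGAGC TGGTCCATGA CGATCTGGCG"
print(translate(prefix))  # MPELVHDDLA
```

The translated prefix matches the first ten residues of the Protein sequence field. Passing the full Gene sequence value to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be compared in the same way.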