Gene Information |
Locus tag | Namu_4031 |
Symbol | |
ID | 8449650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 4445319 |
End bp | 4446560 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645043076 |
Product | alpha amylase catalytic region |
Protein accession | YP_003203312 |
Protein GI | 258654156 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
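The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 4445319
end_bp = 4446560

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1242 bp
print(protein_length, "aa")  # 413 aa
```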
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0102813 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACGTGA CGACGGATCG ATCCTTCGAA CAGGTCATCT GGTGGCAGGT ATATCCGCTG GGTTTTGGCG GCGCCCCGAT TCGGCAGGCC CACACCCCCG GTCACCGGCT GCGCCGGCTG CTCGGCTGGT TGGACGAGGT GGTGGAGCTG GGCTGCACCG GGCTGCTGCT CGGGCCGATC TGCGCGTCGG CGACGCACGG GTACGACAGC GTCGATCTGC GGCGGATCGA CCCGCGCCTG GGCACCGAGC ACGACTTCGA CGACCTGGTC GCCGGCTGCC GATCCCGGGG CCTGCGCCTG CTGCTGGACG GGGTGTTCAG CCACGTCGGC CGCGATCATC CGCTGGTCGC GCAGGCGCTC GCCGAAGGCC CCGACAGTGC CGCGGCCGCC CTGTTCGACA TCGACTGGTC CGACCCGGCC GACCCGCACC CACGGGTGTT CGAGGGCCAC GACACCCTGG TCCGGCTGAA CCACTCCGGT GACGCCGCCG CCGACTGGGT CACCGACGTG CTGATCCACT GGTTGGACCG CGGCGCCGAC GGCTGGCGGC TGGATGCGGC CTATTCGGTG CCGCCCCCGT TCTGGGCCCG CGTGCTGGCC GCCACTCGCG AGAAGCACCC CGACGCCTGG TTTCTCGGCG AGGTCATCCA CGGCGACTAC CCGGATTTCG TGACCCGCTC CACCGTCGAC TCGGTGACCC AGTACGAGCT GTGGAAGGCG ATCTGGTCCT CGCTCAAGGA CGGCAACTTC TTCGAGCTGG ACTGGACGCT GCAGCGGCAC AACGACTTCC TGGATCACTT CCGGCCCAAC ACCTTCATCG GCAACCACGA CGTCACCCGG ATCGCCTCTC AGGTCGGCCC GACGCTGGTG CCGGTCGCGC TGACGATCCT GCTCACCGTC GGCGGCATCC CGTCGATCTA CTACGGCGAC GAGCGCGGTT TCACCGGGGT CAAGCAGGAC CGGCTGGGTG GCGACGACGC GGTCCGGCCC GAGTACCCCG ACTCGCCCGC CGATCTGCCC CGCAACGATC TGTGGCGGAT GCACGCCGGG CTCATCGACG TGCGCCGGAG CCGGCCCTGG TTGGCCGGCG CGTCCACCGA ATCGCTGGAG CTGACCAACA CCCGCTACCG GTACCGCGCA TCCGGCGACG GCGAGCACCT GGACGTCGAG CTGGACCTGG ACCGGCCGTC GGTGCTGATC CGGGACGCCG GCGGCGGGAC GATCTGGGAG CACGCCGGCT GA
|
Protein sequence | MDVTTDRSFE QVIWWQVYPL GFGGAPIRQA HTPGHRLRRL LGWLDEVVEL GCTGLLLGPI CASATHGYDS VDLRRIDPRL GTEHDFDDLV AGCRSRGLRL LLDGVFSHVG RDHPLVAQAL AEGPDSAAAA LFDIDWSDPA DPHPRVFEGH DTLVRLNHSG DAAADWVTDV LIHWLDRGAD GWRLDAAYSV PPPFWARVLA ATREKHPDAW FLGEVIHGDY PDFVTRSTVD SVTQYELWKA IWSSLKDGNF FELDWTLQRH NDFLDHFRPN TFIGNHDVTR IASQVGPTLV PVALTILLTV GGIPSIYYGD ERGFTGVKQD RLGGDDAVRP EYPDSPADLP RNDLWRMHAG LIDVRRSRPW LAGASTESLE LTNTRYRYRA SGDGEHLDVE LDLDRPSVLI RDAGGGTIWE HAG
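The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). A minimal sketch of that translation is shown here, using only the standard library. The function name `translate_cds` is hypothetical, and the code assumes the input is a complete plus-strand CDS. Note that the gene starts with GTG, which table 11 allows as an initiation codon and which is therefore read as M rather than V at the first position.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any incomplete trailing codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        # An alternative start codon in first position still encodes Met.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence as listed in the record.
print(translate_cds("GTGGACGTGA CGACGGATCG ATCCTTCGAA"))  # MDVTTDRSFE
```

Passing the full Gene sequence field to the same function should reproduce the Protein sequence field, with the trailing TGA acting as the stop codon.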