Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_1578 |
Symbol | |
ID | 8447176 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 1738761 |
End bp | 1740374 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645040705 |
Product | CBS domain containing protein |
Protein accession | YP_003200962 |
Protein GI | 258651806 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.110465 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.116098 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTGTTGC TGGTGATCGC GGCTGCTCTG GTGCCGATCG CCGGCCTGCT GGCCGCGGCC GACGCCGCGA TCACCATGGT CTCGCCGGCC CGGTTGGAGG AGCTGGCCCG CGAGGACCGG CGCGGCGCGT CGTCGCTGCT GACCATCACC GAGGACCGGC CCCGGTACAC CAACCTGCTG TTGCTGCTGA GGATCACCGC CGAGATCACC GCGACCGTGA TGGTGGCCAA GGTCGCGTTG ACCACCCTGG GCTTCCAGTT CTGGGTCGGC CTGCTGCTGG TGGCGGTGAT GGTCGTGGTC ACCTACGTGG GGATCGGCGT GCTGCCGCGC ACCATCGGCC GCCAGCACCC CTACCCGGTG GGTCTGGCCC TGGCCGGCCC GACCCGGGCG CTGGCCCGGC TGCTCGCGCC GGTCGCCTCG CTGCTGATCC TGATCGGCAA TGCGATCACC CCCGGCCGCG GCTTCCGGGA GGGCCCGTTC TCCTCCGACA TCGAGCTGCG GGAGCTGGTC GACATCGCCG GCAGCCGCGG CGTGGTCGAG GAGACCGAGC GGGAGATGCT GCAGAGTGTC TTCGATCTGG GCGACACCAT CGTGCGCGAG GTGATGGTGC CGCGCACCGA GACGGTGTGG ATCGAACGGG ACAAGACGCT GCGCCAGGCG CTGGCCCTGG CCAGCCGCTC GGGCATGTCC CGGATCCCCG TGGTCGGCGA GGACCTGGAC GACATCGTCG GCGTCGCCTA CCTCAAGGAC CTGATCGCGC CGGCGATGAA CCTGGCCCCG GACGACCAGG GTCCCGTGCT GACCCAGATC ATGCGGGAGC CGGTGTTCGT GCCCGAGTCC AAGAACGTCG ACGACCTGCT GCGCGAGATG CAGCGCGATC GGACCCACTT CGCCGTCGTG GTGGACGAGT ACGGCGGCAC CGCCGGCATC GTCACCATCG AGGACATCCT GGAGGAGATC GTCGGCGAGA TCACCGACGA GTACGACGCG GACACCCCGG CCCCGGTGGT GCCGCTGCCC GACGGCTCGT TCCGGGTCTC GGCCCGGTTG CCGGTCGAGG ACCTGGGAGA GCTGTTCGAC GTCGAGCTCG ACGACGACGA GGTGGACACC GTCGGTGGGC TGCTGGCCCA GCAGCTCGGC CGGGTCCCGC TGCCCGGCTC GGAGGTCGGC GTCGCCGGGC TGTTCCTGCA CGGGGAACCC GGGGTGGACC GCCGGGGCCG GCCGCGGGTG CAGACCGTGC TGGTGCGCCG GCTGACCGCG GCCGAGCTGG CCCAGGAGCA GGCCGAACGC GACCGCGAGG TGCGGGAGCG GGAGGACCAG CGGCGGGCGC AGGAGCAGGA ACCGGAACGG CCCGCGGTCG AGGAGTCCGA CGCCGAACCG GCCGAGGCCA AGCGGGCCAA ACCGCGCAAG AAAAAGAAGA AGGCCAAACC GCCGGCCGCC GACGAGTCCG CCGAGCCCGG GGCGACGGAT TCCCCGGGTG ACTCCCCGCC GGACTCGCCG GCCGGCGCCC CGCCGGGCCG CTCGACGCCG GCGGACCCGC CGCTGCTCGG ACCGCTGATC GAGACCAGGA ACCAGCCAGT GCACGATTCG ACGGAACGGA ACCGAGCCGG ATGA
|
Protein sequence | MLLLVIAAAL VPIAGLLAAA DAAITMVSPA RLEELAREDR RGASSLLTIT EDRPRYTNLL LLLRITAEIT ATVMVAKVAL TTLGFQFWVG LLLVAVMVVV TYVGIGVLPR TIGRQHPYPV GLALAGPTRA LARLLAPVAS LLILIGNAIT PGRGFREGPF SSDIELRELV DIAGSRGVVE ETEREMLQSV FDLGDTIVRE VMVPRTETVW IERDKTLRQA LALASRSGMS RIPVVGEDLD DIVGVAYLKD LIAPAMNLAP DDQGPVLTQI MREPVFVPES KNVDDLLREM QRDRTHFAVV VDEYGGTAGI VTIEDILEEI VGEITDEYDA DTPAPVVPLP DGSFRVSARL PVEDLGELFD VELDDDEVDT VGGLLAQQLG RVPLPGSEVG VAGLFLHGEP GVDRRGRPRV QTVLVRRLTA AELAQEQAER DREVREREDQ RRAQEQEPER PAVEESDAEP AEAKRAKPRK KKKKAKPPAA DESAEPGATD SPGDSPPDSP AGAPPGRSTP ADPPLLGPLI ETRNQPVHDS TERNRAG
|
| |