Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_2781 |
Symbol | |
ID | 8448394 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 3045778 |
End bp | 3046953 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 645041874 |
Product | ROK family protein |
Protein accession | YP_003202116 |
Protein GI | 258652960 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00000000764975 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000282521 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACCTCC CAGAACCCTC CTCAGCCCGG CGGCATCGCT CTCGTCAGCA GCTGCTGGAG GTGATCCGAC GCGAGAACGG CGTCACCCGA GCCGACTTAA GCCTGATCAC CGGCCTGTCC CGGAGCGCGG TTGCGGAGAC CGTCCAGGAC CTGCTCAACG AACGCCTGAT CGCCGAGGAC GTGCTGGCCG CCGGCGGCCG GGGGGCCGGA CGGGGTCGCC CCTCGGCTCT GCTGGTCGCA TCCGGCGGGA CGGGGTCGGT GGTCGGCATC GACTTCGACC ACGAACGGGT GACGGTCGCC GTCGCCGGCA GTGACGGCAG CATTCGCGGC GAGGAGCACG CCGCGGTGAA TGTGGACAGC GAGGCCGCGG CGGCGCTGGA CGTGTCGGTG GGCATGGTGC ACCGGCTGCT GGGCCAGACC GGCACCAGCA TGTCCGACAT CCGCTCGATC GCGGCCGGGG TGCCGGCACC GCTGGACATG CGCACCAATC GCATCCACTC CGCGTCGGTG CTCACCGGCT GGGTCGGCCT GGACCCGGCC GAGGAGCTGT CCAACCGGCT GGGCCGGCCC GTCCTGATCG GTAACGACGC GGACCTGGGC GCGGTCGGCG AACTGCGGTA CGGCGCGGCC AAGGGCGCGC GGGACTTCAT CTACGTCAAG GCCTCCGAGG GCATCGGCGC CGGCCTGGTG CTCGGCGGGT CGGCTTACCA CGGCGCGACC GGCGCGGCCG GCGAGATCGG GCACACCCGG CTCGGCGAGC AGGGCACCTG GTGCCGCTGC GGCAACCGGG GCTGTCTGGA GACGGTGGTG TCCAGCACCC TGGTCCGCCG CCTGATGACC GAACTGGGCA TTCCCCGTGG TCGGGACGAG ACCTTCCCGC TGGCCGACGC GGCCAAGCAC CCGGTCACCG GACGTTTCAT CTCCGAGGCC GGCCGCACCC TCGGCCGGGT CCTGGCCGAC CTGTGCAACT GCCTGAACCC GTCGCTGATC GTGCTCGGCG GCGAGCTGGG CACCGCCGGC GAGCCGCTGG CCGACGGCGT GCGCGAGTCC ATCAACCGGT TCGCCCAGCC GGCCACCGCG GCCTCCCTGG AGGTCAAGGT CGGCGCCCTG GGGCTGCGGG CCGAGCTGCT GGGCGCGGTC AGCCTGGCCG GCCAGCACGC CCTGCTGGAG ATCTGA
|
Protein sequence | MNLPEPSSAR RHRSRQQLLE VIRRENGVTR ADLSLITGLS RSAVAETVQD LLNERLIAED VLAAGGRGAG RGRPSALLVA SGGTGSVVGI DFDHERVTVA VAGSDGSIRG EEHAAVNVDS EAAAALDVSV GMVHRLLGQT GTSMSDIRSI AAGVPAPLDM RTNRIHSASV LTGWVGLDPA EELSNRLGRP VLIGNDADLG AVGELRYGAA KGARDFIYVK ASEGIGAGLV LGGSAYHGAT GAAGEIGHTR LGEQGTWCRC GNRGCLETVV SSTLVRRLMT ELGIPRGRDE TFPLADAAKH PVTGRFISEA GRTLGRVLAD LCNCLNPSLI VLGGELGTAG EPLADGVRES INRFAQPATA ASLEVKVGAL GLRAELLGAV SLAGQHALLE I
|
| |