Gene Information |
Locus tag | Namu_2284 |
Symbol | |
ID | 8447895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 2519020 |
End bp | 2520240 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645041406 |
Product | protein of unknown function UPF0118 |
Protein accession | YP_003201650 |
Protein GI | 258652494 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
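
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends in a stop codon that is not counted in the protein length; the record does not state either convention, and the function names are illustrative.

```python
# Values copied from the Gene Information table.
START_BP = 2519020
END_BP = 2520240
PROTEIN_LENGTH_AA = 406


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1221
print(expected_protein_length(length_bp))  # 406
```

Under those assumptions the computed values agree with the Gene Length (1221 bp) and Protein Length (406 aa) fields.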
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.00112442 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00386241 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCAGGA CGAAATCCGG GCAGCCGAGC GAGCCGGTCG CGCACCATCC GCGGGTCGGA CAGCCGCACG ACGCCGCCGC GGAGGGTCCG ATTGCCCAGG CCGAGGCGGT CGCCGCCACG CTGCGCACGC CGGCCGACCC GCTCGGGCCG CTGGGCCGCC GGCTGAACTG GCGCTCGGCC TTCCTGATCG GGCTGGCCGC CACCGCCGGG GTCGCGGTGA CGGTCGGGAT CATCCAGATG CTGCTGCTGG CCGGTCAGGT GCTGCTGCTG ATCGCGCTGG CCCTGTTCCT GGCCATCGGC CTGGAACCGG CCGTGTCCTG GCTGATCACC CACCGTTTCC CGCGCTGGGC CGCGGTGTTC ACCGTGCTGG CCGGGCTGAC CCTGCTGCTG GCCGGGTTCA TCGCCGCCGC CATCCCGGCC CTGGTCGAGC AGGGTCGGCG GCTGGTCGAG GCGGCCCCGC AGTACGTGGC CCAACTCAGC GACGACAGCT CGGCCATCGG CCAGCTCAAC CAGCGCTTCC ACCTGCAGGA GACGGTGCAG CAGATCGTCG ACGGCGGCGG CCCGGGCCTG GCCTCCGGGG TGATCAGCGT CGGCGAGGCC GTCTTCGGTG CGTTCTCCGG TCTGCTGGTG GTGGCCGTGC TCACGGTCTA CTTCCTGGCC GACATGCCCC GCGTGCGCAC CACCCTGTAC CGGTTCATGC CGGCGCCCCG GCGGCCCCGG GCGATCCTGC TGGGGGACCA GATCATGGTC AAGGTCGGCG GGTACGTGCT GGGCAACGTG GTCATCTCGG TGATCTCGGC GGTGGTCACC TTCGTCTGGC TGATCGCATT CGGCGTGCCC TACCCGCTGT TGCTGGCCAT CCTGTTCGCC CTGCTCGACC TGATCCCGGT GATCGGATCG CTGATCGCCG GAGCCCTGGT CGCCCTGGCC GCGTTCAGCG TGTCGGTGCC GGTCGGGCTG GCCACCATCG GCTACTTCGT GGCCTACAAG CTGGTCGAGG ACTACCTGCT CACGCCAAAG GTGTTCGGCC GGGTGCTGCG GATGCCCGCG CTGGTCACCG TCTGCGCGAT CCTCATCGGC GGCGCCCTGC TCGGGCTGGT CGGCGCGCTC GTCGCGCTGC CCACGGCGGC CGCGATCATG CTGCTGGTGC AGGAGGTGGT GTTCCCGCGC CTGGATCGCG CCCATGCCAG CGGCGAGCCG GCAGCTGACC CGGCGGCCTG A
|
Protein sequence | MTRTKSGQPS EPVAHHPRVG QPHDAAAEGP IAQAEAVAAT LRTPADPLGP LGRRLNWRSA FLIGLAATAG VAVTVGIIQM LLLAGQVLLL IALALFLAIG LEPAVSWLIT HRFPRWAAVF TVLAGLTLLL AGFIAAAIPA LVEQGRRLVE AAPQYVAQLS DDSSAIGQLN QRFHLQETVQ QIVDGGGPGL ASGVISVGEA VFGAFSGLLV VAVLTVYFLA DMPRVRTTLY RFMPAPRRPR AILLGDQIMV KVGGYVLGNV VISVISAVVT FVWLIAFGVP YPLLLAILFA LLDLIPVIGS LIAGALVALA AFSVSVPVGL ATIGYFVAYK LVEDYLLTPK VFGRVLRMPA LVTVCAILIG GALLGLVGAL VALPTAAAIM LLVQEVVFPR LDRAHASGEP AADPAA
|
| |
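
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any analysis. The following is a minimal sketch, not part of the record, of how the gene sequence could be cleaned, its GC content computed, and its translation compared with the listed protein. The helper names are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs in the start codons it permits); only the first three blocks of the gene sequence are used here as a small example.

```python
# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks between blocks and upper-case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
FIRST_BLOCKS = "ATGACCAGGA CGAAATCCGG GCAGCCGAGC"
fragment = clean_sequence(FIRST_BLOCKS)
print(len(fragment))        # 30
print(translate(fragment))  # MTRTKSGQPS
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison against the GC content and Protein sequence fields.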