Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_2408 |
Symbol | |
ID | 8448019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 2658133 |
End bp | 2660424 |
Gene Length | 2292 bp |
Protein Length | 763 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 645041527 |
Product | hypothetical protein |
Protein accession | YP_003201771 |
Protein GI | 258652615 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0000507419 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000148317 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGCGATC CGGTGGTGCG GGTGCTCGGC ATCCGGCACC ACGGGCCGGG TTCGGCCCGG GCCGTGCTGG CCACGTTGAC GCAGTGGCAG CCGGACCGGG TGCTGATCGA AGGTCCGCCG GAGGCCGACG CGCTGGTCGG CTTCGCCGCC GACCCCGACC TGGTGCCGCC GGTGGCGTTG CTGGCCTACC GGCAGGACGA TCCGGCCGCG TCCGCGTTCT GGCCGCTGGC CGTGTTCTCC CCGGAGTGGC AGGCGATGAG CTGGGCGGCG CGCCGGGGCG TGCCGGTCGC CTTCATGGAT CTGCCCGCGG CGATGCTGCT GGCCCGGGAC CGGCCCGATC CGGCCGCCGA ATCCGCCGGC ACCGGCGATC GGGACGACGG GACAGCCGGT CCTGATGAGG AGAGCGTCCC GACGGCCGAC GAGCCGACCG GCGCGGCGCC GACCGTGCGC ACCGACCCCA TCGCGCTACT GGCCCGGGCC GCGGGGTACG ACGACCCGGA ACGGTGGTGG GAGGACGTGA TCGAAAGTCG CCGGACCGTC GCCGGCGCGG ACGGCCCGTT CGAGGCGGTC ACCGAAGCGA TGACCGCGGT GCGGGCGGAC CGGCCCGAGA CCGACGAGGT GACTTTGCGC CGAGAGGCGC ACATGCGCAA GGTGCTTCGC GCCCAGCTCA AGACCGGGCC GCAGCGGGTG GCGGTGGTCT GCGGCGCCTG GCACGCGCCG GCGCTGGCCG GCACGCTGCC CGGCAAGGCC CCGACCGCCA CCGCCGACGC GGCCCTGCTG CGGGGGCGGC CGACCGCGAA GGTCACTCTC ACCTGGGTGC CCTGGACCCA CTCCCGGCTG GTCGCCGCGT CCGGGTACGG CGCCGGCATC GACTCACCCG GCTGGTACCG GCACCTGTTC GTCACCGCCG AACACCCGCT GGAACGCTGG ATGACGCTGA CCGCCGGCGT GTTGCGCCGC CGCGACCTGC CCACCTCGAC CGCGCACGTC ATCGAGGCCG TCCGGCTGGC CCGGACCCTG GCCCACCTGC GGGACCGCCC GGAGCCCGGC CTGAGCGAAG TCATCGACGC CACCCGCGCG GTGCTGTGCG AGGGCAGCGA GCTGGCCGTG GATTTCGTCC TGCGGGACGC GGTGGTCGGC GAGGAGCTGG GCCGGGTCCC CGACGCGGCG CCGACGGTGC CGCTGGATGC GGACCTGCGA GCGACCGCGC GAGCGCTGCG GCTCAAGTTC GACGCCGTCG CGCGGGAGGT CACCCTCGAC CTGCGCGCGG CCACCGACCT CCGCAAATCC CACCTGTGGT GGCGGCTGGG CATCCTGGGC GTCGACTGGG CCGACCCCGC GGAGGTCGCC GGCACCGGCA CCTTCAAGGA GGCCTGGACC CTGGCCTGGC GGCCGGAGCT GGCGGTCCGG TTGATCGAGG CATCCGTCTG GGGCACCACC ATCCCGGCGG CCGCCGCTCG CCGGCTGGTC GACCGGGCCA CCACCCTGGC CGCGTGTACC GCGGCCATCG CCGACTGCAT CACCGCCGAT CTGCCGGCCG CGATGACCGA CCTGCTGGAC CGGCTGGATC GGCTGGCGGC CGGGACGGCA GACGTCAGCG CGCTGCTCGA GGCGCTGCCG GCGCTGGTCC GCGCCCAGCG GTACGGCACC GTCCGCGGCT CGGACACCGC CGCGGTGGCC ACGGTCGCCC AGGCCGTGCT CATCCGGATC AGCGCGGGCC TGCCGGCGGC GCTGGGCGGG CTCGGCCCGG ATGCGGCCCG GGAGATCCGC CAGCCACTGG AACGCACCCA CGAGGTGGTG CCGCTGTTGC CCGAGGGACC GGCCCGGGAC GGCTGGTACC GCGCCTTGGC CCAGGCCGGC GAACGGCACG ACCTGCCGGC GCTGCTGGCC GGCCGCATCG TCCGGCTGCT CATGGACGCC GGCCTGATGC CCCGCCCCGA GGCCGCCGAC CGGCTGCACG CGGCCCTGTC GGGCGGCCCG ACCGCGGCCG AGAAGGCGGA ATGGGCCGAG GGTTTCACCG CCGGCGGAGC CTTGCTGCTG ATCCACGACG ACGCCCTGCT GGCCGTGCTG GACCGTTGGG TGCGCAGCCT GACCGACGAG GAGTTCCTGC AGGTCCTGCC GTTGCTGCGG CGCGGCTTCG GCACCTTCGC GCCGGCCGAA CGAGGCAACC TGCTGCTCGC CGCTCGCAGC CTGTCCGGGT CCGGCGGCGC CCCCACCCAC GGCCCGGGCG TCGACCTGAG CCGGGCCGGG CCGGTGCTGG CGACGGCTCG ACGGCTGCTG GGGACGAGCT GA
|
Protein sequence | MSDPVVRVLG IRHHGPGSAR AVLATLTQWQ PDRVLIEGPP EADALVGFAA DPDLVPPVAL LAYRQDDPAA SAFWPLAVFS PEWQAMSWAA RRGVPVAFMD LPAAMLLARD RPDPAAESAG TGDRDDGTAG PDEESVPTAD EPTGAAPTVR TDPIALLARA AGYDDPERWW EDVIESRRTV AGADGPFEAV TEAMTAVRAD RPETDEVTLR REAHMRKVLR AQLKTGPQRV AVVCGAWHAP ALAGTLPGKA PTATADAALL RGRPTAKVTL TWVPWTHSRL VAASGYGAGI DSPGWYRHLF VTAEHPLERW MTLTAGVLRR RDLPTSTAHV IEAVRLARTL AHLRDRPEPG LSEVIDATRA VLCEGSELAV DFVLRDAVVG EELGRVPDAA PTVPLDADLR ATARALRLKF DAVAREVTLD LRAATDLRKS HLWWRLGILG VDWADPAEVA GTGTFKEAWT LAWRPELAVR LIEASVWGTT IPAAAARRLV DRATTLAACT AAIADCITAD LPAAMTDLLD RLDRLAAGTA DVSALLEALP ALVRAQRYGT VRGSDTAAVA TVAQAVLIRI SAGLPAALGG LGPDAAREIR QPLERTHEVV PLLPEGPARD GWYRALAQAG ERHDLPALLA GRIVRLLMDA GLMPRPEAAD RLHAALSGGP TAAEKAEWAE GFTAGGALLL IHDDALLAVL DRWVRSLTDE EFLQVLPLLR RGFGTFAPAE RGNLLLAARS LSGSGGAPTH GPGVDLSRAG PVLATARRLL GTS
|
| |