Gene Information |
Locus tag | Namu_1101 |
Symbol | |
ID | 8446697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 1221825 |
End bp | 1223858 |
Gene Length | 2034 bp |
Protein Length | 677 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645040238 |
Product | hypothetical protein |
Protein accession | YP_003200497 |
Protein GI | 258651341 |
COG category | |
COG ID | |
TIGRFAM ID | |
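The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon; both assumptions agree with the numbers in this record. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be greater than or equal to start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1221825, 1223858)
print(length)                     # 2034
print(protein_length_aa(length))  # 677
```

Both results match the Gene Length (2034 bp) and Protein Length (677 aa) fields.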
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.682215 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCTGCGT TGTCGGCAGT GGACCTTCGA CTCGGGCTGG CCGACGTCGC CGCACTCGCT CTTGTGCAAC GCCCGGTCGT CTCGGTATGG CGGACCCGTA GCACCGGTAC CGACCACCCG TTCCCCGCAC CCGTCGAGAC CCTGGACGGT CAGGAGTGGT TCGACGGTCG GCAGGTAGTC GAGTGGCTCG AGGCCACCGG TCGAGGCAAC AACCCCGAAG CACGAGCGGA TTTGGCCGCG TTTGCCAGCA TCGATGGCGG TTCGCCCGCC GGCGATCAGG AGGTCTTTTT CGGTCTGACC GCACTGCTGT GCTTGACGGC GATAGTCGGA AGCCCCTTCG GTGAGACGGC CGCCGGGGAC ATCCTGGACC TGGCTGACGA GGCCGACCCG GACAATGATC TGCTGTTCGG TGAGATCGAG GAGCTCGGCG GACGAATCGG CGCCCTGGCC CACTACTGCG ACCTGCTGGT CGACGCGGCC TACCACCCCG CTGCGGCGTT GGAGAAGCTG ATCGGCGAGC GATTCCGTCG CCATGTCCCC GCACAGAGCA CAGTGGCCTT GACTGGGCCG GCCCACGAAC TCATCGGGTC CGCCGCGATC GCACTGGCCG ACGGCGCCGG GCACGAGTCG CTGACCGTTG TCGACCCGAC CCCTGGCGGG AGCGATCTCC TCCTCGCGGT CGCCGAGCGG GCCGGCGCGC GTGACCTGAC GGTGTTCACC GGCCGATCAA ACGACGCGGC AGCCCGATTG GCCCGGCGCC GGCTGCGGGT ACACGGCGTT CATCGCGCGG ACGGAGCGGG TAGCGACGCT TCCGGCGACC AGCCGGACCA GGCGATCCTG CTCGCCCGAT ACCCGAGCCC GGGGCAGCCG GACGTGGCGG CGGCGGAGAT GCTCAACGCC GTCACCGATC TGCTTCTCGG CACCACCCCC CAGCAGCGAT TCCTGGTCGC AGCACCAGCC GCCGTGCTGA CCGATCGGCT CCGTGATCCC GGGGCGCGAA TGGCCAGGGA CCGGCTGATT CGCTTCCCGA CGCTGCGCGC GGTCGTGCGG CTGCCGGCCG GGCTGGTCGC GGCTAAACCG CGCGAGCGAC TGGCATTGAT GCTGTTCGGG CGTCGGCACG ACCCATCACC CGACGAGCCG CAGATGCTCG TCGGTGATCT GAGCAACGCC GCGCTCAGCA ACGCGGTGAT CGACGACCTG GTAACCGACC TCGTGGCCAG CACCTCGTCG CCCGCAATCG CACGAACACA CCCTTTCCGC TTCCTGCGCC CGGTCATGGT GCGCCGGGTC CTGGCCACCG CCGGTTCGAT CGTCGAGGTC GTCGGTCCCG GTCAGGCCCG GCCGGTCTCT GGGGCCGACT TGGTGCTGCG CATCACGGAG ATTGTTGAGG GCATCGCCCG CCCGCTGGCG GCGCCAACCG TGCCGGAGAT GGCGGTGGCG AGCCATCTCC AGAACCTCAT CCCGATCACC ACGCTCGGTG CGGCCAAGGA CCGCGGTGAC GTTCGGATCC TGGCCGGGTT GCGGCTGGAC ACCGGGCTCG GCCCCGGCGG GGTTATCCTC CTGGGCGAGT CGGAAGTGTG CGGCGCCGCC CGCGTGGGGG ATCGGACGGT CGATCGGCTC GCCGTGATTG CCCGCCATCC GGCGGTCCAG TTCACGGAGC CGGGCGACGT GGTCTTCACG AGCTCCCCGC GGCCGGGTGC GCTGGTCGAT ACCAACGGCA GCGCGGTCGT GACCTACCCC GCGCGGATTG CCCGCATCGC CCGCCCCGAC TCCGGGCTGG CCGCCCGCCT GCTCGCCGCC 
GACATCAACG CGCGGCCGGA AGGAGCGAAG GCCTGGCGCG GGTGGCCGCT GCGTCGGCTT CCGTCCGACC AGGCGACGGT GCTCGACCAG GCCCTCGCGG AGATCGAGGA GTATCGGGTC GACCTCGAAC GGCGCCGGCA CGACGCCGAG GATCTCGCCC GGCTGCTGGC CCGCGGTGCC ACCGACGGCG CTGTCACCCT GACCACCACA ACCGAACCGA CGAAGGGAAC TTGA
Protein sequence | MSALSAVDLR LGLADVAALA LVQRPVVSVW RTRSTGTDHP FPAPVETLDG QEWFDGRQVV EWLEATGRGN NPEARADLAA FASIDGGSPA GDQEVFFGLT ALLCLTAIVG SPFGETAAGD ILDLADEADP DNDLLFGEIE ELGGRIGALA HYCDLLVDAA YHPAAALEKL IGERFRRHVP AQSTVALTGP AHELIGSAAI ALADGAGHES LTVVDPTPGG SDLLLAVAER AGARDLTVFT GRSNDAAARL ARRRLRVHGV HRADGAGSDA SGDQPDQAIL LARYPSPGQP DVAAAEMLNA VTDLLLGTTP QQRFLVAAPA AVLTDRLRDP GARMARDRLI RFPTLRAVVR LPAGLVAAKP RERLALMLFG RRHDPSPDEP QMLVGDLSNA ALSNAVIDDL VTDLVASTSS PAIARTHPFR FLRPVMVRRV LATAGSIVEV VGPGQARPVS GADLVLRITE IVEGIARPLA APTVPEMAVA SHLQNLIPIT TLGAAKDRGD VRILAGLRLD TGLGPGGVIL LGESEVCGAA RVGDRTVDRL AVIARHPAVQ FTEPGDVVFT SSPRPGALVD TNGSAVVTYP ARIARIARPD SGLAARLLAA DINARPEGAK AWRGWPLRRL PSDQATVLDQ ALAEIEEYRV DLERRRHDAE DLARLLARGA TDGAVTLTTT TEPTKGT
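The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal Python sketch for stripping that formatting, computing GC content, and translating the DNA follows. It uses only the standard library; the helper names are hypothetical. The codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code (the tables differ in which codons may act as start codons), so a single lookup table is used here.

```python
import re

# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return re.sub(r"\s+", "", raw).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate codon by codon from position 0; unknown codons become 'X'.

    Note: alternative start codons (such as GTG or TTG) are not converted
    to M here; this record's gene starts with ATG, so that case does not arise.
    """
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if residue == "*" and to_stop:
            break
        protein.append(residue)
    return "".join(protein)


# First two blocks of the gene sequence shown above.
print(translate("ATGTCTGCGT TGTCGGCAGT"))  # MSALSA
```

The printed residues agree with the start of the protein sequence (MSALSAVDLR ...). Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.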