Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_1788 |
Symbol | |
ID | 8447392 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 1963680 |
End bp | 1966562 |
Gene Length | 2883 bp |
Protein Length | 960 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 645040916 |
Product | hypothetical protein |
Protein accession | YP_003201167 |
Protein GI | 258652011 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0736583 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.105048 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCAAC GCAACCCTGC GCCGGTCGAT CGGGGTGCCC CGGCCACCGC GCGCCGGAGG CACCCCGGTC GTCGGCCCGG CCGCGGACTG GCTCACCCCG GCGTGGCCCT GAGCCTGCTG GTGGCGGCGG TCGTCGCGAG CGGGTCCGCG ATCGGCGCGC TGCTCGCCGG CGGCTCGCTG GGCCGTGACA TCGACACGTT GGCCGGACAT CCCCGGACCG CGGTCGCCGC CGACTGGATC GTCATCGTGG GGTTCGGTCT CACGCTGGTC CTGGGCACGG TGCTGGCCAC ACAGGTCGTC TTCGACCCGC ACTGCCGCGC GATCGCCGGC TTCGCCCGGC TGGCGGCCGT CCTCGCGGTC GTCGGCGAGT GCGCCGGCAT CCTCGCCTCG GGGCTCGACC AGCCGGTCGC CGCCGCGGTG GCCAGCGTCC TCACCGTCGG CGCGGCCCTA CCGGCCAGTC TGGTGGCGAT CACCGGACTC GGGTTGACGG CGGTGCGGCT GGGCGCGCAC GTCAGGACGG ACGGCCCGGG GCTGGCCGCG GAACCGGGAC CGACCGTCCC GACGGGCCCG GCCGAGGCCG GTCCCCGGAC CCGGTGGCGC CGCGGGTACG ACACCGGCCG GGGCCCGGAC CTGCAGGCCC TGGACGCCGC CGTCGGCGAC CGGATCGGCG TCGGCCTGTC CGGGGGCGGG ATCCGCGCCG CCAGCGTCAT GCTCGGCGCC TTCCAGGACC CGGACATGCG CGAGAAGGTG CTCAACCGCG CGCGGTACCT GGTGGCCGTC TCCGGCGGCG GATTCGTCGC GGGGGCCTTT CAGCAGGCGC TGACCGCGGC CGGGCCGGCC CCCACCGGCG ACGAGACGGT CCTGCGAGAT CCTCGTCTGG CCTTCGTCGA GGGTAGCCCC GAGGAGGACC ATATCCGCAG GCACGCAAGC TATCTCGCGG CCAACCCGGT CGAGGTGCTG GTCGCCCTCG GGCTGCTGAT GCGGCACCTG CTGCTGAATC TGGTGCTGCT GTTCGGCCCG GCGATCGCGC TCGGTGTCCT GGCCGGCTGC TTCCTCAACC GGGTCCCCCT CACCTCACTG CGGGTCGCCG CGGACGGGAC GACCACGCGC CTGGAGTTCG CCGCGGAGGC CCGCACCACC TGGATCGCGA TCGCCGCCGT CGCGCTGCTC GTTGTGCTGA GCTGGCTGGG TGCGCAGTGG GCGGCCGCCC ACTGCGATCC GTCCGGCCGA CCCGGCCTTG CCCACCGGCT GCGTCACGCG CTGAACTGGG CGACTCCCGC CATCGGCCTG GTGTTCTGGT CCGTCGTTGC GCTCACTCTG GGCCTGCCCG CGGTGATCTG GGCGGCGGCC TGGATCCTGA GCCTGGGCGA TCGGACGGCC GCCCTGGGCG GCTCGGTCGG AGCCGTGCTC ATCACCTACG GCGCGAGCCT GGCCGCCCTG CTCTGGCGAC ATCGCAAGTA CCTCACCGAT GATTCGCCGC AGATCCCGCA GGCCGTGCCC CGCGGCGTCG GCCAGATCCT GCTGGTCATC CTGGCTTTGG GCGCATTGGC GGCATGCTGG CTGCTGTTGT TCGGCGGCGC GGCGACGACG GTCCTGGGCG CCGACCACAC GGGCGACTGG GTCGGCGCCG TCCTGGTGTT GCTCGTCGTC GTGTTCCTGG GTGGTTTCGT CGACGAGACC ACGTTGAGCC TGCACCCGTT CTACCGCCGG CGACTGGCCC GGACGTTCGC CGTCCGGTCG GTCCGGCGGC CCGGCGAGCC CGCTCTGGCG GTGCCGTTGG CACCGGCCGA GCGCACCACG CTGTCCCGCT ACGGCGTCGT CGCGCCGGCC GTGGCCCGGC AGTTTCCCGA GGTCATCTTC GCCGCCTCGG CCACGGTCGG CGACGGTCGC ACCCCGGCCG GCGCCAACCG GGTCTCCTAC ACCTTCAGCA GCGAGTTCGT CGGTGGGCCG GAAGTCGGTT ATGTGCCGAC CAGCAAGTTG GAGCGGCAGC TGTCCCCGCG GCTGCAACGC GACCTCACCG TGCAGGGTGC GGTCGCTCTG AGCGGCGCCG CCCTGGCCGC CTCGGTCGGC ACCCAGAACG CCAAGTGGTA TGAGGCACTG TTCGCCGTCA GTGGTCTGCG GCTGGGCGCG TGGATGCCCA ACCCGGGCTT CCTCACCGGT CAGACCGGGC GCGGGGGCCG CGCCTGGTAC GAACCGGGGC TGCCCCGGGT CCGGCGCATG AGCTACCTGC TGCGCGAACT GTTCGGCGCC CACCCCGCCG GGGCGCCGCT GGTCCAGGTC AGTGATGGCG GCTTCTACGA CAACCTCGGC CTGATCGAAC TGTTCCGCCG CCGGTGCACC ACCATCTGCT GCATCGACGC CAGCGCCGAC AGCCCACCCG TGGCCGCCAC CGTCGCCCAG GTGGTCGGCC TGGCCTACCA GGAGCTGGGG GTGCGGGTCC ACCTGGACGA CGCGCCCTTC GCCACCACCC CCGGGTCCGG CAAGCCGCCC ACCGACCGGC CGAGCCTGCC CGGCCTGGAC CAACGGATGT CCGAAACCGG CGTGATGAGG GTGCGATTCA CCTACCCGCC GGAGTCGGGG CTGCCCGCGG ACCGGCGCAC CGGCACCCTG GTGGTGGCCA AAGCGCTGCT CTGGCCCAGC CTTCCGTACC AGCTGCAGGC GTATGCGGTG GACAACCCGG TCTTTCCGCA CGACAGCACC GCCGACCAGT GGTTCGACGA CGGCCAGTAC GGCGCCTACA CCGCGCTCGG ACGAGCCCTG GGCGCAGCGG CCGCCGCCGC TCTGACCGCC CGCGTCGACG ATCCCGCGGT CACCCCACCC GCCGGCGCCG AAAAGATCCG GCTCAGTTGC GCTCGGCCTG CCCCCGCCGC CAGTACCCCA TGA
|
Protein sequence | MLQRNPAPVD RGAPATARRR HPGRRPGRGL AHPGVALSLL VAAVVASGSA IGALLAGGSL GRDIDTLAGH PRTAVAADWI VIVGFGLTLV LGTVLATQVV FDPHCRAIAG FARLAAVLAV VGECAGILAS GLDQPVAAAV ASVLTVGAAL PASLVAITGL GLTAVRLGAH VRTDGPGLAA EPGPTVPTGP AEAGPRTRWR RGYDTGRGPD LQALDAAVGD RIGVGLSGGG IRAASVMLGA FQDPDMREKV LNRARYLVAV SGGGFVAGAF QQALTAAGPA PTGDETVLRD PRLAFVEGSP EEDHIRRHAS YLAANPVEVL VALGLLMRHL LLNLVLLFGP AIALGVLAGC FLNRVPLTSL RVAADGTTTR LEFAAEARTT WIAIAAVALL VVLSWLGAQW AAAHCDPSGR PGLAHRLRHA LNWATPAIGL VFWSVVALTL GLPAVIWAAA WILSLGDRTA ALGGSVGAVL ITYGASLAAL LWRHRKYLTD DSPQIPQAVP RGVGQILLVI LALGALAACW LLLFGGAATT VLGADHTGDW VGAVLVLLVV VFLGGFVDET TLSLHPFYRR RLARTFAVRS VRRPGEPALA VPLAPAERTT LSRYGVVAPA VARQFPEVIF AASATVGDGR TPAGANRVSY TFSSEFVGGP EVGYVPTSKL ERQLSPRLQR DLTVQGAVAL SGAALAASVG TQNAKWYEAL FAVSGLRLGA WMPNPGFLTG QTGRGGRAWY EPGLPRVRRM SYLLRELFGA HPAGAPLVQV SDGGFYDNLG LIELFRRRCT TICCIDASAD SPPVAATVAQ VVGLAYQELG VRVHLDDAPF ATTPGSGKPP TDRPSLPGLD QRMSETGVMR VRFTYPPESG LPADRRTGTL VVAKALLWPS LPYQLQAYAV DNPVFPHDST ADQWFDDGQY GAYTALGRAL GAAAAAALTA RVDDPAVTPP AGAEKIRLSC ARPAPAASTP
|
| |