Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_2649 |
Symbol | |
ID | 8448261 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 2895491 |
End bp | 2897353 |
Gene Length | 1863 bp |
Protein Length | 620 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645041743 |
Product | hypothetical protein |
Protein accession | YP_003201986 |
Protein GI | 258652830 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0121345 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00491611 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGCGCCG ACCTGAGCGA CGTCCGGATC CATACCGGCA GCGAACCTGC CAATCTCGCG AGATCGGTGC AGGCGACGGC GTTCACGCTT GGAAGAGATA TCTATTTCGG AGCGGGTAGC TACGCGCCGC ACACCACGTC CGGGCGACGG TTGCTGGCCC ACGAGCTGGC CCACACCGTT GAGCAGGGTG CTGCCGCATC CGGACCGGGT CCGGTCATCG GCCGCGCTGC CGACCCCGCG GAGAAGCAGG CCGATCGGGT CGCGGACAAC GTTCTGCGGG TGCTGCGGCA GCAAACCCAT AATCCCGCGG ATCCAGGTGG TGCCACGCGC GGTGACGGCA GTTCGACCTC GCCTTTGTCG GCGCTGCGGG AGCCGTCAGC GGGCGACACC GCGCCGGGCC GGGCACCCGG TGAGGCGCCC CGGTCTGGGC TCGTCGGTCG TCAACTGCGA CGCATGGTCG GTTTCGAGGC AGAACTGATG GTGCCCAGCC TTGGGCCGAG CGCCAACCAA CTCAAGTACA CCAAGGATCC GGACGACGTC ACCGATTCCA TCAAGTCGTT CCTGGACGGC GGCGTTCCCT ACGGCACCGA CATCGGCGGC AAGACCGCGG ACGTCGACGT GCGCCTGGAC AGCGATCACG GGGGGTCGAT CGATCGCACC CCGATCGTGA GCAAGCTCGC CGAGCTGGGC TGGATCTCCG GCAAGCCCTC CGAACCGAGG ACCAAGATCG AATTCGTCAC CAAGGCGGTG GACGAACTTG CGCCTGGCTC CAACAAGAGG CTCAGGACCG TCGGACTGGC TTTGAAGGGC CAGCTCACTG ACGCCCTGTC CCAAGCCAAG AGCGGGCAGA TGAAGCAGCT CGGGGCGCCC GCGAAGGCCG GCTACATGAC CGGTGTGCCC GTCGCCGATC TCGAATGGTG GTGGTTGATG GGCAAGGAGT CCAGCGAAAT GGACGCCATG GTCCAGGACT ATCTGACGAA CGGGATTCAG GACGATGTTT ACCTGCAGGC GACGGTGGGG GTCGTCCCCT CCTCCTTGAT AAAGTTCTTT GCTCAGGCAG CGCTGCCCGG CGGGAAGGTG GAACTCGCCC CGCCCTCACA GGCACGCCAA CAGATTCTCG GCCTGGTGCA GGAGGTCACC TCCGATCTGG AGAAGAAGTT CACGGCGGCC CCGGAGGAGC ATTGGGTCAA GAAGCTCGAC CAGGTGTCGA AGGATGCATT CCTGGGGCTT CTGGGCCTGA TCTACAGCTA CCTGCTGGGC GACACGTTGC ATCAGACTTC CGGGGGAACG CTCTCCACGG TCAAGAATGC CGTTCCCTTC CTCATCAAGA TGAGCCCGTA CGGCCTGCTT GCCAGTACCG CACCGCACAT GCTCAAGGAC AGTCCGCCGC CACGGGAATT CGTGCGCAGT ATCGGCAGCT TCTTGAAGAA GTCCAACTAC CTGCAGGTTG CCTACTGGGT CGAGGAGGCA CGAAAGGAAG GGCCGACCGC GGTCGGCGAG GGCAAACTCG GCGCGAAGCT CGAGGCGCGC CCGAGCTCGA CGCGCCTGGT CAAGGGCGAC TACACCGACT TCGTCGAGCA GGTTCTCCTG GGCTCGGGAG GGGCGATCGA GGTGGTGGTA GGGAAGGCGT TGCCGGCACC CGACAAGCCG CCCACCGACT CCGGGGGCGT CGATGTGTTC TTCGAGCTCT ACAACCAGAG CGGGATTCCG CTGGAGTATC GCGCGATCAC CAAGCGCTAC AAGGTCTCCG AAGTCCTGCC AGCCATCGGT GAGATCATCA GCGACGTCCG GATGGCCGGC ATGAGTGGGC TGACCGAGGA GCAAAAGGCC AAGGTCAAGG AGGCGTACGA GAGCGATGTC TGA
|
Protein sequence | MGADLSDVRI HTGSEPANLA RSVQATAFTL GRDIYFGAGS YAPHTTSGRR LLAHELAHTV EQGAAASGPG PVIGRAADPA EKQADRVADN VLRVLRQQTH NPADPGGATR GDGSSTSPLS ALREPSAGDT APGRAPGEAP RSGLVGRQLR RMVGFEAELM VPSLGPSANQ LKYTKDPDDV TDSIKSFLDG GVPYGTDIGG KTADVDVRLD SDHGGSIDRT PIVSKLAELG WISGKPSEPR TKIEFVTKAV DELAPGSNKR LRTVGLALKG QLTDALSQAK SGQMKQLGAP AKAGYMTGVP VADLEWWWLM GKESSEMDAM VQDYLTNGIQ DDVYLQATVG VVPSSLIKFF AQAALPGGKV ELAPPSQARQ QILGLVQEVT SDLEKKFTAA PEEHWVKKLD QVSKDAFLGL LGLIYSYLLG DTLHQTSGGT LSTVKNAVPF LIKMSPYGLL ASTAPHMLKD SPPPREFVRS IGSFLKKSNY LQVAYWVEEA RKEGPTAVGE GKLGAKLEAR PSSTRLVKGD YTDFVEQVLL GSGGAIEVVV GKALPAPDKP PTDSGGVDVF FELYNQSGIP LEYRAITKRY KVSEVLPAIG EIISDVRMAG MSGLTEEQKA KVKEAYESDV
|
| |