Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_1971 |
Symbol | |
ID | 8447580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 2175931 |
End bp | 2177721 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 645041100 |
Product | protein of unknown function DUF262 |
Protein accession | YP_003201346 |
Protein GI | 258652190 |
COG category | [S] Function unknown |
COG ID | [COG3472] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0185458 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00310192 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGAAGT ACTCGGTGCA GCAGGAGTCT GTCGACGGGC TTCTCACCCT CGTCAAGGGC GGCAAGATCG CGATACCGGA ACTGCAACGT CCGTTCGTGT GGAACTCGAC CAAGGTCCGG GATCTCCTTG ACTCGCTCTA CAAGGGCTAC CCGGTGGGAT ACCTGATCAC CTGGCAGTCG GTGGGTGCAG CGCTGAAGGA CGGCCAGATT GCGCAGCACC AGCAGATCCT CATCGACGGT CAGCAACGGG TGACCGCCCT TCGGGCGGCC GTCGCCGGAC TGCCCGTAGT CGACAATCAC TACAAGAAAC AGAAGATCAC CATCGCGTTC AATCCACTCA CCGGTGACTT CGAGACCGTC ACCCCGGTGA TCCGCAAGAC TCCGCAGTGG ATCCCCGACG TCTCGGAATT GTTCGGCGCG ACGTCCACTT TCGCGTTCTT CTCGAAGTAT GTGGCTCGCA ATCCGGATGT CGACGTTGCC CTCGTGGAGG AGTCGATCGA CCGTTTGCTG ACGATCAGGA GCGCGCAGAT CGGGATCATC GCGCTGGCCG ACGACCTCGA GGTTGAGACG GTCTCTGAGA TCTTCATCCG GATCAACTCC AAGGGCGTGC CGCTGTCCAG CGCCGACTTC GCGATGAGCA AGATCGCCAC CTACGGTGAC CGCGGGCGTA ATCTGCGCAA GCTAATCGAC TACTTCTGCC ACCTGTCAGT CGCTCCGCAC GTCTACCAGG ACATCGTCGA CAACGACCAC GAGTTCGTGG CCACGCCGTT CCTGCCGAAG ATCGCCTGGC TCAAGGACGA TGCCGAGGAC CTGTACGACC CGAATTACAG CGACGTCATC CGGATCGCCA ACCTCATCGG ATTCAACCGC GGCGTCGCAT CAGCATTCGT CAGCGAGTTG TCCGGGCGTG ACCCGGAGAC CCGAAAGGTC GACGAATCGA GGATTCCGGT CGCTTACGAC CGACTCGAAG ACGCGCTGCT TCGGATCGTC AACAAGTACG ACTTCCAGAA CTTCATCATG ACAATCAAGT CGGCCGGCTT CATTACGACG GCGATGATCA GCTCCAAGAA CGCGCTGAAC TTCGCCTACG CCCTGTACCT ACGTCTGCGG CACGACCCCA CAATGTCCGA GGGCGAGCGG AAGAGCATCG TCCGAAGGTG GTTCGTCATG TCGATGCTGA CCGGGCGCAA CTCGGGGAGC GTAGAGACCC AGTGGGAACT TGACGTTCGG CGGATCAGTC AGTACGGCGC GGCGGCCCAC CTCAAACAGA TCGAAGAGGC GGAGCTGTCG GATGCCTTCT GGCAGGTGAC TCTACCCGGG AACCTCGAGA CGTCCAGCGT CCGCAGCCCC TACTTCCAAA CGTTCCTCGC CGCTCAGGTG AAGACCGGAG CGCGCGGCTT CCTGTCGAAA TCGATCACAG TCCAGGCGAT GCACGAGCAG ATCGGCGACA TCCACCACAT CGTGCCCAAG GACTACCTCA AAAAGGAAGG CGTCACCGAC CGGTCCGACT ACAACCAGGT AGCGAACTAC GTTCTCACCG AGACATCGAT CAACATCCGC ATCAGCAACC GCGCCCCCGG CGCCTACATG GACGAGATCC GCACCCAGGT CGACACCGGA AAGCTCACTC TCGGTGAGAT CACCGATGAT CAGGACCTCC GTCGAAACTT CGCCGAGAAC GCCGTGCCGC ACGATCTCGA CACCGTCACC GCCGGCTCGT ACTTCGACTT TCTGGTCCGA CGACGGGTCA TGATGGCACA GACCATCGGT CGGTACTACG ACGGACTATA G
|
Protein sequence | MAKYSVQQES VDGLLTLVKG GKIAIPELQR PFVWNSTKVR DLLDSLYKGY PVGYLITWQS VGAALKDGQI AQHQQILIDG QQRVTALRAA VAGLPVVDNH YKKQKITIAF NPLTGDFETV TPVIRKTPQW IPDVSELFGA TSTFAFFSKY VARNPDVDVA LVEESIDRLL TIRSAQIGII ALADDLEVET VSEIFIRINS KGVPLSSADF AMSKIATYGD RGRNLRKLID YFCHLSVAPH VYQDIVDNDH EFVATPFLPK IAWLKDDAED LYDPNYSDVI RIANLIGFNR GVASAFVSEL SGRDPETRKV DESRIPVAYD RLEDALLRIV NKYDFQNFIM TIKSAGFITT AMISSKNALN FAYALYLRLR HDPTMSEGER KSIVRRWFVM SMLTGRNSGS VETQWELDVR RISQYGAAAH LKQIEEAELS DAFWQVTLPG NLETSSVRSP YFQTFLAAQV KTGARGFLSK SITVQAMHEQ IGDIHHIVPK DYLKKEGVTD RSDYNQVANY VLTETSINIR ISNRAPGAYM DEIRTQVDTG KLTLGEITDD QDLRRNFAEN AVPHDLDTVT AGSYFDFLVR RRVMMAQTIG RYYDGL
|
| |