Gene Information |
Locus tag | Namu_4571 |
Symbol | |
ID | 8450199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 5090416 |
End bp | 5092053 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645043612 |
Product | protein of unknown function DUF404 |
Protein accession | YP_003203839 |
Protein GI | 258654683 |
COG category | [S] Function unknown |
COG ID | [COG2308] Uncharacterized conserved protein |
TIGRFAM ID | |
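The coordinate and length fields above are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming 1-based inclusive coordinates (the record does not state the convention) and a single terminal stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5090416
end_bp = 5092053
gene_length_bp = 1638
protein_length_aa = 545

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (as the record states) and ends in one
# stop codon, which does not contribute a residue to the protein.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```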
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.903579 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCGGATC TGTTCGAGGA TTACCCGTTC GCTCGGGCCT GGGATGAAAT GTTCGCGGCA CCCGGAGAGA TCCGGCCCGC GTACGAATCG GTGTTCGCCG CGCTGCAGAC CATGGACGCG GCCGACCTCA AGGCCCGAGC CGACATCATG GGCCGCACCT TCCTGGACCA GGGCATCACC TTCGCCCTGG GCGGGGTGGA GCGGCCGTTC CCGCTGGACC TGATTCCGCG GATCGTGACC GCAGCCGAGT GGCAGACGGT GGAAAAGGGC GTCCCGCAGC GGGTTCGGGC ACTGGAGGCG TTCCTGGCCG ACGTCTACGG GCAGGGCCGG ATCTTCACCG ACGGCGTCGT GCCCAAGCGG CTGGTCACCA CCTCCCCGCA CTTCCACCGG CAGGTCATGG GCATGAGCGC CCAGGACGGC GCCCGGGTGG TGATCTCCGG GGTCGACCTG ATCCGGGACG AGAAGGGCGA GTTCCGGGTC CTGGAGGACA ACGTCCGGGT CCCCTCCGGC GTGTCCTACG TGCTGGAGAA CCGGCAGGCG GTCGCCCAGG TGCTCTCCGA GGCCGGCGCC GACCAGCTGG TGCGGCCGGT GTCGGAGTAC CCCGGCCAGC TGCTGGCCGC GCTGCGCGCC GTCGCCCCGT GGAACGTCAC CGATCCCAAC GTGGTCGTCC TCACCCCCGG CGTCTACAAC TCGGCCTACT TCGAGCACAC CCTGCTGGCC CGGGAGATGG GCGTCGAGCT CGTCGAGGGA CGCGACCTGA TCTGCCGCAA CAACCGGGTC TTCCTGCGTA CCACCTCCAG CGAGATGCCG GTACACGTCA TCTACCGCCG GATCGACGAC GAGTTCCTGG ACCCGATGCA GTTCCGGGCC GACTCGCTGC TGGGCTCCCC CGGCCTGATC AACGCGGCGC GGGCCGGCAA CCTGACCATC GCCAACGCGG TCGGCAACGG CATCGCCGAC GACAAGCTGG TCTACACCTA CGTTCCGGAC ATCATTCGCT ACTACCTGAG CGAAGAGCCG ATCCTGCAGA ACGTGGACAC CTACCGGATG GAGGTGCCCG ACCACCGCGA GTACGCCCTC GAGCACCTGG CCGAACTGGT CCTCAAGCCG GTCGACGGAT CCGGCGGCAA GGGCATCGTC ATCGGGTCCC GGGCGGATCG CGCGGTGCTG CGCAAGGCGC GGGAGACCAT CCTGGAGAAC CCCCGCGGCT GGATCGCGCA GCGCGAGATC GCCCTGTCCA CGGTGCCCAC CCTGATCGGC GAGAAGATGC GACCCCGGCA CGTGGACCTG CGGCCGTTCG CGGTCAACAA CGGGCGCAGC GTCTGGGTGC TGCCCGGTGG CCTGACCCGG GTCGCGCTGC CCGAGGGCGA GCTGGTGGTG AACTCCTCGC AGGGCGGCGG TTCCAAGGAC ACCTGGGTGC TCGGCGGACC GATCCCCGAG CCCGAGCCGC AACCCGCGGC CGACGCCACC CAGGTGATGA ACATGCGCGA CCTGTCCTTC CACCAGCCGA TCAGCCCGCC CGAGGACAAT CTCGGCTTCC GCACCCAGCA CGAACAGCAA CAGCAACAGT CCGGCGGTGT CCGGGAACAG AACGTCCTGG CACACAGCGC CCGGGAACTG GAGGAACCGC AGTGCTGA
Protein sequence | MADLFEDYPF ARAWDEMFAA PGEIRPAYES VFAALQTMDA ADLKARADIM GRTFLDQGIT FALGGVERPF PLDLIPRIVT AAEWQTVEKG VPQRVRALEA FLADVYGQGR IFTDGVVPKR LVTTSPHFHR QVMGMSAQDG ARVVISGVDL IRDEKGEFRV LEDNVRVPSG VSYVLENRQA VAQVLSEAGA DQLVRPVSEY PGQLLAALRA VAPWNVTDPN VVVLTPGVYN SAYFEHTLLA REMGVELVEG RDLICRNNRV FLRTTSSEMP VHVIYRRIDD EFLDPMQFRA DSLLGSPGLI NAARAGNLTI ANAVGNGIAD DKLVYTYVPD IIRYYLSEEP ILQNVDTYRM EVPDHREYAL EHLAELVLKP VDGSGGKGIV IGSRADRAVL RKARETILEN PRGWIAQREI ALSTVPTLIG EKMRPRHVDL RPFAVNNGRS VWVLPGGLTR VALPEGELVV NSSQGGGSKD TWVLGGPIPE PEPQPAADAT QVMNMRDLSF HQPISPPEDN LGFRTQHEQQ QQQSGGVREQ NVLAHSAREL EEPQC
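As a sanity check on the two sequences above, the first ten codons of the gene sequence can be translated and compared with the start of the protein sequence. This is a minimal sketch, not the database's own procedure: `CODON_TABLE` and `translate` are illustrative names, and the codon assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons).

```python
from itertools import product

# Standard codon assignments, listed in TCAG order with the third base
# varying fastest. Table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base groups in the record.
    dna = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
gene_prefix = "ATGGCGGATC TGTTCGAGGA TTACCCGTTC"
print(translate(gene_prefix))  # MADLFEDYPF
```

The printed peptide matches the first ten residues of the protein sequence. Pasting the full gene sequence into `translate` would allow the same comparison over all 545 residues.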