Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_2947 |
Symbol | |
ID | 8448560 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 3227730 |
End bp | 3228731 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 645042032 |
Product | 5'-3' exonuclease |
Protein accession | YP_003202274 |
Protein GI | 258653118 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.0176217 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000125626 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTGCAAT TGCTTGATCT GGCCGGCATC TATTTCCGCG CGTTCTACGC GGTGCCGACG TCGACGACCG CGCCGGACGG CCGGCCGATG AACGCGGTGC GCGGATCGCT GGACATCATC GCCCGGGTGA TGGCCGACGC CCGGCCGACC CGGGTGATCG CCTGCCTGGA CCTGGATTGG CGGCCGGCCT GGCGGGTCGA GCTGATCGAG TCCTACAAGA CCCACCGCGT ACTGGAACCC GGTTCGGCCC AGGCGGCCTC GCTGGCCGCA GCCGCCGACG CGCTGGCCGC AGCCACCCCG CACAGCGGCA CCGCGGGACC CTCGGCCGAC ATCGAGGTGG TGCCCGACGA ACTGTCCCCG CAGGTGCCGA TCCTGCTGGA CACGCTGGCC GCGATCGGCA TCACCGTCGC CGGCGCCCCC GGATTCGAGG CCGACGACGT CATCGGCACC CTGGCCCACG CCGAGTCGGT GGACCCGGTC GAGGTGGTGA CCGGCGACCG GGACCTGTTC CAGGTCGCCC GGGACGACGC CCCGCCGGTG ACGGTTCGCT ACATCGGGGC CGGGATGAGC AAGGCCAAGG TCTACTCGGC CGCCGACGTG GCCGCCCGCT ACGGCATTCC CGCCGACAGC TACGCCGACT TCGCCGCCCT GCGGGGCGAC CCCTCCGACG GCCTGCCCGG CGTGGCCGGG GTCGGGGAAA AGACCGCGGC CACCCTGATC GGCCGGTTCG GCTCCATCGA GGGCCTGATC GAGGCGCTGG ACGCCCGGTC CACCGGCCTG GCCCCCGGGG TGCGCAGCAA ACTGGCCGCC GCCCGCGACT ACCTGCTGGT CGCCCCGCGG GTGGTGCGGG TCGCCCAGGA CGCTCCGGTC TCCGTCGAAC CCGCCGGCGA CGGCCGCCTG CCGGCCGAAC CCGTCGACCC CCAGGCTCTG TCCGACCTGA TCCGGGCCCA CGGCATCGGC GGCCCGGTCA ACCGCCTGCT CCAGACCCTC ACCGCCGGCT GA
|
Protein sequence | MLQLLDLAGI YFRAFYAVPT STTAPDGRPM NAVRGSLDII ARVMADARPT RVIACLDLDW RPAWRVELIE SYKTHRVLEP GSAQAASLAA AADALAAATP HSGTAGPSAD IEVVPDELSP QVPILLDTLA AIGITVAGAP GFEADDVIGT LAHAESVDPV EVVTGDRDLF QVARDDAPPV TVRYIGAGMS KAKVYSAADV AARYGIPADS YADFAALRGD PSDGLPGVAG VGEKTAATLI GRFGSIEGLI EALDARSTGL APGVRSKLAA ARDYLLVAPR VVRVAQDAPV SVEPAGDGRL PAEPVDPQAL SDLIRAHGIG GPVNRLLQTL TAG
|
| |