Gene Namu_2962 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2962 
Symbol 
ID8448575 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3246670 
End bp3248130 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content72% 
IMG OID645042047 
Productprotein of unknown function DUF21 
Protein accessionYP_003202289 
Protein GI258653133 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.387429 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00195258 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCGCGC AGTGGATCAT GGTTGGCGTC GGCCTCGTCC TGATCGCCGG CACCGCGCTC 
TACGTCGCCG CCGAGTTCAG CCTGGTCACG GCGGACCGGG CGGCCGTGGC CAAGCAGGCC
GCGCAGGGCG ATCGCGGCGC CCGCTCCCTG ATGATCGGGC TGCGCTCGCT GTCCACCCAG
CTGTCCGGGG CGCAGATCGG CATCACCATC ACCACCCTGG CGTTGGGCTT CGTCATGCAG
CCGGCGCTGG CCGACCTGGT CGCGCCGCTA CTGGACGCGA TCGGCCTCGG CGCGGGCGTC
TCGCAGACCG TCGGCGCCCT GTTCGGGCTG GTCGTGGCGA CGGTGCTGTC GATGGTGTTC
GGCGAGCTGG TGCCCAAGAA CATCGCCATC GCCGAACCGC TGGACACGGC CAAGACGGTG
ATCACCCCCA TGCGGGTGTC CACCATGCTG TTCAAGCCGC TGATCATCGT GCTCAACGGC
ACTGCCAACG CGGTGCTGCG GGCCATCGGG GTGGAACCCC AGGAGGAGCT GCGCTCGGCC
CGGTCCGCGG TGGAGCTCGA TTCCCTGGTC CGCCGTTCGG CCGCCCAGGG CACCCTGGAA
CAGCCCACCG CCGGCCTGCT GGCCCGGTCG ATCTCGTTCT CCGGCAAGAC CGCCGACGAC
GTGCTCACCC CCCGGGTGCG GGTCCGGTTC GTCAAGGCCA CCGACACCGC GAACGCGGTA
CTCACCGCGG CCGTCGAGAC CGGGCATTCC CGGTTCCCGG TGTTCGGGGA GGACTCCGAC
GACGTCGTCG GGCTGGTGCA CCTCAAACGC GCGGTGGCCA TCCCGCCCGA CGAACGCGCC
GGCGTGCGGG TCGAGCAGCT GATGGTGCCG GTGCCGGTGG TCCCGGGGTC GATCCCGCTG
GACGACCTGA TGGACGAGCT GCGCAGCGGG CTGCAGATGG CGGTGGTGGC CGACGAGTAC
GGCGGCACGG CCGGGCTGCT CACCCTGGAG GACGTCGTCG AGGAGCTGGT CGGCGAGATC
AAGGACGAGC ACGACCCGGT GGACAGCCGG GCCGAACGGC GGGCCGACGA CACCTGGCTG
CTGCCGGGCA CCCTGCGGCC GGACGAGATC GTCGACATCA CCGGGGTGCG GCTGCCCGAA
TCCAGCGCCT ACGAGACGGT GGCCGGCCTG CTCATCGCCC GGCTGGGCCG GATGCCCAAG
GAGTTCGACG CCGTCGAGGT GGACGCGACC ATGGATGCCT CCGCCCATCT GGGCGTGCCC
GACCGGCAGG TCGTGCACTC CGACGACCGC CGGATCGAGC CGGAGACCGA GGACGACCTG
CCGCGGGCGG TGACCGTCCG GCTGACCGTG CACGGGCTGG CCCGGCGCCG GATCGAGGCC
GTGCTGCTGT CCGCGGTCAG CCTGAACACC GGCGACGAGG ACGAGAACGA GTCCGACGAA
CGCCGCGGAG ACGGCCGATG A
 
Protein sequence
MIAQWIMVGV GLVLIAGTAL YVAAEFSLVT ADRAAVAKQA AQGDRGARSL MIGLRSLSTQ 
LSGAQIGITI TTLALGFVMQ PALADLVAPL LDAIGLGAGV SQTVGALFGL VVATVLSMVF
GELVPKNIAI AEPLDTAKTV ITPMRVSTML FKPLIIVLNG TANAVLRAIG VEPQEELRSA
RSAVELDSLV RRSAAQGTLE QPTAGLLARS ISFSGKTADD VLTPRVRVRF VKATDTANAV
LTAAVETGHS RFPVFGEDSD DVVGLVHLKR AVAIPPDERA GVRVEQLMVP VPVVPGSIPL
DDLMDELRSG LQMAVVADEY GGTAGLLTLE DVVEELVGEI KDEHDPVDSR AERRADDTWL
LPGTLRPDEI VDITGVRLPE SSAYETVAGL LIARLGRMPK EFDAVEVDAT MDASAHLGVP
DRQVVHSDDR RIEPETEDDL PRAVTVRLTV HGLARRRIEA VLLSAVSLNT GDEDENESDE
RRGDGR