Gene Namu_2971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2971 
Symbol 
ID8448584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3255746 
End bp3257257 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content71% 
IMG OID645042056 
Productprotein of unknown function DUF245 domain protein 
Protein accessionYP_003202298 
Protein GI258653142 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0494873 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0174629 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGTGC GTCGCATCAT GGGCACCGAA GTCGAGTACG GCATTTCGGT GCCGGGCGAG 
CCGGGCGTGA ACCCGGTGAT CTCCTCCACC CAGGTGGTCC TGGCCTACGC CGCCTCGGTG
GCCGCGCCCC GGGCCCGCCG GCCCCGGTGG GACTACGAGG TGGAATCGCC GCTGCGGGAC
GCCCGCGGGT ACGACCTGTC CTCGCTGTTC GGGGCGGCCG AACCGGACGT GGACGACATC
GGTGCGGCCA ATGTGATCCT GTCCAACGGG GCCCGCCTGT ACGTCGACCA CGCCCACCCG
GAGTTCTCCG CCCCCGAGGT GACCAACCCG CTGGACGCCG TGCTCTACGA CAAGGCCGGC
GAGCGGGTGA TGGAGACGGC GGCCCGGCTG GCCGCCTCGC TGCCCGGGTC CAAGCCGATC
CAGATGTACA AGAACAACGT CGACGGCAAG GGCGCCTCCT ACGGCACCCA CGAGAACTAC
CTGTGCAGCC GGGACACCCC GTTCCCGGCG ATCATCGCCG GCCTCACCCC GTTCTTTACC
ACCCGGCAGG TCTTCGCCGG CGCCGGGCGG GTGGGCATTG GGCCGGCCGG GCAGACCGAG
GGCTTCCAGC TCGGCCAGCG CAGCGACTAC ATCGAGGTCG AGGTCGGCCT GGAGACCACT
CTCAAGCGCG GCATCATCAA CACCCGCGAC GAGCCGCACG CCGACGCCGA CAAGCACCGC
CGCCTGCACG TGATCATCGG GGACGCGAAC CTGGCTGAGA TCGCCACCTA CCTCAAGGTC
GGCACGGCAG CCCTGGTGCT GGCCATGATC GAGTCCGGCT GGGCGCTGCC GTCGGTCGAG
CTGGCCCACC CGGTGGCGGC GGTGCACCAG ATCTCGCACG ACCCGACGCT CAAGGTCACC
GTCCCGCTCA CCGACGGCCG CCGGCTGACC GGGGTGGACG TGCAGCGGGC CTACCACGAG
GCGGCGGCCA AGTACGTCGA GGCCGAGTAC GGCGATGACG TCGACGAGCA GACCCGGGAC
GTGCTGGACC GCTGGATCAG CGTGCTGGAC CGGCTGGCCC ACGACCCGAT GGACCTGGCC
TCCGAGCTGG ACTGGCCGGC CAAGCTGCGG CTGCTGGAGG GCTATCGCAG CCGCGACGGG
CTGGCCTGGG GCGCCGGCCG GCTGGCCCTG ATCGACCTGC AGTACTCCGA CGTGCGGATG
GACAAGGGTC TGTACAACCG GCTGGTCTCC CGCGGGTCGA TGCAGCGGCT GGTCACCGAG
GAGCAGGTGA CCGCGGCGAT GACCGATCCG CCCGAGGACA CCCGCGCCTA CTTCCGCGGC
CGCTGCGTGT CCAAGTACGC CGACCGCCTG GCCGCGGCGT CCTGGGACTC GGTCATCTTC
GACATCGGCC GGGAATCGCT GGTGCGCATC CCGACCATGG AGCCCACCCG GGGCACCAAG
GCGCACGTGG GGGCCCTGCT GGACGCCGCC GCGGACGCGA CCGAACTGGT CGACGCACTC
ACCCGCCGGT GA
 
Protein sequence
MSVRRIMGTE VEYGISVPGE PGVNPVISST QVVLAYAASV AAPRARRPRW DYEVESPLRD 
ARGYDLSSLF GAAEPDVDDI GAANVILSNG ARLYVDHAHP EFSAPEVTNP LDAVLYDKAG
ERVMETAARL AASLPGSKPI QMYKNNVDGK GASYGTHENY LCSRDTPFPA IIAGLTPFFT
TRQVFAGAGR VGIGPAGQTE GFQLGQRSDY IEVEVGLETT LKRGIINTRD EPHADADKHR
RLHVIIGDAN LAEIATYLKV GTAALVLAMI ESGWALPSVE LAHPVAAVHQ ISHDPTLKVT
VPLTDGRRLT GVDVQRAYHE AAAKYVEAEY GDDVDEQTRD VLDRWISVLD RLAHDPMDLA
SELDWPAKLR LLEGYRSRDG LAWGAGRLAL IDLQYSDVRM DKGLYNRLVS RGSMQRLVTE
EQVTAAMTDP PEDTRAYFRG RCVSKYADRL AAASWDSVIF DIGRESLVRI PTMEPTRGTK
AHVGALLDAA ADATELVDAL TRR