Gene Namu_2031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2031 
Symbol 
ID8447640 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2242101 
End bp2243102 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content73% 
IMG OID645041157 
Producttranscriptional regulator, LysR family 
Protein accessionYP_003201403 
Protein GI258652247 
COG category[K] Transcription 
COG ID[COG0583] Transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0205829 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00632018 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGTAAGA TGCACTGCAT GATTGATGTC CGCAAGCTGG AGATCCTGCG GGAGCTCGAT 
CGGTGCGGCA CCATCGCCGC CACCGCCGCC GCCGTCCACC TCACCCCGTC CGCGGTGTCC
CAACAGCTGG CCGCGTTGTC CAAGGAGGCC GGCACGGCCA TGCTGGAACC GGACGGGCGC
CGGGTCCGGC TTACCGAGGC CGCCCAACTG CTGCTGCAGC ACGCGCACCA GATCTTCACC
CACCTGGAAC ACGCCGAGTC GGACCTGGCC GCCTTCCGGC GGGGGGACGC CGGCACGGTC
CGGGTGGGCA CGTTCAGCTC CGCGGTGAAA GCCCTGGCGG TGCCGCTGGT GTCCGACCTG
TCCACCCGGA CCCGAATCCG GGTGGAACTG CGCGAGGTGC AGCCGGAGGA TGCGCTGGAC
GCCCTGCTGG GGCGGCGGGT GGACATCTGC ATGAACCTGG CCACCACCGA ATTGTTGCCC
GGCTCGGACG ACAAACGGGT CCACTCCGAG CACCTGCTCG ACGACGTGAT GGACGTCGCC
CTGCCGTTCG ATCACCCGCT GGCCGATCGC GCCGAGATCG AGCTGGCCGA TTTGGCCGAC
GAGGACTGGA TCCTGGCCAA CCCCGGGGTG CCGTGCTGGC AGCTGAGCCG GGACGCCTGC
GAACGGGCCG GGTTCTCCCC GCGCGCCCGC CACTACGCCG ACGAATTCGT CGGCGTGGTC
GGGCTGGTCG CGGCCGGCCA CGGGGTCAGC CTGCTCCCCC GGCTCGCCCA ACCCGAGGCC
GTGCACGAAC CGATCGTGCT GCGACCCGTC GCCGGGGTCA GCCCGGTTCG CCGGATCAGC
GTGCAGACCC GGGCCGGCAC CGCCGACCAG CCGCACATCG CGCCCGCCCT GGAGTCCCTG
CGCCGGGTCG CCGCCGGCGT GGCCCGGGGC CCGCTGGCCT GTCGCAGCGT GACCCGGGGC
CCGGCCCCGA TCGCCGATCC GGCCCCGGCG TTGGTGAGCT GA
 
Protein sequence
MRKMHCMIDV RKLEILRELD RCGTIAATAA AVHLTPSAVS QQLAALSKEA GTAMLEPDGR 
RVRLTEAAQL LLQHAHQIFT HLEHAESDLA AFRRGDAGTV RVGTFSSAVK ALAVPLVSDL
STRTRIRVEL REVQPEDALD ALLGRRVDIC MNLATTELLP GSDDKRVHSE HLLDDVMDVA
LPFDHPLADR AEIELADLAD EDWILANPGV PCWQLSRDAC ERAGFSPRAR HYADEFVGVV
GLVAAGHGVS LLPRLAQPEA VHEPIVLRPV AGVSPVRRIS VQTRAGTADQ PHIAPALESL
RRVAAGVARG PLACRSVTRG PAPIADPAPA LVS