Gene Namu_1000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1000 
Symbol 
ID8446592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1098798 
End bp1100312 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content71% 
IMG OID645040135 
Producttranscriptional regulator, XRE family 
Protein accessionYP_003200398 
Protein GI258651242 
COG category[R] General function prediction only 
COG ID[COG3800] Predicted transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGGACC GGCCGGCTGT CAAAAGTTCT GGTCCGCGAT CCGCGCAGGC TCGATCGGAG 
CCGATCAGAA CCGAACAGAA CGCCGGCGCC CCCGACCCGT TGACCATCGG CCGCCGGCTG
CGGCACCTGC GCAAGGCCGC CGGCCTGACG CTGTCCGACG TCGCCGAGGC CGCCGGCATC
AGCCCGTCGG CGCTCTCGTT GTTCGAGAAC GGCAAGCGGG AGGCCAAGCT GTCGCTGCTG
ACCACCCTGG CCGGGGTGCT CGGCACCGAT CTGGGGGAAC TGCTGGCGGT GGCCCCGCCG
AGCCGGCGGG CGGCGCTGGA GATCGAGCTG GAACGGGCGC AACGCTCGTC CGGGTTCAAG
TCGCTGGAGA TCGCGGCGGT CAAGCCGGGC CCGCGGCTGC AGACCGAGGC GCTGGAGTCG
CTGGTCGGCC TGCACCGGGC GCTGGCCCGG ATCCAGGCCG AGCGGCAGGC CACCCCGGAA
CAGGCCCGCC GGGCCAACGC CGAGTTGCGC GCGGAGATGC GCCGGCGCGG CAACTACTTC
GGCGAGATCG AGAAGGTGGC CGCCGATCTG CTGACCGCCA CCGGGTACGA GGGCGGTCCG
ATCACCCGGT CCGTGGTCGA CCGGCTGGCC GCGCACCTGG GCTTCCGGCT GCGGCACTCG
GGGGATCTGC CCCAGTCCAC CCGCACGGTG ACCGACCTGG CCCACCGCAT CATCTACCTG
CCCCAGCCCG ACGCCGGCCA GCACGACTCG CGCTCCCTGG CCCTCAACGC GCTCGGCCAT
GTGGTGCTGG GACACGAAGT GCCGCAGGAC TATTCGGAGT TCCTGCGGCA GCGGGTGGAG
ATCAACTACT TCGCCGCCTC GCTGCTGATC CCCGAGCGCG GCGCGCTGAC CCTGCTCCGG
CGGGCCAAGG CGGCCAAGGA CATCGCGATC GAGGACCTGC GCGATGCCTA CGCGGTCTCC
TACGAGACCG CCGCGCACCG CTTCACCAAC CTGGCCACCC GGCACCTGGA CCTGCCCGTG
CACTTCATGC GGATCAGCAA GGCCGGGGTG ATCTACAAGG CCTACGAGAA CGATGGCGTG
CAGTTCCCGA TGGACGCGTC CGGGGCGATC GAGGGCCAGC GGGTCTGCCG GTACTGGACC
GCCCGGGTCG TCTTCGACCG GCCCGACCTG TCCTCGGCCT ACCAGCAATA CACCGACACC
AAGTCCGGAA CCTATTGGTG TACGGCCATT GTCGACCGCA CGGCGCAGGG CTTGTTCTCG
GTCAACGTGG GTGTGCCCTA CGCCGACGTC AAGTGGATGC GCGGTCGGGA GACCACCGAA
CGCTCCCGCT CCCGCTGCCC GGACCCGACC TGCTGCGCCC TGCCGCCGTC CGAGCTGGCC
GACCGCTGGG AGGGCATGGC CTGGCCCAGC GCCCGGGTGC ACTCGCACCT GCTGGCCGCC
ATGCCGCCCG GGGTCTTCCC GGGGGTGGAC CAGGTCGAGG TGCTCGGCTT CCTGGAGCGG
CACTCCGCCG ACTGA
 
Protein sequence
MADRPAVKSS GPRSAQARSE PIRTEQNAGA PDPLTIGRRL RHLRKAAGLT LSDVAEAAGI 
SPSALSLFEN GKREAKLSLL TTLAGVLGTD LGELLAVAPP SRRAALEIEL ERAQRSSGFK
SLEIAAVKPG PRLQTEALES LVGLHRALAR IQAERQATPE QARRANAELR AEMRRRGNYF
GEIEKVAADL LTATGYEGGP ITRSVVDRLA AHLGFRLRHS GDLPQSTRTV TDLAHRIIYL
PQPDAGQHDS RSLALNALGH VVLGHEVPQD YSEFLRQRVE INYFAASLLI PERGALTLLR
RAKAAKDIAI EDLRDAYAVS YETAAHRFTN LATRHLDLPV HFMRISKAGV IYKAYENDGV
QFPMDASGAI EGQRVCRYWT ARVVFDRPDL SSAYQQYTDT KSGTYWCTAI VDRTAQGLFS
VNVGVPYADV KWMRGRETTE RSRSRCPDPT CCALPPSELA DRWEGMAWPS ARVHSHLLAA
MPPGVFPGVD QVEVLGFLER HSAD