Gene Namu_3598 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3598 
Symbol 
ID8449217 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3950476 
End bp3951417 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content68% 
IMG OID645042669 
Producttranscriptional regulator, AraC family 
Protein accessionYP_003202905 
Protein GI258653749 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0167816 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0986203 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATCG GACTGATCGC GATCGACGGC TGCTTCGGTT CGGCTGTCGC GTCGGTCATC 
GACATCGTGC GGGTGGCCGA CGGAGCCCGC GGCGATGTCG ACCCGCGGAT CGACCCGATC
GAACTCGCCA TCCTCGGACC GAAACGGCGA GTGACCACGA CGGCATCGAT GACCCTGTCG
GTGGACCACC CGCTGTCGGA GTCCGCAGAG TTCGACGTGG TCGTCGTCCC TGCGCTTGGA
ACCCTCACGG CCGCCGCGAC CCACGACGCC CTCCAGAGCC GAGATGCTCG TTCGGTCATC
GCCTCGCTCG GGCGCCTCGA CGACGCGACC ACCCGGATCG CCGCGGCGTG CACCGGCGTG
TTCGCCGTCG CCGAGACCGG ACGGATGCAT CATCGGCGGG CGACGACCAG CTGGTTCCTG
GGGCCGGAGT TCCTGAAGCG CTATCCGACC GTCGCCCTCG ATCTCGACAC CATGGTCGTG
GTCGACGGGA ACCTCGTCAC CGCCGGCGCC GCGTTCGCCC ACATCGACCT CGCGCTCTCA
CTCGTGCGAT CGATCAGCCC CGACCTGGCC CAACATGTCG CCAAGCTCCT CATCATCGAC
GAGCGTCCGT CGCAGGCGGC CTTCGTCGCC TACGAACATC TCCGGCACGA GGACCCGATC
GTCGTCGAGT TCGAACGCTT CGTGCGCGCC CGCCTGGACG AACCGTTCAA CGTCGCCTTC
GTCGCGCAGT CGCTCGGCAC CAGCCGGCGC ACCCTCGAAC GACGAGTCCG TGCGGCGCTC
AACCTCACTC CGCTCGGCTT CGTCCAACGG CTTCGCATCG AACGAGCTCG GCACCTCTTA
GCAACCACGG ACCTCACCTC CGCCGAGATC GCGCTACGGG TCGGCTACGC GAACGCCGAG
ACTCTGCGCT CCCTCCTGCG CAGGGAGCGA CGCCGTTCCT GA
 
Protein sequence
MRIGLIAIDG CFGSAVASVI DIVRVADGAR GDVDPRIDPI ELAILGPKRR VTTTASMTLS 
VDHPLSESAE FDVVVVPALG TLTAAATHDA LQSRDARSVI ASLGRLDDAT TRIAAACTGV
FAVAETGRMH HRRATTSWFL GPEFLKRYPT VALDLDTMVV VDGNLVTAGA AFAHIDLALS
LVRSISPDLA QHVAKLLIID ERPSQAAFVA YEHLRHEDPI VVEFERFVRA RLDEPFNVAF
VAQSLGTSRR TLERRVRAAL NLTPLGFVQR LRIERARHLL ATTDLTSAEI ALRVGYANAE
TLRSLLRRER RRS