Gene Namu_2626 details


Gene Information

Locus tag: Namu_2626
Symbol:
ID: 8448238
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 2877594
End bp: 2878604
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 66%
IMG OID: 645041722
Product: transcriptional regulator, AraC family
Protein accession: YP_003201965
Protein GI: 258652809
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
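
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so, but the listed Gene Length is consistent with that reading) and that the final codon is a stop codon that is not counted in Protein Length. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2877594, 2878604)
print(length)                           # 1011
print(expected_protein_length(length))  # 336
```

Both results agree with the Gene Length (1011 bp) and Protein Length (336 aa) fields of this record.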


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.00000849178
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00271101
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGCGGGG CGAGCGGAGC AGGGCTACCG GTCATCGTGT TCGATGCGCA GCCGACGTCC 
GCGGAGGAAC AGTTCGAGCT GTTCCACGAC ACGACCGCGC CGGTGTTCGA CACGCTCCCC
TCGGGGAGCC CAGCCGACTT CTCTGCCCAA GCGACGGACT ATCTCGTGGG TGACGTGGTC
ATCAGCCGGA TCGCGCACGC ACCGCAGAGC ATGCGGCGCA GCATCCGCCA TATCCGCTGC
GGCAGCGAGG ACGCGCTCGC AGTCCTCGTC TACCGCCGAG GGCGTGTCGA CCTCAGCTTC
GACCGCACCG AGATGACCCT GGACTCGCAG CACGTCGGCA TCATCGACCT CGCTCGATCC
TTCTACGCGG CCTGCACCGA CATCGATTCG GTCTGGGCGG TCATCCCTCG TCGTCGCCTC
CGTGCTTCCC TGGGGCGATC GCCGTGCGCG CGACTGCACA GGGACTCGCC CCGCGGGCGG
GTGCTGCGCA GCACGGTCAT ATCCGTCTGG AACAGACTTC CGAACGCTTC CGCGGAAGAC
GCAACGACTC TGGCGCAGGA AATCATCGAC GCCACCCAAT CGGTGCTCAC CGACGGCGAC
TTCGCGCCTT CCGACACCGC TCTAGCAGTG GCGATGAGCG ACTTCGTCAT CGCGCACCTG
GATGATCTGG ACCTCGACGC GCGCATGCTC GCCCGCACGT TCCACTGCTC GCGGTCGACG
CTTTTCCGGA TCTTCGCACC GCATGGCGGT GTCGCCGCCT ACATCCGCGA CGCGCGGCTG
GACCGTTGCC TCGACGAGCT GCTCGAACCG TACGAGTCAA CCCGCACGGT CCACCAGATC
GCGACCAGAT GGGGATTTGA GAACCCGAGT CATTTCCACC GACTTTTCAC CACGCGCTAC
GGAACTCCGC CATCCACAGC GCGCGGCACA CGCCACGCAC CGCCCGGTCG CGCCTACGAC
CAAGACACGA GCAAAAAGAT CAACACGTTC CATCAATGGG CCACCCGGTG A
 
Protein sequence
MRGASGAGLP VIVFDAQPTS AEEQFELFHD TTAPVFDTLP SGSPADFSAQ ATDYLVGDVV 
ISRIAHAPQS MRRSIRHIRC GSEDALAVLV YRRGRVDLSF DRTEMTLDSQ HVGIIDLARS
FYAACTDIDS VWAVIPRRRL RASLGRSPCA RLHRDSPRGR VLRSTVISVW NRLPNASAED
ATTLAQEIID ATQSVLTDGD FAPSDTALAV AMSDFVIAHL DDLDLDARML ARTFHCSRST
LFRIFAPHGG VAAYIRDARL DRCLDELLEP YESTRTVHQI ATRWGFENPS HFHRLFTTRY
GTPPSTARGT RHAPPGRAYD QDTSKKINTF HQWATR
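
The gene and protein sequences above can be checked against each other, and against the GC content field, with a short script. This is a minimal sketch, not the database's own method: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), it assumes the sequence is given on the coding strand starting in frame, and it strips the 10-base grouping spaces before processing. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order.
# Translation table 11 uses the same assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for 10-base grouping; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCGCGGGG CGAGCGGAGC AGGGCTACCG"))  # MRGASGAGLP
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence, or passing it to `gc_percent` and comparing with the GC content field, follows the same pattern.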