Gene Namu_1911 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1911 
Symbol 
ID8447518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2104202 
End bp2107174 
Gene Length2973 bp 
Protein Length990 aa 
Translation table11 
GC content76% 
IMG OID645041041 
Producttranscriptional regulator domain protein 
Protein accessionYP_003201289 
Protein GI258652133 
COG category[K] Transcription 
COG ID[COG2909] ATP-dependent transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.345318 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000588756 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGACCAGAC CCGTGGTGCT GACCGAGAAG CTGCGCCGCC CCGAACCGGT GGGGTTGGCC 
CGGCGCCGGC TCGAGGAGCA GTTGCTCGCG CCCTCGACCA GCCGGTTGGA CCTGGTGGTG
GCGCCGCCCG GATCGGGCAA GACAACCCTG TTGGCCCGGG TGGCCGCGGC CGCCGAGAGC
GCGGGCAGTG CGGTTGGCTG GTACCGGGTG ACCGCCGACG ACGCCACCGA GTCGGCCTTC
GTGGCCCACC TGGCTCGGGC GCTGCACCTG GACACCGCCG CCGACGAGTC GTCGGTGACC
GACATGCTCG AGTTGCTCGA TCGGTCGCCG ACCGCGCCCA CCCTGGTGGT GTTCGACGAC
GTCCATGAGA TCGCCGGCTC GGTCGCCGAA CGGGCCCTGG ACCAGTTCGT GCAGCTGCGG
CCGCCGCACC TGCGGGTGCT GGTCGGTTGC CGCCGGCCCC CGGAACTGAA CATCCCGCGG
CTGCGGGTGT CCGGCCAGTT ACGCGAGATC ACCAGCGACG ACTTGCGTTT CCGCTCCTGG
GAGGTCGAGG AGCTGTTCCT GTCGGTGTTT CGGGAACCGC TCTCGCCGGA GTCGGCGGCC
GCGCTGACCC GGCGCACCGG CGGCTGGGCG GCCGGCCTGC AACTGTTCCA CCTGGCCACC
TCCGGACGCA GCGCGGTCGA CCGCGGGCGG GCCGTCGACG ACCTGGGTGG CCGGTCCAAG
CTGGTGCGCT CGTACCTGGC CCGCAACGTG TTGGCCGAGC TGCCCGAGGA ACGGCGGCAC
TTCCTGCTGC GCACGTGCAC GTTGGGCACG CTGACCGGAC CGCTGTGCGA CGCGCTGCTC
GGGGTGACCG GCAGTTCCAC CATCCTGGAC GAGCTGGAAC ACCGGCAACT GTTCACCTCC
TCCGACGACG ACGGCCACAC CTTCCGGTAC CACGAGGTGC TGCGCACGCA TCTGGAGTGG
GCCCTGGTCC AGGAGTACGG GGCGGCCGGC GCGCGCGAGT GGTACTCGCG CAGCGCCGCG
TTGTTGGAGT CGGTCGGCGA CCAGCGGGCG GCGGTGCGGG CCTACGCGCG GGCCGAGGAC
TGGGGTGCGG TCGGCCGGCT GTTGCAGACC CGCGGCACCG GCCCGGATGC CGGGGCCGCC
GGCACCGACC TGCTGCTGCC GCCGGCGGTG GTCGAGCACG ATCCGTGGTT GTCCCTGGCC
CAGGCCCGCC GCCGGGTCCG CGAGGGCGCC CTGACCGCCG CGGTCGACCG TTTCCGCCGG
GCCGAGAGCC TGCTCGACGA ACCGGATTTC CGCGACGACT GCCAGCGGGA ACGATCGGTC
GCGGCCATGT GGGCCTCGAC CGACACCACC GGCATGAGCT GGGCCGTGGA CGGGCGCGGG
TCGGCCCGGC ACTGGTCCAT CCCGATCCGC CTGGCCACCC GGCGGCTGGG CCCGGCCCGG
GTCGCTGCCC CGGCGCTGAC CGGATCGACC CCGGCGCCGC ACGAACTGGC CCGCGGCATC
ATCAGCCTGC TGGCCGGCGA CTTCCGCGAC GCCCGCCGCA CGCTGGACGC CGTCGTGGCC
GACCGGGCGG CCGAGTCGAC CCACCGGTTG CTGGCCGACC TGACCATTGC CGTCATCGAC
CTGATCCGCG ACCGCCCCGG GGATCCGGCG AGCCGGTTGG GCCAGATCGC GCTCGACGCC
GAGGTGGCCG GGTTGCCGTG GATCGCCCGG CTCGCCCGCG GGCTCGGCGA GTGCGTGCTG
GCGGCCACCG ACCCGGCCGC GTGGCGGCTG GCCGCCTGCG TCGATCAGAT CGACGAGTGT
GACGAGGCCG GCGACGCCTG GGGCGCCGCC CTGCTGACCT TGGCCGCCGC CGTCACCGGC
CTGCTGGCCG ACCCAGTGGC CGAGGCGCAT GGATTTGCCG ACGCGGCCGC GCGCTTCCGC
CGGCTGGATG CCCCGGTCCC GGCCCTGTGG GCGCAGGCGT TGCAGGCCTG CGCCCTGGCC
GGCCGGCGCC GCCCGGGCGC GGCCGAGTTG GCCGGCCGGG TGGCCACCGA GGCGCGCGCC
GCGCAGGTCA CCGGGGCCCA GGCCCTGGCC CAGATCGCCG CGGCGATGGC CGGGGCGCCG
GTCGGCACCG CGCCGGAAGC GACCGGCAGC GAACTGGACC GGATGATCGC CCTGTTCACC
GGGCCGGCCG CGCCCACCCC GCCCGTGGGT GAGCCGGCCG CCGCGACCTC CTCCATTCCC
GCGGACCAGC CGGCGCCGCT CGCCGCGAGC CGGACCGTCA CGATCCGCTG CCTGGGCGGG
TTCAGCGTCG ACATCGACGG TCAGAGCGTC GATCTGGCTC CGCTTCGGCC GCGGGCCCGC
GCGCTGCTGC GGCTGCTGGC GATGACGCCG AACCGGGACG TCCACCGCGA GCATCTGGTC
GACGCGCTGT GGCCGGGCAC GGATCTGACC GTCGGCACCC GGCGGTTGCA GGTGGCCGTG
TCCAGCGTGC GACAGCTGCT GGAACAGCGT GGCCTGCCCG GCGGCGAGGT GGTGCTGCGC
CACGGTGACG CCTACCGGCT GGCCCTGCCC CCGGGGTCGG TCGTGGACAC GGACGCCTTC
GAGCGGGGCG TGCGGGACGC CGAGACCGCC GCCGCCCGTG GTGATGTCAC CGCCGCCGCG
GCGCTGCGCG GCTCGGCCCT GTCCTGGTAC CGGGGCGACC TGCTGCCCGA GGACGGTCCG
GCCGAGCACG TGGTCGGCGA ACGCGACCGG CTCCGGCTGC TGGCCGCGAC CACGGCCGGC
ACCCTGGCCC AGGACTACCG GACGCTGGGG CAGCTGCGGC AGGCGGTGGC CGCCGCCCGG
CAGTCCGTGC AACTCGACCG CTATCAGGAC CTGGCCTGGG AGCTGCTGGC CGACCTGCAC
CGCGACGCCG GCGACGACAG CGCCGCCGCC CGCACCCGGC GCGAGCACGC CGCCGCCCAG
GCCGAACTGG AGCTGGATTC CGTGCGGCCC TGA
 
Protein sequence
MTRPVVLTEK LRRPEPVGLA RRRLEEQLLA PSTSRLDLVV APPGSGKTTL LARVAAAAES 
AGSAVGWYRV TADDATESAF VAHLARALHL DTAADESSVT DMLELLDRSP TAPTLVVFDD
VHEIAGSVAE RALDQFVQLR PPHLRVLVGC RRPPELNIPR LRVSGQLREI TSDDLRFRSW
EVEELFLSVF REPLSPESAA ALTRRTGGWA AGLQLFHLAT SGRSAVDRGR AVDDLGGRSK
LVRSYLARNV LAELPEERRH FLLRTCTLGT LTGPLCDALL GVTGSSTILD ELEHRQLFTS
SDDDGHTFRY HEVLRTHLEW ALVQEYGAAG AREWYSRSAA LLESVGDQRA AVRAYARAED
WGAVGRLLQT RGTGPDAGAA GTDLLLPPAV VEHDPWLSLA QARRRVREGA LTAAVDRFRR
AESLLDEPDF RDDCQRERSV AAMWASTDTT GMSWAVDGRG SARHWSIPIR LATRRLGPAR
VAAPALTGST PAPHELARGI ISLLAGDFRD ARRTLDAVVA DRAAESTHRL LADLTIAVID
LIRDRPGDPA SRLGQIALDA EVAGLPWIAR LARGLGECVL AATDPAAWRL AACVDQIDEC
DEAGDAWGAA LLTLAAAVTG LLADPVAEAH GFADAAARFR RLDAPVPALW AQALQACALA
GRRRPGAAEL AGRVATEARA AQVTGAQALA QIAAAMAGAP VGTAPEATGS ELDRMIALFT
GPAAPTPPVG EPAAATSSIP ADQPAPLAAS RTVTIRCLGG FSVDIDGQSV DLAPLRPRAR
ALLRLLAMTP NRDVHREHLV DALWPGTDLT VGTRRLQVAV SSVRQLLEQR GLPGGEVVLR
HGDAYRLALP PGSVVDTDAF ERGVRDAETA AARGDVTAAA ALRGSALSWY RGDLLPEDGP
AEHVVGERDR LRLLAATTAG TLAQDYRTLG QLRQAVAAAR QSVQLDRYQD LAWELLADLH
RDAGDDSAAA RTRREHAAAQ AELELDSVRP