Gene Namu_1020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1020 
Symbol 
ID8446616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1116265 
End bp1118205 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content75% 
IMG OID645040159 
ProductAAA ATPase central domain protein 
Protein accessionYP_003200418 
Protein GI258651262 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0464] ATPases of the AAA+ class 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTCGGAC GACTGGTCGA CGTCGTGAAG TGGATCGGCG CTGCCGCCGA CCGCCGGGAC 
GGCGGCGACC CCGTCGCGCA TGGCCGGCTG GTGGCCACCG TCGGAGTCAT CGAACAGGAC
TACGCCGCGA TCGACCCGAC CGAGCCGGGC GGCGATCCGC TCGACGGCCT GGGCGAGTTG
TTCGGCCTGG ACGAGACCGA CCGCGACCTG CTGCTGCTGG CCGCCGCCGC CGACCTGGAC
GGCAACATCG CGTTGGCCTT CGGCCTGCTG CGCGGGGGCA GCGCACCGTG CCACCCCAGC
ATCGGGCTGG CCCTGGAACT GCTGGGCATC TCCTCGTTGT CCGGCCCGGC CCGGGCCCGG
TTCGCCGACG GGGGCCGGCT GGCCGCGGCC GCGTTGATCG AGGTCGCCGC GACCGGGCCG
TGGCTGGCCC GCGAATTCCG CTGCCCCGAC CGGGTGGTCA CCCAGCTGGC CGGCGCGGCC
CACCCGGATC GCAGCCTGGA CCCGCTGGTC GTCGAGCCGG TGCCGTTGCC GCTGCCCGGC
GGTGCCGAGC TGGTGCGGGC CCTGGAGCAG GGCGCCAGCT TCGTCTGGGT GCATTCGCCG
ACCGGCACGG CCGGCCTGAC CATGGTCGTC GGGGCGCTGG GCGAGGTCGG CGTCCGCAGC
CTGAGCCTGA CGGTGCCGCT GGCCGGCGAC GCCGGCCACG GCGGCTTCCT CCGGGTGGCC
ATCCGGGAGG CGGCGCTGAC CAACCGGTGT CTGGTGCTGG ACCGGGCCGA GCGCCTGTTC
GCCGCCCCGG AGGGTGCCCC GCCGCGGGAC CCGGCCGACC TGCTGGCCGC GCTGGCCCGC
CCGCCGGTGC CGGTCGTCGC GGTCGGCACC GCGCCATGGG ACCGGCACCT GTCCACCACC
ACCGCGCCGT ACCTGCTGGC CGCGCCGCGG CTGGGACCGG ACGAATGCCT GCGGATGTGG
GCGGCGATCA CCGGCACGCC GCCGCCGGCG GGCGCGCTGG CCGGGCTGCG GCTGTCACCC
GACCTGGTCG ACCAGCTGGC CCGGTCGGCC ACCCGGCTGG CCGCGGCCGA GGACGTGCCG
CTGACCGCCG ACCTGGTGCG CCGGGTGGCC CGGCAACTGG CCGGCTCGGC CGAGCCCGAC
GCCGCGGTCG GATTGGCCGA TCTGGTGCTG CCCGACCCGA CCGAGCGGGC GATCCGCCGG
CTGATCGGGT GGGCCCGGCA CCGCGAGGAG CTGATCGCCC GCGGGCTGAT CGTCGCCGGG
CCCGGCAGCG GCGGCGGTAT CACCGCGCTG TTCAGCGGCA GCCCGGGGAC CGGCAAGACG
CTGGCCGCGC ACGTGGTGGC GGCCGAGCTG GGCATCGACG TGCTGCGGGT GGACCTGGCC
GCTGTGGTCG ACAAGTACAT CGGGCAGACC CAGAAGAACC TGGAGCAGGT CTTTCACCGC
GCGGAGAGCC TGAACGTGCT GCTGTTCTTC GACGAGGCCG AGGCGTTGTT CGGGCGGCGG
TCAGAGGTCA AGGACGCGCA CGACCGGTAC GCCAACCAGG AGGTCGCCTA CCTGCTGCAG
CGGATGGAGC AGTTCGACGG CATCACCGTG CTGACCACCA ACCTGCGCGG CAGCCTGGAC
CCGGCGTTCA GCCGGCGCCT GAGCTTCATC CTGCACTTCC CCGACCCGGA CGAGCCGACC
CGCCGCCGGC TCTGGCTCAC CCACGCCGCC CGGCTGGGAC CGATCGACCC GGACGATCCG
ATCGACGCCG GCCGGCTGGC CGCCACCGTC GAGCTCTCCG GCGGCGACAT CCGCAACGTG
GTCGTCGCCG CCGGGTACGA CGCGGCCATC GACGGCGTGC GGCCGGGCAT GCGGCACCTG
CTGGACGCCG CGGTCGCCGA GTACACCAAG TTGGGTCGCC GGGTACCGGC CGACCTGCTG
GACGGCGCCC GCCCCCGGTA G
 
Protein sequence
MFGRLVDVVK WIGAAADRRD GGDPVAHGRL VATVGVIEQD YAAIDPTEPG GDPLDGLGEL 
FGLDETDRDL LLLAAAADLD GNIALAFGLL RGGSAPCHPS IGLALELLGI SSLSGPARAR
FADGGRLAAA ALIEVAATGP WLAREFRCPD RVVTQLAGAA HPDRSLDPLV VEPVPLPLPG
GAELVRALEQ GASFVWVHSP TGTAGLTMVV GALGEVGVRS LSLTVPLAGD AGHGGFLRVA
IREAALTNRC LVLDRAERLF AAPEGAPPRD PADLLAALAR PPVPVVAVGT APWDRHLSTT
TAPYLLAAPR LGPDECLRMW AAITGTPPPA GALAGLRLSP DLVDQLARSA TRLAAAEDVP
LTADLVRRVA RQLAGSAEPD AAVGLADLVL PDPTERAIRR LIGWARHREE LIARGLIVAG
PGSGGGITAL FSGSPGTGKT LAAHVVAAEL GIDVLRVDLA AVVDKYIGQT QKNLEQVFHR
AESLNVLLFF DEAEALFGRR SEVKDAHDRY ANQEVAYLLQ RMEQFDGITV LTTNLRGSLD
PAFSRRLSFI LHFPDPDEPT RRRLWLTHAA RLGPIDPDDP IDAGRLAATV ELSGGDIRNV
VVAAGYDAAI DGVRPGMRHL LDAAVAEYTK LGRRVPADLL DGARPR