Gene Namu_3774 details

Gene Information

Locus tag: Namu_3774
Symbol:
ID: 8449393
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4140858
End bp: 4142609
Gene Length: 1752 bp
Protein Length: 583 aa
Translation table: 11
GC content: 74%
IMG OID: 645042825
Product: type III restriction protein res subunit
Protein accession: YP_003203061
Protein GI: 258653905
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1061] DNA or RNA helicases of superfamily II
TIGRFAM ID:
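
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def span_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
print(span_bp(4140858, 4142609))        # compare with "Gene Length"
print(expected_protein_length(1752))    # compare with "Protein Length"
```

Under these assumptions the span is 1752 bp and the expected protein length is 583 aa, which agrees with the Gene Length and Protein Length fields.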


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.476443
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.253817
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGGGG TGGGGCAGCC GCCGCTGCGG CCTTGGCAGC GCGAGGCACT GGGGGCCTAC 
CATGCGGCGC ACGCCGGCGG GGGACGGCGC GACTTCCTGG TCACTGCCAC CCCGGGCGCC
GGCAAGACGA CGTTCGCCCT CGCGGTGGCC GGCGACCTGC TGCGCCGCCG GGTGATCGAC
CGCGTCGTCA TCGTCTGCCC GACCGACCAT CTGCGCACCC AGTGGGCCGA CGCGGCCGCC
GCCTTCAACC TGGTGCTGGA CCCGAAGATG TCCAACGCGC AGGGCCCGGT GCCGGCCGGC
TGCCAGGGCT ATGTCGCCAC GTACGCGCAG GTCGCGGCGC GCCCGGCCAT CCACCAGGCC
CGCAGCGGCA AGCTGCGCTC GTTGGTGGTG CTCGACGAGA TCCACCACGC CGGGGACGGG
TTGTCCTGGG GTGAGGCGGT CGGCGAGGCG TTCGGCGAGG TGCACCGGCG GCTGTCGCTG
ACCGGGACGC CGTTCCGGAC CCGGGCCGGC GAGCGGATCC CGTTCGTCGA GTACGAGATC
GACGGCGACC TGCTGCGTTC GGTCGCCGAC TTCACCTATG GCTACCGCCG GGCGCTGGCC
GACCGGGTGG TCCGGCCGGT CGTGTTCGCC GCCTACAGCG GCGTCTCCCG GTGGCGCAAC
AGCGCCGGCG AGGTGATCGC CGCCTCGCTC ACCGAGGCCG GCACCAAGTC GGTGGAGACG
GCGGCCTGGC GGACGGCGCT GGATCCGGCC GGTGGTTGGG TGCCGCACGT CATCGCGGCG
ATGGACGAGC GGATCAGCCA GTTGCGGTCC TCGGGCATTC CCGACGCCGC CGGGCTGGTC
CTGGCCAGTG ATCAGGACGA CGCCCGGGAC TACGCCGACG TGGTGCATCG CATCACCGGC
ACACGCCCGG TGCTGATCCT GTCCGACGAC GCCGCGGCAT CCAAGCGGAT CGAGCGGTTC
CGCGGCAGCG ACGAGCGGAT CGCGGTGTGT GTGCGGATGA TCTCCGAAGG CGTCGACATC
CCCCGCGCCG CCTGTCTGGC CTGGATGACC TCCTACCGGA CGCCGCTGTT CTTCGCCCAG
GCCGTCGGCC GCGTGGTCCG GGCCCGGGGG GCGCACGAGG CGGCGACGGT GTTCCTGCCG
GCCGTGCGGC CGCTGCTGGC CCTGGCCGCC GAGTTGGAGC AGGACCGCAA CTACGTCATG
GCGCCGCCCC CGCCGGTGCA GGACGACCTG GACGCGCTGG CCGATCCGCT GCCCCGGGAG
CCGGTCGAGC CGGGCAGCCG GAAGATCGAG GGCCTGGACT CGGAGGCCGA GTTCGCGCAC
GTGCTGCACT CGGGCCGGGC CGTGGTGTCC GGCGGGTCGG CGCCGGGCCG GGAGCCGGCC
GTCATGACGT CCGGGATGAT CGGCGAGGAG GACCAGGATT ATCTGGGGCT GCCCGGATTG
CTCAGCCCGG AACAGACCGC GGCCCTGCTG GCCACCCGGG ATTCCGACCT GCGGCGCCGG
GTGCGATCCA TGCCGCGGGA CCCGTTGGCC GACGCGCCGG ACCCGGACGC CGGGTCGATG
GCCGGCTGGC GGGCCGCCGC GGATCTGCGG CGTGAGGTCA ATCAGCTGGT TGCCCGGGTC
GCGGCGCGGA CCGGCAAGCC GCACGCCAGC GTGCACTCCC AGGTGCGTCG GGCGGTGCCC
GGACCGGCCT CGGCGGCCGC CGACCCGGAC GTGCTGACCG CCCGCCGGGA CCACCTGCTG
GGCCTGCTGT AA
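
The GC content field can in principle be recomputed from the gene sequence. This is a minimal sketch that assumes the sequence is pasted as text in the 10-base grouped layout used above; `gc_content` is a hypothetical helper, not part of any database tooling, and the record does not say how its own percentage was rounded.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    dna = "".join(sequence.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = dna.count("G") + dna.count("C")
    return gc / len(dna)


# First 20 bases of the gene sequence, as grouped in the record.
print(f"{gc_content('GTGAGCGGGG TGGGGCAGCC'):.0%}")
```

Applying the same function to the full 1752 bp sequence gives a value to compare with the GC content field.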
 
Protein sequence
MSGVGQPPLR PWQREALGAY HAAHAGGGRR DFLVTATPGA GKTTFALAVA GDLLRRRVID 
RVVIVCPTDH LRTQWADAAA AFNLVLDPKM SNAQGPVPAG CQGYVATYAQ VAARPAIHQA
RSGKLRSLVV LDEIHHAGDG LSWGEAVGEA FGEVHRRLSL TGTPFRTRAG ERIPFVEYEI
DGDLLRSVAD FTYGYRRALA DRVVRPVVFA AYSGVSRWRN SAGEVIAASL TEAGTKSVET
AAWRTALDPA GGWVPHVIAA MDERISQLRS SGIPDAAGLV LASDQDDARD YADVVHRITG
TRPVLILSDD AAASKRIERF RGSDERIAVC VRMISEGVDI PRAACLAWMT SYRTPLFFAQ
AVGRVVRARG AHEAATVFLP AVRPLLALAA ELEQDRNYVM APPPPVQDDL DALADPLPRE
PVEPGSRKIE GLDSEAEFAH VLHSGRAVVS GGSAPGREPA VMTSGMIGEE DQDYLGLPGL
LSPEQTAALL ATRDSDLRRR VRSMPRDPLA DAPDPDAGSM AGWRAAADLR REVNQLVARV
AARTGKPHAS VHSQVRRAVP GPASAAADPD VLTARRDHLL GLL
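
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below rebuilds that codon table from the compact NCBI layout and translates a coding sequence. It assumes that an alternative initiator codon such as GTG in the first position is read as methionine and that translation stops at the first stop codon; `translate_cds` is an illustrative name, not a library function.

```python
BASES = "TCAG"
# Amino acids for translation table 11, in NCBI order (base order T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiator codons listed for table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, ignoring spaces and line breaks."""
    dna = "".join(sequence.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(dna), 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # assumption: initiator codon read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAGCGGGG TGGGGCAGCC GCCGCTGCGG"))
```

The first ten codons give MSGVGQPPLR, matching the start of the protein sequence, and the final codons GGC CTG CTG TAA give GLL followed by a stop.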