Gene Namu_1329 details

Gene Information

Locus tag: Namu_1329
Symbol:
ID: 8446925
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 1458918
End bp: 1461923
Gene Length: 3006 bp
Protein Length: 1001 aa
Translation table: 11
GC content: 74%
IMG OID: 645040462
Product: hypothetical protein
Protein accession: YP_003200721
Protein GI: 258651565
COG category:
COG ID:
TIGRFAM ID:
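
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
# Values copied from the record above.
START_BP = 1458918
END_BP = 1461923


def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the final codon is the stop codon
    return gene_len, protein_len


print(cds_lengths(START_BP, END_BP))  # prints (3006, 1001)
```

Under these assumptions the result agrees with the Gene Length (3006 bp) and Protein Length (1001 aa) fields.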


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.031741
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGCGGA TCCGCGCCCG CTGGACCGGT TCCACCCGGT CCGTCCGGTC CACGGTGGTG 
ATCGTGGTGC TGACCCTGCT GGCCTCGGTG CTGGTGGTAC TGGGGGTCAA CGCCCAGGGC
TACCCGGTGA CCGACGTCAA CCTGGCGTCC TCGACGGTCT GGGTGACGAA CGAGGCCAAG
GGCGTGGTCG GCCGGGTGAA CCGGCAGATC GACGAGCTCA ACTCGTCCGT CAAGGCCAAC
AAGCCCGGTT TCGACGTGCT GCAGGACGGC GACGCCGTGC TCGTGGTCGA CCGGATCAAG
AACGAGGTCC GGCCGGTCGA CGTCGCCGCG GTGGTGCTGA CCTCGCGGAT CGGCCTGCCG
GAGAACGCCT CCGTCGGATT CGGCGGCGGC ACCCTGGCCG TGGCCGACCC GGTCACCGGC
CAGCTGTGGG CCGGCACCCC GGACTCGCTG ACCGGCCTGG ACCCGGCGTC GACCGAGCCG
CTGCTGACCG CGGGGGCCGG GGCGCGGGTG GCCGTCTCGG TCGGCGGGAC CGTCTTCGCG
GTCGCCCCGG GCAGCGACGC GCTGTGGTCG GCGCCGATCG GGGACACCGG CGAGGTGACC
GGCTGGGCCG TCGACGACGC CGGCGCCCCG GTCGCCCCGG AACCGGCCCG GCTGCCCGGC
GGCCCGCTGA CCCCGGTGGC CGCCCAGGTG GTCTCGCAGC AGGCGCCGAC GGTGGATCTG
ACCGCGGTCG GCGATGTCCC GGTGGTACTG GATCGGGCGA GCGGGGAGCT GATCCTGCCC
GACCGGCGGA TCGCGATCCC CGGCGGCGCG GAGGCGGTGC TGCAGCAGCC GGGACCGGCC
GCCGACGCGG TGCTGGTCGC CACCGGGACG GCGTTGCTGC GGGTGCCGCT GACCGGCGGC
GACCCGCAGA CCGTGGTGGA CGGCGTCACC GGCACCCCGA GCGCGCCGGT GGCGCTGGAC
GGCTGCGCGC ACGCGGCCTG GGCCGGCCAG CAGCCCCGTT ACGCCGTCGC CTGCGGGGAC
GACCCGGCCC GGCTGATCGA CGTGCCGAAC GCGGCCGCGG CCGCCCGGCT GGTCTTCCGG
GTGAACCGGA ACGTGATCGT GCTCAACGAC CAGGCCACCG GCGACATCTG GCTGGTCGAC
GCGGACATGA AGCTGATCAG CAACTGGGAC GACGTGCAGC GTCAGGACTC CAGCAGCGAT
CAGACCTCGC CGGACGGCGA GCAGAACAGC TCCGACCAGC TCACCAACTC CCGCACGGAC
TGCGCGGCCA CCTCCGCCGC GGGCACCCCG CCGCAGGCCG CGGCCGACGA GTTCGGGGTC
CGCGCCGGCC GGACCACGGT GCTGCGGGTG CTGGACAACG ACTCCTCGGC CGACTGCTCC
ATGCTGGCGA TCAGCGCGGT CGCCCCGCTG CCGGCCGAGC AGGGCACGGT CGCCGTCGCC
GAGGGCGGTC AGGCGCTCCA GCTGACCGTG CCGGCCACCG CCACCGGGCC GCTGCCGACC
ATCGACTACA CCGTCGACGA CGGCCTGGGC CGCACCTCGA CGGCGCAGGT CAGCGTGAGC
GTGGCCGCCG CCGACGACGT GCGGGCGCCC CGCAAGCTGC GCGACTCGGC GACCCTGGTC
GAGGTCGGCG GCACGGTCTC CTACAACGTG CTGCCCGACT TCCGCTCGCC GGTCGGCGAG
GATCTGTCGC TGATCTCGGC CACCGCCACC ACCGACGACT CGGTGACCTT CCAGCCCAAC
GGCCTGGTCA CCTTCCGGGA CACCGGATCG GCCGGCGCGA TCAAGAAGAC CGTCGACCTG
GTGATCTCCG ACGGCACCAG CCAGATCAAC GGCGCCCTGA CCATCGACGT CAAGGCCGAG
GGCACCACCA CCCCGGTGCC CGGCCCGGTC GCCGCCAGCG GTGTCGTCGA CGAGGCGGTC
ACCGTGAACC CGTTGCGCAG CGTGCTGTCC GGGCTGCGCG ACCCGGCCCG GGTCACCGCG
GTGCAGCCGC TGGTGGCGGC CACCCCGGAC CGGCCGGCAT CCGCGAGCGC CACCCTGAAC
CCGCTGGATT CCACCGTCGT GCTCACCGGG ACCGCCCCGG GCACCAGCTA CTTCCTGTAC
ACGGTGGTCG CCGGCTCGGC CAGCGCCACC GGGGTCATCC GCTTCGACGT GACCGCCCCG
CCCGACACCC CGGCCCCGCC CGTGGTCACC CCGGACGTCG GCTACCTGGT GCCCGGCCGG
ACCCTGGTGC TCGACCCGCT GGCCAACGAC CGCGACCCGA TGGGCGCGGT GCTCTCGATC
CAGCAGCTGA GCCAGCCGGC GGACAGTCCG CTGACCGCGA CCGTGCACGA CCTGAGCCTG
CTGCAGGTCT CCTCCGCGCG CAGCGTGCCG GCCACCGGCG TCACCCTGAC CTACACCGCG
GCGAACACGG CCGGGTCGAC GACCGGCCAA ATCCGGGTGA TCCCGGTGCC CGCGCCGCTG
ACCCCGCAAC CCCCGGTCGC CGCCGACATC GCGGTGACGG TGCGGGCCGG GGACGCGGTG
ACCGTGCCGA TCTCCCGGTA CGCCACCGAC CCGACCGGCC AGGTGCTGAC GGTCAAGCCG
TTCCCCGACG GCACCCTGCC GGCCGACCAG GGCCTGGTGT TCGCCACCGA ATCGGCGATT
CGCTACCTGG CCCCGGCCAC CCCGCCGCCG ACCTCCGTCC GGTTCAGCTA CACGGTCGTC
AACACCGATC AGCTCACCGA CACCCGGTCG GTCACCATCT CGGTGCTGCC CGCCGATCCG
GCAATCAACA CCGCCCCGCA GACCCCGCCG CTGACCACCG CGCGGGTGTT CGCCGGCCGC
ACCGCGACGA TCCCGCTGCC GTTGGACGGG CTCGACCCGG ACGGCGACTG GGTCACCTTC
GCCGGGCAGA CCGATCCGGC GCCGACCCTG GGCCGGGTGG ACAAGGCCGG CCCGGCCACC
CTGACCTACA CCGCGCTGGG CGCCCCCGGC CTGGACGCCG CCGGCTACCT GGTCACCGAC
CCGTAA
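
The gene sequence is displayed in space-separated groups of 10 bases, so it has to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction that could be compared with the GC content field in the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed line is used as a sample, so the printed value describes that sample and not the whole gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a sequence shown in 10-base groups."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used as a small sample.
sample = "ATGAGGCGGA TCCGCGCCCG CTGGACCGGT TCCACCCGGT CCGTCCGGTC CACGGTGGTG"
seq = clean_sequence(sample)
print(len(seq), round(gc_fraction(seq), 3))
```

To check the full gene, paste all of the sequence lines into `sample`; the cleaned length should then match the Gene Length field.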
 
Protein sequence
MRRIRARWTG STRSVRSTVV IVVLTLLASV LVVLGVNAQG YPVTDVNLAS STVWVTNEAK 
GVVGRVNRQI DELNSSVKAN KPGFDVLQDG DAVLVVDRIK NEVRPVDVAA VVLTSRIGLP
ENASVGFGGG TLAVADPVTG QLWAGTPDSL TGLDPASTEP LLTAGAGARV AVSVGGTVFA
VAPGSDALWS APIGDTGEVT GWAVDDAGAP VAPEPARLPG GPLTPVAAQV VSQQAPTVDL
TAVGDVPVVL DRASGELILP DRRIAIPGGA EAVLQQPGPA ADAVLVATGT ALLRVPLTGG
DPQTVVDGVT GTPSAPVALD GCAHAAWAGQ QPRYAVACGD DPARLIDVPN AAAAARLVFR
VNRNVIVLND QATGDIWLVD ADMKLISNWD DVQRQDSSSD QTSPDGEQNS SDQLTNSRTD
CAATSAAGTP PQAAADEFGV RAGRTTVLRV LDNDSSADCS MLAISAVAPL PAEQGTVAVA
EGGQALQLTV PATATGPLPT IDYTVDDGLG RTSTAQVSVS VAAADDVRAP RKLRDSATLV
EVGGTVSYNV LPDFRSPVGE DLSLISATAT TDDSVTFQPN GLVTFRDTGS AGAIKKTVDL
VISDGTSQIN GALTIDVKAE GTTTPVPGPV AASGVVDEAV TVNPLRSVLS GLRDPARVTA
VQPLVAATPD RPASASATLN PLDSTVVLTG TAPGTSYFLY TVVAGSASAT GVIRFDVTAP
PDTPAPPVVT PDVGYLVPGR TLVLDPLAND RDPMGAVLSI QQLSQPADSP LTATVHDLSL
LQVSSARSVP ATGVTLTYTA ANTAGSTTGQ IRVIPVPAPL TPQPPVAADI AVTVRAGDAV
TVPISRYATD PTGQVLTVKP FPDGTLPADQ GLVFATESAI RYLAPATPPP TSVRFSYTVV
NTDQLTDTRS VTISVLPADP AINTAPQTPP LTTARVFAGR TATIPLPLDG LDPDGDWVTF
AGQTDPAPTL GRVDKAGPAT LTYTALGAPG LDAAGYLVTD P