Gene Namu_3521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3521 
Symbol 
ID8449140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3865724 
End bp3867805 
Gene Length2082 bp 
Protein Length693 aa 
Translation table11 
GC content71% 
IMG OID645042599 
Producthypothetical protein 
Protein accessionYP_003202835 
Protein GI258653679 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.000399321 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.178114 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGCAG TTCAGGTCGG CCGGGCCGCC GTCGGCCGGG AGCCCGCTGC GGCTCCCGCT 
GCCGACGAGA CCGTTGATGC AGCGGCCGCG CCGCCGAGTG TCCTTGCCCC GCCGGCGATG
CCGGGCATCG GGCCGGTCGC GGGACCGGGC TCCGCCGGTG CGCTGCGCCG CCACGGTGGG
GAGCTCCACG ACCCGCTCGG CGGATCGCCG GTCAGCCCGG AGGTCAACGG GGCGTTGGCC
CGGCTGCAGG GTCGCGGCCG CCCGCTGCCC GAGAGCGTCG CCGGGCCGAT GAGCCGGGCC
ATGGGCGCCG ACCTGGGCGG CGTCCGGATC CACACCGGGG CCGAGCCGGC CCGGCTCGCC
CGGTCGGTGC AGGCCACCGC GTTCACCCTG GGCCGCGACA TCTTTTTCGG CGCCGGCCAT
TACGCGCCCC AGACCTCCGC CGGCCAACGG CTGCTGGCCC ACGAGCTGGC CCACACGATC
GCGCCGGGCG GCGGGTCGGC ATCGGCATCG GGTCCGATCA TCGGGCGCGC CGCCGATCCC
GTGGAGGCCC AGGCCGATCG GGTCGCCGAC GACGCCCTGC GGGTCCTGCG CCGCCAGCCC
GCCGCGCCGG CGGCGCCGGA CGAGCACCCG GCGGCACCGG CGCCGCTGTC GCTGCTGCGC
AGCCGGTCGG CCGGCGGTGA TCGATTGCGG CGCAAGGTGG GCTTCGAGGC CGAGCTGATG
GTGCCGAGTC TGGGCCCGAG CGCCAACCAG CTCACCTACG CCAAGGAACC CGGCAAGGTC
ACCGACTCGA TCAAGTCGTT CCTGGACGGC GGGGTGGCCT ACGGCACCGA CATCGGCGGC
AAGGACAGCG GTGCCGACGT CCGGCTGGAC AGCGATCACG GCGCATCGGT CGACCGGCGG
CCGATCGTGA ACAAGCTCAT CGAACTGGGC TACGTCACCG GGACACCGTC CGAGCCGCGG
ACGAAGATCG AATTCGTCAC CACCGCCATG GACGAACTGG CCCCGGGGTC CACGCGCCGG
GTCAAGGAGG TCGTGGGCAA GCTGCGCGGC CAGCTGAGCG CGGCCCTGAC CCAGGCGCAA
AGTGGGGAAC TCCACCAGCT CGGGGCGCCG GCGAAGGCCG GGTACAAGAC GGGAGTGCCC
GTCGCCGATC TGAAGGCCTG GCTGGGCGCG GACTACGCCG AACTGGATAC GGTGGTCAAG
GAGTACCTGA CCGCTGGGGT CAAGGACGAG GTGTACCTGC AGGCGACCGT GGGGGTGATC
CCGTCCTCGC TGATGACCTT CTTCGCTCGG GCGTCCCTGC CGGGCAAGGT CCAGGTCGCG
CCGCCGTCGG CGGCCCGCCA GCAGATCCTG GGCATGGTCG CCGAGGTGGT GGCCGCGTTC
GAGACCAAAT TCTCGACGGC TCCCGAGGAC CACTGGGTGC GCCAGCTCGG CGCCACCTCC
AGCCATGCGT TCCTGGGCTT GTTGGGTCTG ATCTACAGCT ACCTGCTGGG TGACACGTTG
CACCAGACGT CCGACGGGAC AGAGTCCACC GTCAAGAACG CGGTGCCGTT CCTGATCAAG
ATGAGCCCGT ACGGGCTGGT GGCCAAGACC GCCCCGCACG CGCTCAAGGA CAACCCGCCA
CCGCGCGAGT TCGTCCGGAG CATCGGTGAC CTGATGAAGA AGACCAAGTA CCTCCAGCTG
GCCTACTGGG TCGAGGAGTC CCGCAAGGAC GGCACCGCCG TGAGTGACGG CAAGCTCCCG
GCCAAGCTCG ACGCGCGCCC GCGCGCCGAG CGGCTGATCA CCGGCGACTA CACCGATTTC
GTCGAGAAGG TGCTGCTGGG AACGGGCGGC GCGGTACCGG TCGTGGTCGG CAAGATGCTG
CCGGGGCCGG ACAAGCCGCC GACCGATACG GGCGGGGTGA ATGTTTTCCA CGAGCTCTAC
AACCAGCAGG GCATCCCGCT GGAGTACCGG GCCATCTCCA AGCGCTATAC GGTGTCCGAG
GTCACGGCGG CCCTCGGCGA GATCCTCGCC GAGCTCCGCG TGATCAACCT GTCCGGTCTC
ACCGAGGAAC AGCGGGCCAC CGTGACGGAG GCGTTCAAAT AG
 
Protein sequence
MRAVQVGRAA VGREPAAAPA ADETVDAAAA PPSVLAPPAM PGIGPVAGPG SAGALRRHGG 
ELHDPLGGSP VSPEVNGALA RLQGRGRPLP ESVAGPMSRA MGADLGGVRI HTGAEPARLA
RSVQATAFTL GRDIFFGAGH YAPQTSAGQR LLAHELAHTI APGGGSASAS GPIIGRAADP
VEAQADRVAD DALRVLRRQP AAPAAPDEHP AAPAPLSLLR SRSAGGDRLR RKVGFEAELM
VPSLGPSANQ LTYAKEPGKV TDSIKSFLDG GVAYGTDIGG KDSGADVRLD SDHGASVDRR
PIVNKLIELG YVTGTPSEPR TKIEFVTTAM DELAPGSTRR VKEVVGKLRG QLSAALTQAQ
SGELHQLGAP AKAGYKTGVP VADLKAWLGA DYAELDTVVK EYLTAGVKDE VYLQATVGVI
PSSLMTFFAR ASLPGKVQVA PPSAARQQIL GMVAEVVAAF ETKFSTAPED HWVRQLGATS
SHAFLGLLGL IYSYLLGDTL HQTSDGTEST VKNAVPFLIK MSPYGLVAKT APHALKDNPP
PREFVRSIGD LMKKTKYLQL AYWVEESRKD GTAVSDGKLP AKLDARPRAE RLITGDYTDF
VEKVLLGTGG AVPVVVGKML PGPDKPPTDT GGVNVFHELY NQQGIPLEYR AISKRYTVSE
VTAALGEILA ELRVINLSGL TEEQRATVTE AFK