Gene Namu_1906 details


Gene Information

Locus tag: Namu_1906
Symbol:
ID: 8447513
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 2099063
End bp: 2100883
Gene Length: 1821 bp
Protein Length: 606 aa
Translation table: 11
GC content: 69%
IMG OID: 645041036
Product: Rhs element Vgr protein
Protein accession: YP_003201284
Protein GI: 258652128
COG category: [R] General function prediction only
COG ID: [COG3500] Phage protein D
TIGRFAM ID: [TIGR01646] Rhs element Vgr protein
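
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2099063
end_bp = 2100883
gene_length_bp = 1821
protein_length_aa = 606

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and does not encode a residue.
coding_codons = codons - 1

print("span (bp):", span_bp)
print("leftover bases:", remainder)
print("codons excluding stop:", coding_codons)
```

Under these assumptions the span matches the stated gene length, and the codon count minus the stop codon matches the stated protein length.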


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0194585
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0014616
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCGCCCA CCTCGTCCAG CTTCCTGGTC GACATCGACG GCTCGCCGCT GGCCGCCGAC 
GCCAAGGCAC TGCTGGTCTC GGCGATCGTC GACGACAGCC TGCGGCTGCC CGACTTCTTC
CTGCTCCGCT TCCGTGATCC GGACCGGTTG GTGATCACCA AGTCCGGCGC CAAGATCGGG
TCCAAGGTCA AGGTCAGCGT GGCCACCGAC GCCGCGCCCA GCCCGTTGCC GCTGATCGAG
GGGGAGATCA CCGCGCTGGA GGCCGAGTAC GACGCCACCG GCACCTACAC CGTCATCCGC
GGCTACGACC AGGCCAACCG GCTCTTCCGG GGCCGGCGCA CCGAGTCCTA CACCCAGAGC
ACCGCCTCGG ACGTGGCCAC CAAGGTGGCC CAGCGGGCGG GGCTGTCCAT CGGCGAGGTC
GAGTCGACCA GCACCGTCTA CGAGCACCTG TCCCAGGGCG GGGTCACCGA CTGGGAGTTC
CTGGACGGCC TGGCCCGCGA GATCGGCTAC GAGATCGCCG TCAAGGACGG CAAGTTCGAC
TTCCGCAAGC CGAAGAAGGC CGACACCGCC CCGGCCGGCG GCGGGGGACC GGAGCAGCAG
AACCCGTTGG TGCTGCGGCT GGGTACCGAC CTGCTGCGGT TCCGGTCACT GATCACCGCG
GCCGAGCAGG TCAAGGAGGT GCAGGCCCGC GGCTGGGACC TGGCCCAGAA GAAGGCGTTC
GTGGCCACCG CGCCGGCCGC GACCACCTCG GCCGTGCTGC CCGCCTACAA CCCGGTCGAC
ATCGCCAAGA AGTTCGGCGA TCCGGTCTAC GTGGCCACCG ATGTCGCCTA CCGCAGCCAG
GCCGAGGTGG ACAGCGCGGC CGCGGCCATC GCCGAGCAGA TCGCCGGTGC CTTCGCCGAG
TTCGAGGGTG TCGCCCGGGG CAATCCGAAA CTGCACGCCG GGGCGGCGAT CTCGGTGGAC
AACGTGGGGG CCCCGTTCGA CGGGAAGTAC ACCATCACCT CCTCCCATCA TCGCTACGAC
CCGAACACCG GCTACACCAC GATGTTCTCG GTCACCGGCC GCTCGGAACG CAGCCTCTAC
GGGCTGGCCA ACGGCGGTGG TGGCGGCAAG CTCGGGCAGG GGCCGGTCGT CGCGCAGGTC
AGCGACGCCA AGGACCCGCT GGAACAGGGC CGGGTCAAGC TGACCTTCCC CTGGTTGTCG
GACACCTATG TCAGCGACTG GGCCCGCACC GTGCAGCCGG GGGCCGGCAA GGACCGGGGG
GCGCTGGTGC TGCCCGAGGT CGGCGACGAG GTGCTGGTGC TCTTCGAGCA GGGCGACATC
CGCCGGCCCT ACGTGCTCGG TGGCCTGTTC AACGGAGTGG ACACCGCACC CAAGGGCAAA
CCCGACCTGA TCGACGGCAG CTCCGGAGCG ATCAACCGGC GCTCGTTCGT CTCCCGCCGC
GGTCACCGCA TCGACCTGAT CGACGAGGAC GGCCGGACCG AGGGCATCAC GCTGTCCACC
ACCGGCGACA AGCTGCAGCT CAAGCTGGAC TCGGTCGAGA CCAAGATCAC CGTGCACAGC
GACGGCAAGA TCCTGATCGA GGGCAAGGGC GGCGTGCTGA TCGACTCGGC CAGCAGCAAG
CTCGAACTCA AGGGCGGCGA GGTCTCGATC ACCTCCACCA GCGGGGTCAA GATCGACGGC
GGCAGCGGTG GGGTGGACGT GCAGACCAAC GGCCAACTCT CGCTCAAGGG CAGCACCGCC
AAGCTGGAGG GTCAGGCCAG CGCCGAGGTC AAGGCCAGCG GCGTGCTGAC CGTCCAGGGT
TCCCTGGTCA AGATCAACTG A
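
A GC content figure like the one listed under Gene Information can be derived from a nucleotide sequence by counting G and C bases. The sketch below is a minimal illustration, not the database's own method: the function name `gc_content` is hypothetical, and it assumes the sequence is given in the space-separated blocks of ten shown above.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the gaps between blocks of ten and line breaks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Example on the first two blocks of the gene sequence only.
fragment = "ATGGCGCCCA CCTCGTCCAG"
print(round(gc_content(fragment), 1))
```

Passing the full gene sequence, with its line breaks and spaces left in place, works the same way because all whitespace is stripped before counting.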
 
Protein sequence
MAPTSSSFLV DIDGSPLAAD AKALLVSAIV DDSLRLPDFF LLRFRDPDRL VITKSGAKIG 
SKVKVSVATD AAPSPLPLIE GEITALEAEY DATGTYTVIR GYDQANRLFR GRRTESYTQS
TASDVATKVA QRAGLSIGEV ESTSTVYEHL SQGGVTDWEF LDGLAREIGY EIAVKDGKFD
FRKPKKADTA PAGGGGPEQQ NPLVLRLGTD LLRFRSLITA AEQVKEVQAR GWDLAQKKAF
VATAPAATTS AVLPAYNPVD IAKKFGDPVY VATDVAYRSQ AEVDSAAAAI AEQIAGAFAE
FEGVARGNPK LHAGAAISVD NVGAPFDGKY TITSSHHRYD PNTGYTTMFS VTGRSERSLY
GLANGGGGGK LGQGPVVAQV SDAKDPLEQG RVKLTFPWLS DTYVSDWART VQPGAGKDRG
ALVLPEVGDE VLVLFEQGDI RRPYVLGGLF NGVDTAPKGK PDLIDGSSGA INRRSFVSRR
GHRIDLIDED GRTEGITLST TGDKLQLKLD SVETKITVHS DGKILIEGKG GVLIDSASSK
LELKGGEVSI TSTSGVKIDG GSGGVDVQTN GQLSLKGSTA KLEGQASAEV KASGVLTVQG
SLVKIN