Gene Namu_2006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2006 
Symbol 
ID8447615 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2214728 
End bp2215774 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content71% 
IMG OID645041134 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_003201380 
Protein GI258652224 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.00873285 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00397123 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGTCGGCG GTGTGGACAC CCATAAGGAC ACCCACACCG CGGCAGCGGT GGACACCGCG 
GGGCGGGTGT TGGGGTCGGC CCAGTTCCCC ACCGACGCCG CCGGCTACCG GGCGTTGCTA
CGGTGGCTGC GCGGGTTCGG GACGCTGCTG CTGGTCGGTG TCGAGGGCAC CGGTGTCTAC
GGGGCCGGCC TGGCCCGATT GCTGGCCGCC CAGGGCGTGG CCATGGTCGA GGTCGACCGG
CCCGACCGCA AGGCCCGCCG GTGGCAGGGC AAATCCGATC CCGTCGATGC CGAGGCTGCG
GCCCGGGCCG CGTTGGCCCG GGTGCGCACC GGGCTGCCCA AGCAGCGAGA CGGTCGTGTC
GAGGCGCTGC GGGCATTGCG GGTGGCGCGC CGTTCGGCCG TCGGGCACCG CGCTGACGTG
CAGCGACAGA TCAAGGCGCT GATCGTCACC GCACCGGAAT CGCTGCGCGC CCAGCTGCGG
GCGTTGCCCG ACCGAGAACT GATCAAGGTC TGCGCCGACC AGCGGCCGGA CCGTGCCGGT
GCCGGCGATC CGGGCACGGC CACCAAGATC GCGCTGCGCT CTCTTGCTCG GCGCCACCGG
GCGCTCAGCG TCGAGATCGC CGATCTCGAC GAGCTGCTCG GTCCGCTCGT GGCCCAGATC
AACCCCGGGC TGCTCGCACT CAAAGGCATC GGTCCCGACG TGGCCGGGCA GATGCTCGTC
ACGGCCGGCG AGAATGCCGA CCGCCTCACC AACGAGGCCG CCTTCGCGAT GCTGTGCGGC
GTGGCGCCCT TGCCTGCTTC GTCGGGCAGG ACGACCCGGC ACCGGCTCAA CCGCGGCGGA
GACCGAGCCG CCAATAGCGC ACTCTGGCGC ATCGTCATCA CCCGCATGGC CACCGACCAG
AGAACCAAGA ACTACATCGC CCGACGCACC GCCCAGGGGC TGACCAAGCC CGAGATCATC
CGCTGCCTCA AGCGATATGT CGCCCGAGAA GTCTTCCTCG CGCTTACGTC CGCGTCCGCA
GAAAAACGAC CCGCCAAAGC AGCTTGA
 
Protein sequence
MVGGVDTHKD THTAAAVDTA GRVLGSAQFP TDAAGYRALL RWLRGFGTLL LVGVEGTGVY 
GAGLARLLAA QGVAMVEVDR PDRKARRWQG KSDPVDAEAA ARAALARVRT GLPKQRDGRV
EALRALRVAR RSAVGHRADV QRQIKALIVT APESLRAQLR ALPDRELIKV CADQRPDRAG
AGDPGTATKI ALRSLARRHR ALSVEIADLD ELLGPLVAQI NPGLLALKGI GPDVAGQMLV
TAGENADRLT NEAAFAMLCG VAPLPASSGR TTRHRLNRGG DRAANSALWR IVITRMATDQ
RTKNYIARRT AQGLTKPEII RCLKRYVARE VFLALTSASA EKRPAKAA