Gene Namu_0221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_0221 
Symbol 
ID8445801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp249108 
End bp250154 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content71% 
IMG OID645039368 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_003199643 
Protein GI258650487 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones71 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCGGCG GTGTGGACAC CCATAAGGAC ACCCACACCG CGGCAGCGGT GGACACCGCG 
GGGCGGGTGT TGGGCTCGGC CCAGTTCCCC ACCGACGCCG CCGGCTACCG GGCGTTGCTA
CGGTGGCTGC GCGGGTTCGG GACGCTGCTG CTGGTCGGTG TCGAGGGCAC CGGTGTCTAC
GGGGCCGGCC TGGCCCGATT GCTGGCCGCC CAGGGCGTGG CCATGGTCGA GGTCGACCGG
CCCGACCGCA AGGCCCGCCG GTGGCAGGGC AAATCCGATC CCGTCGATGC CGAGGCTGCG
GCCCGGGCCG CGTTGGCCCG GGTGCGCACC GGGCTGCCCA AGCAGCGAGA CGGTCGTGTC
GAGGCGCTGC GGGCATTGCG GGTGGCGCGC CGTTCGGCCG TCGGGCACCG CGCTGACGTG
CAGCGACAGA TCAAGGCGCT GATCGTCACC GCACCGGAAT CGCTGCGCGC CCAGCTGCGG
GCGTTGCCCG ACCGAGAACT GATCAAGGTC TGCGCCGACC AGCGGCCGGA CCGTGCCGGT
GCCGGCGATC CGGGCACGGC CACCAAGATC GCGCTGCGCT CTCTTGCTCG GCGCCACCGG
GCGCTCAGCG TCGAGATCGC CGATCTCGAC GAGCTGCTCG GTCCGCTCGT GGCCCAGATC
AACCCCGGGC TGCTCGCACT CAAAGGCATC GGTCCCGACG TGGCCGGGCA GATGCTCGTC
ACGGCCGGCG AGAATGCCGA CCGCCTCACC AACGAGGCCG CCTTCGCGAT GCTGTGCGGC
GTGGCGCCCT TGCCTGCTTC GTCGGGCAGG ACGACCCGGC ACCGGCTCAA CCGCGGCGGA
GACCGAGCCG CCAATAGCGC ACTCTGGCGC ATCGTCATCA CCCGCATGGC CACCGACCAG
AGAACCAAGA ACTACATCGC CCGACGCACC GCCCAGGGGC TGACCAAGCC CGAGATCATC
CGCTGCCTCA AGCGATATGT CGCCCGAGAA GTCTTCCTCG CGCTTACGTC CGCGTCCGCA
GAAAAACGAC CCGCCAAAGC AGCTTGA
 
Protein sequence
MVGGVDTHKD THTAAAVDTA GRVLGSAQFP TDAAGYRALL RWLRGFGTLL LVGVEGTGVY 
GAGLARLLAA QGVAMVEVDR PDRKARRWQG KSDPVDAEAA ARAALARVRT GLPKQRDGRV
EALRALRVAR RSAVGHRADV QRQIKALIVT APESLRAQLR ALPDRELIKV CADQRPDRAG
AGDPGTATKI ALRSLARRHR ALSVEIADLD ELLGPLVAQI NPGLLALKGI GPDVAGQMLV
TAGENADRLT NEAAFAMLCG VAPLPASSGR TTRHRLNRGG DRAANSALWR IVITRMATDQ
RTKNYIARRT AQGLTKPEII RCLKRYVARE VFLALTSASA EKRPAKAA