Gene Namu_1872 details

Gene Information

Locus tag: Namu_1872
Symbol:
ID: 8447479
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 2056544
End bp: 2057590
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 71%
IMG OID: 645041002
Product: transposase IS116/IS110/IS902 family protein
Protein accession: YP_003201250
Protein GI: 258652094
COG category:
COG ID:
TIGRFAM ID:
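
The coordinate and length fields in this record are mutually consistent, which a short calculation can confirm. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1


length = gene_length(2056544, 2057590)
print(length)                  # 1047
print(protein_length(length))  # 348
```

Both values agree with the Gene Length and Protein Length fields of the record.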


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.137932
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.00933127
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGTCGGCG GTGTGGACAC CCATAAGGAC ACCCACACCG CGGCAGCGGT GGACACCGCG 
GGGCGGGTGT TGGGGTCGGC CCAGTTCCCC ACCGACGCCG CCGGCTACCG GGCGTTGCTA
CGGTGGCTGC GCGGGTTCGG GACGCTGCTG CTGGTCGGTG TCGAGGGCAC CGGTGTCTAC
GGGGCCGGCC TGGCCCGATT GCTGGCCGCC CAGGGCGTGG CCATGGTCGA GGTCGACCGG
CCCGACCGCA AGGCCCGCCG GTGGCAGGGC AAATCCGATC CCGTCGATGC CGAGGCTGCG
GCCCGGGCCG CGTTGGCCCG GGTGCGCACC GGGCTGCCCA AGCATCGAGA CGGTCGTGTC
GAGGCGCTGC GGGCATTGCG GGTGGCGCGC CGTTCGGCCG TCGGGCAACG CGCTGACGTG
CAGCGACAGA TCAAGGCGCT GATCGTCACC GCACCGGAAT CGCTGCGCGC CCAGCTGCGG
GCGTTGCCCG ACCGAGAACT GATCAAGGTC TGCGCCGACC AGCGGCCGGA CCGTGCCGGT
GCCGGCGATC CGGGCACGGC CACCAAGATC GCGCTGCGCT CTCTTGCTCG GCGCCACCGG
GCGCTCAGCG TCGAGATCGC CGATCTCGAC GAGCTGCTCG GTCCGCTCGT GGCCCAGATC
AACCCCGGGC TGCTCGCACT CAAAGGCATC GGTCCCGACG TGGCCGGGCA GATGCTCGTC
ACGGCCGGCG AGAATGCCGA CCGCCTCACC AACGAGGCCG CCTTCGCGAT GCTGTGCGGC
GTGGCGCCCT TGCCTGCTTC GTCGGGCAGG ACGACCCGGC ACCGGCTCAA CCGCGGCGGA
GACCGAGCCG CCAATAGCGC ACTCTGGCGC ATCGTCATCA CCCGCATGGG CACCGATGAT
CGAACCAAGA ACTACATCGC CCGACGCACC GCCCAGGGGC TGACCAAGCC CGAGATCATC
CGCTGCCTCA AGCGATATGT CGCCCGAGAA GTCTTCCTCG CGCTTACGTC CGCGTCCGCA
GAAAAACGAC CCGCCAAAGC AGCTTGA
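
The GC content field of the record (71%) can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases after whitespace is removed; `gc_percent` is a hypothetical helper name. It is demonstrated only on the first ten-base block of the sequence, not on the full gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First ten-base block of the gene sequence above.
first_block = "GTGGTCGGCG"
print(gc_percent(first_block))  # 80.0
```

Passing the complete gene sequence, spaces and line breaks included, to the same function gives a value to compare against the reported figure.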
 
Protein sequence
MVGGVDTHKD THTAAAVDTA GRVLGSAQFP TDAAGYRALL RWLRGFGTLL LVGVEGTGVY 
GAGLARLLAA QGVAMVEVDR PDRKARRWQG KSDPVDAEAA ARAALARVRT GLPKHRDGRV
EALRALRVAR RSAVGQRADV QRQIKALIVT APESLRAQLR ALPDRELIKV CADQRPDRAG
AGDPGTATKI ALRSLARRHR ALSVEIADLD ELLGPLVAQI NPGLLALKGI GPDVAGQMLV
TAGENADRLT NEAAFAMLCG VAPLPASSGR TTRHRLNRGG DRAANSALWR IVITRMGTDD
RTKNYIARRT AQGLTKPEII RCLKRYVARE VFLALTSASA EKRPAKAA
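
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration, not the method used to build this record. It assumes that translation table 11 assigns amino acids exactly as the standard genetic code does and differs only in its permitted start codons, so that the leading GTG is read as methionine. The start-codon set is deliberately reduced to three common codons, and all names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplified subset of bacterial start codons (assumption for this sketch).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an alternative start codon as M and stopping at a stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "GTGGTCGGCG GTGTGGACAC CCATAAGGAC ACCCACACCG CGGCAGCGGT GGACACCGCG"
print(translate_cds(first_line))  # MVGGVDTHKDTHTAAAVDTA
```

The output matches the first twenty residues of the protein sequence listed above.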