Gene Ndas_1428 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1428 
Symbol 
ID9245278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1748150 
End bp1749307 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content67% 
IMG OID 
Producttransposase, IS605 OrfB family 
Protein accessionYP_003679366 
Protein GI297560392 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.996106 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGTGTGA AGATCGTTGT GCAGGTGAAG CTGCTCCCCG ACTCCGCTCA GGAGGCGACG 
CTGTGGGAGA CGCTGAAGTT GTGCAACCAG GCGGCCAACC GGGCCTCGCG GAGTGCGCAC
GAGTCCGGGG TGAAGGCGAA GACACCGTTG CAGCGGCTGG TCTACGACGA CCTCAAAGCG
ATGGGGTTGT CCGCGCAACC GGCGATCCAC TGCGCACGCA AGGTCGCCGG GGCCTATGCC
ACGCTGAAGG CCAACCTCAA GGCCGGAAAC CACGGCTGTG AGGGGTCCAA GCGGCGTACC
CGGGTGGAGA ACACGCCGGT TCGGTTCCGA AAAGACGCCG CGCAGCCCTT CGACGACCGG
TGCCTGTCCT GGCAGATAGA CGCGCGCACG GTGTCGATCT GGACCGTGCA CGGCCGGGTG
AAGGGCATGG CGTTCGCGTG TTCACCGGAC CAGGCCAAGG TGCTCACCGA GTTCCGCAAG
GGCGAATCCG ACCTGGTCAA GCGCGGCGAC AACTGGTACC TGTACGCCAC CTGCGAGGTT
CCCGAGGCCG ACGAGTACAC CCCCGAAGGT TTCGTGGGTG TGGACCTCGG CATCGCGAAC
ATCGCCACCA CCAGCGACGG CACCGTTCAC TCGGGCAAAC AGGTCAACCA GGTCCGCCAC
CGCAACCGCC ACCTCAGGCG CCGTCTCCAG AAGAAGGGAA CCAAGTCCGC CAAGCGTGTC
CTGCGCCGCC TCTCGGGCCG CGAGGCGCGG TTCGCGGCCG ATACCAACCA CCGCATCGCC
AAGCAGATCG TGACCGAGGC TCAACGCACC TCACGGGGTG TCGCCCTGGA AGACCTGGGC
GGCATCCGCA AGAGGGTACG GCTACGCAAG CCCCAGCGGG TCACGCTGCA CTCCTGGTCC
TTCGCTCAAC TGGGCGCTTA CATCGCCTAC AAGGCGAAAC GGGTCGGAGT GCCGGTGGTG
TACGTGGACC CGGCCTACAC CTCCCAGGGG TGTAGCGCGT GTGGCCACGT CGACAAGAGG
AACCGGCCGA GCCAGGCCAC TTTCCGATGC ACGTCGTGCG GTTTCGCTGG GCACGCCGAT
GTGAACGCGG CCCGCAACAT CTCCTCCAGG GGTGTGGCGG GCTGGGCAGT GAGTCACGCT
GCCGACGACG CGGCCTGA
 
Protein sequence
MGVKIVVQVK LLPDSAQEAT LWETLKLCNQ AANRASRSAH ESGVKAKTPL QRLVYDDLKA 
MGLSAQPAIH CARKVAGAYA TLKANLKAGN HGCEGSKRRT RVENTPVRFR KDAAQPFDDR
CLSWQIDART VSIWTVHGRV KGMAFACSPD QAKVLTEFRK GESDLVKRGD NWYLYATCEV
PEADEYTPEG FVGVDLGIAN IATTSDGTVH SGKQVNQVRH RNRHLRRRLQ KKGTKSAKRV
LRRLSGREAR FAADTNHRIA KQIVTEAQRT SRGVALEDLG GIRKRVRLRK PQRVTLHSWS
FAQLGAYIAY KAKRVGVPVV YVDPAYTSQG CSACGHVDKR NRPSQATFRC TSCGFAGHAD
VNAARNISSR GVAGWAVSHA ADDAA