Gene Ndas_2016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2016 
Symbol 
ID9245866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2436030 
End bp2437061 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content76% 
IMG OID 
ProductPirin domain protein 
Protein accessionYP_003679948 
Protein GI297560974 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.656131 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCAATC TGGACACCCA TCCCGCCGAA GAGCGCGTCT GCGGCGGAGA GGTCGACCCC 
GCCGACACCG CCGCCTCCGG CGACACCGAA CTCCTCGAAC CGCGCGAGGT CCCGCTCGGC
GGCCCGCGCG CCATGCTGGT CCGCCGCGCC CTGCCCGGGA AGAACCGCCG CATGGTCGGC
GCGTGGTGCT TCGCCGACGT CTACGGACCC ACCGGCGTCG CCGACGGCCC CGGCATGCAG
GTGCCGCCGC ACCCCCACAT CGGCCTGCAG ACCGTGAGCT GGCTGGTCCG GGGGTCGGTC
CACCACATGG ACGGACTGGG CTCGGACCAG CTGGTCCGCC CCGGCCAACT CAACCTCATG
ACCGCCGGAC ACGGCATCGC CCACTCCGAG CGCTCCCCCG CCGACGCCCC GCCGCTGCTG
CACGGCGCCC AGCTGTGGAT CGCCCTGCCC GAGGACGACC GCGAGGGACC GGCCCGCTTC
GAACACCACG CCGACCTGCC CGCCTTCGAC CTGCCCGGCA CGGACGCGCC GGGGGCCAGG
GTCACGGTGA TCGCCGGAGA GGTGGACGGC CGCCGCTCCC CCGCCCGCGT GCACACCCCG
CTCATGGGAG CCGAGGTCGT GCTGGAACCC GGGGCCCGGG TGCGCCTGCC CCTGGACGCG
TCCTTCGAAC ACGGCGTCCT GCCCCTGGAC TCGCCCGTGC GGGTCCTGGG CCACACCGTC
GAGGCCGGTG CGCTGCTCTA CGCGGGCGAG GGGCGCACGG AGGTGGAACT GCGGGCCGAG
GAGACCGCGC ACGTGCTGGT GATCGGCGGC GAGCCCTTCA CCGAGGACCT GGTCATGTGG
TGGAACTTCG TCGGCCGCGA CCACGACGAG ATCGTGCGGG CCCGCCGCGC CTGGGAGGAC
GACCGGGAGG ACGCCGACGA CGCGGGCGGG CGGCGCTTCG CCGCGGTCGC GACCGACGAC
GGAGCACCCC TCCCGGCTCC GGAGCTGCCC AACGCGCGCC TGCGCGCGCG CCCGCGCCAC
CGCGGCGCGT AG
 
Protein sequence
MSNLDTHPAE ERVCGGEVDP ADTAASGDTE LLEPREVPLG GPRAMLVRRA LPGKNRRMVG 
AWCFADVYGP TGVADGPGMQ VPPHPHIGLQ TVSWLVRGSV HHMDGLGSDQ LVRPGQLNLM
TAGHGIAHSE RSPADAPPLL HGAQLWIALP EDDREGPARF EHHADLPAFD LPGTDAPGAR
VTVIAGEVDG RRSPARVHTP LMGAEVVLEP GARVRLPLDA SFEHGVLPLD SPVRVLGHTV
EAGALLYAGE GRTEVELRAE ETAHVLVIGG EPFTEDLVMW WNFVGRDHDE IVRARRAWED
DREDADDAGG RRFAAVATDD GAPLPAPELP NARLRARPRH RGA