Gene Ndas_2512 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2512 
Symbol 
ID9246362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2977691 
End bp2978782 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content78% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003680437 
Protein GI297561463 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0253477 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGCGC ACACGCCGCC GCGCGCGGCC CGCGAACCGC GTTCCCCCGC GTCCGGCCCG 
CCGGAACGCC CCGGGCGCGC CGAGCGCGCG CCCCGGCGTG AGGCGCCGGG GCGGGCGGGC
CGACCGAAAC CCCCGTGGCA CACGGACCGC CCGGACCGCA GGGAAACAGG GGACCGGGCG
CAGGCGCCCG GCGGGCACGA CACGGGCGTC GCCCCCGAGC TGGCCGAGGA GTACGCGCGC
ATCGGCGCCG CCTCCGAGGA GGAGCTGGCC GCCGACCTGC TCGCGCTGAC CGCCACCCAC
GAGGGCCGCG GCCGAGACCT GCTGCTGTCC TGGGCCGCCA CGGCGCTGCG CTTCGGTGAG
CGCATGGCCG ACGAGGACGC CGCGAGTGCC GGAGCGTCCG GGGACCCCGG GGCCGCTGGG
CCCGCCGGGG TTGCCAGGAC GGGAGGGAGC ACGGGGGCCA AAGGGGGCTC TGAGGGCCCT
CAAGGGACCT CATGCAGCTC GGGTGCTTCA GAGGTCCCAA GCGGGTCAGG CGCCTCAGAC
ACCCGTGAGG TCCCAAGCGG CTCTGGTGCC CCGGGTGCCA CGGGTTCCTC GGGTGCGGTG
AGCGCTCCGA GTCGCACGGG TCCTTCGGAC CTGCGTGTGG AGGAGCGGGC CTCGGGCGGG
CTGCGTCCGG GCGAGGTCCT GCTCGCCGAG TACCTGCACC GGGCCTCCGG CGACCGGGTG
GTCGTCTACG CCGACGCCCT CGACCACGCC CACCGCGTGG CCCGCGACGC CGGGTGGGGC
CGCGACCTCA CCCCCGAGGC CCTGCGCGAG ACCGCGCTGG CCCACGAGCA CGCCCACCGG
ATGCTGCACG ACGGACGGGG CCGCGCGCTG CGGCGCGAAC TCGACCACGT CCTGGTGCGC
CTGGGGCCGC TGCGCCTGCG CGGCCACGTG GTCGGCGCCG ACGAGATCGC CGCGCACGCC
TACGCCCGCC GTCGCGCGGG GCTGCGCCGC AGCCCGATCG CGCTCACCGC GGCCATCGCC
GCCACCCTTG CCCACCGGGG TACGGGCCAC GACCGGCCCG CCCCGCGAAC CTCCCCCGGA
GAACGCCCGT GA
 
Protein sequence
MSAHTPPRAA REPRSPASGP PERPGRAERA PRREAPGRAG RPKPPWHTDR PDRRETGDRA 
QAPGGHDTGV APELAEEYAR IGAASEEELA ADLLALTATH EGRGRDLLLS WAATALRFGE
RMADEDAASA GASGDPGAAG PAGVARTGGS TGAKGGSEGP QGTSCSSGAS EVPSGSGASD
TREVPSGSGA PGATGSSGAV SAPSRTGPSD LRVEERASGG LRPGEVLLAE YLHRASGDRV
VVYADALDHA HRVARDAGWG RDLTPEALRE TALAHEHAHR MLHDGRGRAL RRELDHVLVR
LGPLRLRGHV VGADEIAAHA YARRRAGLRR SPIALTAAIA ATLAHRGTGH DRPAPRTSPG
ERP