Gene Ndas_5437 details

Gene Information

Locus tag: Ndas_5437
Symbol:
ID: 9249340
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 624560
End bp: 625786
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003683322
Protein GI: 297564349
COG category:
COG ID:
TIGRFAM ID:
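
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a complete CDS ending in a single stop codon; the record does not state either convention, although its numbers are consistent with both. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 624560
END_BP = 625786


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residue count, assuming a complete CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints: 1227 408
```

Under these assumptions the computed values agree with the listed Gene Length (1227 bp) and Protein Length (408 aa).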


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.145957
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACCA CCACCGCGGT CAGCGACCTC GCCGCCCGCA TCCACGCGAT CGACTGGGAC 
GGCGACGTGG AACACACGCG GTCGCGGGTC GCGCTGATGC GCGAGTACCT GCGCCGCTCG
GCCGTGTGGA CACGTGCCCT GGGTACCAGG GGATGGCCGT TCTACGACAT CGCGGAATTC
GCGGCGCCGG GCGTGCGCGC CGCGGACGAG GTGGTCGAAG GTGTGCTGGA GAGCCCGGTG
GTCATCGATC AGTACCCGAC GGTGGGCAGG AGCTGCGTGT GGGCGCTGCA CCTGGCAGCA
GCCCGCGATG CCGGAGTGCC CCTGCCGGAC CTGCCCGATC CCTTCGAACC GCTGATCCGC
ATGTACGAGC GCGGCGGCGG CTTCTCCCTG TCGACCACCG GCACGATCGA CATCGACACG
GCGGGCCTGT ACCGGGGCAG GCTCCCCGAC CACCTCGGGG GCGAGCCCAG GGCTCCCGAG
ACCGAGGCCG GGCTCGACGC CCTCGACGGC GTCGGCGGCT GCCCCCCGGT GCCGTACGCC
CGGCGCACGG CACCGCCAAC GAACCCCCTC CCTCCTTTCA CTCCCGGATT TCGACCGAAA
AGGATTCCAG GAGAGGCAGG GAAACCTCAA CAGTCATGCG GAGCGGCACG GCAGGAACCG
CCCCGAGGAA ACGAATGGAA CAGGAGCCCT ACCTTCGGAA ACCCCGGTGG ACTGGATACT
CAACGGGGTG GAACAGACGA CTCCGGATTA CGGCAACCGT CCCACGCGGA ACCCGGGGAA
GGAGCGGAAG GAAGTTCGAT GGATCTTGCC GAGCAGCAGG AGCTGGTGCG CGACATGGCA
CTTGAACTGG TCGAGGCCGC TCCGGACGGC TGGACCTCGA TGAACTACCG GTATGACTAC
ATCGGGGGCG GCGCGGCGAG CGAGAACCTC GTGACCTTCG AGAACGGAGA GACGGAGAGG
AAGCGCCATC CGCGCTCCGT CGACAAGAAG GCCAAGTTCC TCAAGAGTGA GATGTACCAG
GAGGGCAAGG GGACCTGGCT CGGCATGTCG ATCTCGGTGA CCAGGCCCGG GAAGTTCAAC
GCGCAATTCC ACTACGACAA GGAGCTGGGG GTCCACCCGA TCCCCCCGTC TCCGGACAGC
TACGTCTTCG AACTGGGGAA GTTCCCCCGG AACGACGACG CGCTCTCCGA CTGGCTCAGG
GAGCGGATCG ACCAGGCGCG GGGCTGA
 
Protein sequence
MSTTTAVSDL AARIHAIDWD GDVEHTRSRV ALMREYLRRS AVWTRALGTR GWPFYDIAEF 
AAPGVRAADE VVEGVLESPV VIDQYPTVGR SCVWALHLAA ARDAGVPLPD LPDPFEPLIR
MYERGGGFSL STTGTIDIDT AGLYRGRLPD HLGGEPRAPE TEAGLDALDG VGGCPPVPYA
RRTAPPTNPL PPFTPGFRPK RIPGEAGKPQ QSCGAARQEP PRGNEWNRSP TFGNPGGLDT
QRGGTDDSGL RQPSHAEPGE GAEGSSMDLA EQQELVRDMA LELVEAAPDG WTSMNYRYDY
IGGGAASENL VTFENGETER KRHPRSVDKK AKFLKSEMYQ EGKGTWLGMS ISVTRPGKFN
AQFHYDKELG VHPIPPSPDS YVFELGKFPR NDDALSDWLR ERIDQARG
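
The gene and protein sequences above can be compared with a short translation routine, and the GC content field can be recomputed the same way. This is a minimal sketch using only the Python standard library, not the method the database used. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in 10s."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt of the gene sequence listed above.
first_30 = "ATGAGCACCA CCACCGCGGT CAGCGACCTC"
print(translate(first_30))  # prints: MSTTTAVSDL
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison with the listed protein sequence and the 67% GC content field.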