Gene Ndas_1707 details

Gene Information

Locus tag: Ndas_1707
Symbol:
ID: 9245557
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2077949
End bp: 2079367
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: Rhodanese domain protein
Protein accession: YP_003679642
Protein GI: 297560668
COG category:
COG ID:
TIGRFAM ID:
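
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2077949
end_bp = 2079367
gene_length_bp = 1419
protein_length_aa = 472

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# A coding sequence of n bp holds n / 3 codons; the final stop codon
# is not translated, so the protein is one residue shorter.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # 1419 0 472
```

Under these assumptions the span matches the listed gene length, the length is a whole number of codons, and the derived protein length matches the listed 472 aa.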


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.140865
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.999922
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCTGG AACGCCTCTA CGACGACGAC CTCGCCCAGG CCGGCTACTT CGTCGGCTGT 
CAGGCCAGCG GAGAGGCCGT CGTGGTCGAC GCCCGCCGCG ACATCGACGA CTACCTCGGA
CTGGCCCGCG CACACGGCAT GCGCATCGTC GCCGTGACCG AGACCCACAT CCACGCCGAC
TACCTCTCCG GCACCCGCGA ACTCGCCGCG GCCACCGGCG CCACCGCGTA CGTCTCCGGC
GAGGGCGGTC AGGACTGGCA GTACGGATTC GACGCCGAAC GGCTGCTCGA CGGTGACGCC
ATCACCGTCG GCAACATCAC GGTCCGCGCG AGCCACACCC CCGGGCACAC GCCCGAGCAC
CTCTCCTTCC TGGTGACGGA CGGGGCGTTC AGCCGCGACC CCGGCTACCT GCTCTCGGGT
GACTTCGTCT TCGCCGGCGA CCTGGGCCGT CCCGACCTGC TCGACGAGGC CGCAGGGGGG
AGCGGCACCC GGTTCGAGGG CGCCCGCCAG CTCTTCGAGA GCCTGCGCCG GGTGTTCCTG
AACCTGCCGG ACCACGTCCA GGTCCTTCCC GGACACGGGG CGGGCAGCGC CTGCGGCAAG
GCCCTCGGCG CTCTTCCGGC CACCACGGTC GGGTACGAGC GGCTCAACGC CTGGTGGGGT
CCCTACCTGC GCGCCGGCGA CATGGAGGGC TTCGTCGCGG AACTGCTCGA CGGACAGCCC
GACGCCCACG CCTACTTCGC GCGGATGAAG CGCCAGAACA GGGAGGGGCC CCGGGTCCTG
GGCCGCCTCG CCCCGCCGCC CGCGCTCTCC GACGAGGAGG TCGGCAGGTC CCTGAACGAG
GGCGGGAGCG TCCTCGTCGA CACCCGGCCC CACGTCGAGG TCCACCGGGG CACCGTGGCG
GGGGCGCTGA ACATCCCGGG TCCGGACAAG GCCGCGACCT TCGGGGCCTG GGCCCACGAC
CCCGAGTCCG AAACCCGCCC CCTGGTGCTG CTCGCCGACG ACGGGGACAC CGCCCGCCGG
GTGCGCGACC ACCTGCTGCG GGTCGGCATC GACCACGTGT CCGGTTACAC GACCAGCACG
GAGGGCCTGC CGAGGACCGT GCCGCCGCGC GTCACCCCCG AGGAACTCGA ACAGGCCGGT
GCCGCGCTCC TGCTGGACGT GCGCACCAGG GGCGAGCACG CCGAGGGGCA CATCCCCGGT
TCGCGGCAGC TCAGCGCGGG CCGTGTGCTG TGGAACCTCG ACGCGCTGCC GACCGACGGC
ACCATCGTGA GCTACTGCCA GTCGGGCGCC CGCAGTTCCG TGGCGGCCTC GGCTCTGCGC
CGCGCGGGAT ACGACGTCGT CGAACTCGAC CGGGGTTACG GCGCCTGGGA GGACTGGAGG
CGGGACCGGG ACGCGCCGGT GCGCGCTGGT GCCGACTGA
 
Protein sequence
MLLERLYDDD LAQAGYFVGC QASGEAVVVD ARRDIDDYLG LARAHGMRIV AVTETHIHAD 
YLSGTRELAA ATGATAYVSG EGGQDWQYGF DAERLLDGDA ITVGNITVRA SHTPGHTPEH
LSFLVTDGAF SRDPGYLLSG DFVFAGDLGR PDLLDEAAGG SGTRFEGARQ LFESLRRVFL
NLPDHVQVLP GHGAGSACGK ALGALPATTV GYERLNAWWG PYLRAGDMEG FVAELLDGQP
DAHAYFARMK RQNREGPRVL GRLAPPPALS DEEVGRSLNE GGSVLVDTRP HVEVHRGTVA
GALNIPGPDK AATFGAWAHD PESETRPLVL LADDGDTARR VRDHLLRVGI DHVSGYTTST
EGLPRTVPPR VTPEELEQAG AALLLDVRTR GEHAEGHIPG SRQLSAGRVL WNLDALPTDG
TIVSYCQSGA RSSVAASALR RAGYDVVELD RGYGAWEDWR RDRDAPVRAG AD
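
The sequence listings are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below shows one way to strip that layout, compute a GC fraction, and translate a coding sequence. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard code's amino acid assignments, which translation table 11 shares; table 11 differs only in its permitted start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean_sequence(text):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(text.split()).upper()

def gc_fraction(dna):
    """Return the fraction of G and C bases in a DNA string."""
    dna = clean_sequence(dna)
    return sum(base in "GC" for base in dna) / len(dna)

def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First row of the gene sequence listing above.
first_row = "ATGCTGCTGG AACGCCTCTA CGACGACGAC CTCGCCCAGG CCGGCTACTT CGTCGGCTGT"
print(translate(first_row))  # MLLERLYDDDLAQAGYFVGC
```

The translated first row matches the first 20 residues of the protein listing. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.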