Gene Ndas_1361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1361 
Symbol 
ID9245211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1668427 
End bp1670451 
Gene Length2025 bp 
Protein Length674 aa 
Translation table11 
GC content71% 
IMG OID 
Producthydrolase CocE/NonD family protein 
Protein accessionYP_003679299 
Protein GI297560325 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.307565 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.298567 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGTGG TCAACAACCT GCCCAACGCC GTCCAGGAGG ACGAGCACCT GTGGATACCG 
ATGTCGGACG GTGTGCACCT GGCCGCCAGG GTGTGGCGGG CGACCTCCTC CGACGTGTCC
CCCGTCCCGG CGGTCCTGGA GTACCTCCCC TACCGCAGGC GCGACCTGAC GTCGGTGCGC
GACTCCATGC ACCACCCCTA CATCGCCGGG CACGGCTACG CCTGCGTCCG CGTGGACCTG
CGGGGTACCG GTGACTCGGA GGGCGTGCTC ACCGACGAGT ACCTGGAGCG CGAGCAGCTG
GACGCCGAGG AGGTGCTGGC CTGGCTGGCC GAGCAGCCCT GGTGCAACGG GAAGACCAGC
ATGATGGGGC TGTCCTGGGG AGGGTTCGCC GCGCTCCAGG TGGCGGCCCG CCAGCCCCCG
AGCCTGGGCG CGATCGTCAT CAGCTCCTTC ACCGACGACC GGTACGGCGA CGACTTCCAC
TACATGGGCG GTTGTCTGCT GTCGGACAAC CTCGCCGAGG CGGGGACGAT GTTCTCCGCG
GGCACCTGCC CGCCGGACCC GGTGACCGTC GGCGACGACT GGCGGCGGAT GTGGCACGAG
CGGCTGGAGG CCACCGAACC GTGGGTCCTG GAGTGGCTGC GCCACCAGCG GCGCGACGAC
TACTGGCGGC ACGCGTCGGT GAGCGAGGAC TACTCGAAGG TGCGCTGCCC GGTGCTGGCC
TCCAGCGGGT GGGCGGACGG CTACTCCAAC GCCGTCTTCC GGCTGCTGGA GAACCTGGAG
GCGCCCAGGC GGGGGCTGAT CGGCCCGTGG TCGCACCGCT ACCCGCACAT GGGCAGCCCC
GGCCCGGCGA TCGGCTACCT CCAGGAGGTC GTGCGCTGGC TGGACCGCTG GCTCAAGGAC
AGGGAGAACG GTGTCGACGA GGGCCCGTCC CTGTGGGCGT GGATGCAGGA CAGCGTGCTC
CCCTCCACCG CCTACACCGA GCGCCCCGGC CGGTGGGTGC GCGAGGACGT GTGGCCCTCG
CCGAGCGTGG AGTACCGCGG CTACCCGCTG GCCAGGTACC GGATCGGCCG CCCCGGAGAG
GAGGTGCACT CGGAGTCGCT GACCGTGCGG TCGCCGCTGA CCGTGGGCCA GTTCGCGGGC
AAGTGGGCCT CCTACAACGC CCCGCCGGAC CTGCCCTACG ACCAGCGCGA GGAGGACGGC
GGTTCCCTGG TCTTCGACAG CGACGTGCTG TCGGAGGACG TGGAGATCCT GGGCGCCGCC
GAGGTGGAGC TGGACGTGTC GGTCACCGAG CCGGTGGCCA TGGTCGCCGC GCGGCTGGTG
GACGTCGCGC CGGACGGCAG CGCCACGCGG GTGACCTACG GGCTGCTCAA CCTCACCCAC
CGCGACGGCC ACGAGCACCC CGAGAAGCTC GAACCCGGTG AGATCTACCG GGTGAAGGTC
ACGATGAACG GTGTCGCGCA GGCGTTCCCG GTGGGGCACC GCATCCGGCT GTCGCTGTCC
ACCTCCTACT GGCCGCTGGC CTGGCCGCCG CCCAAGCCCG CCCTGCTGAC CGTGCACCCG
GAGAACAGCA GGCTGCTGCT GCCGGTGCGT CCTCACTCCG AGGCGGACGA GCCGCACCCG
GAGCCCTTCG GGGAGCCGGA GGCGGCGCCG GAGATCTCCA CCACGCGCCG GGAGAAGCCG
GAGCACAGCT GGACCGTCTA CAGGGACCTG GTGGACACCC GGTCGGCCCT GGAGATCGTC
AAGGACGGCG GCATCCTGCA CTTCGACGAC ATCGACCTGG ACGTCGGTCG GCGCGCCTAC
GAGTACTACG AGTCCGTGGC GGGCGACTTC ACGTCCGCGC GCGGTGAGTC GACGTGGACG
ATGCGCTTCG CGCGGGACGG GTGGCGGACC CGGACCGAGA CCCACACGTC GCTGGAGTGC
ACCGAGACCG AGTTCCGGGT GTACGCGACT CTGGACGCGT TCGAGAACGA CGAGCGGGTC
TTCTCCCGGC AGTGGACCGA GACGCTGCCC CGGGACCACC TGTGA
 
Protein sequence
MHVVNNLPNA VQEDEHLWIP MSDGVHLAAR VWRATSSDVS PVPAVLEYLP YRRRDLTSVR 
DSMHHPYIAG HGYACVRVDL RGTGDSEGVL TDEYLEREQL DAEEVLAWLA EQPWCNGKTS
MMGLSWGGFA ALQVAARQPP SLGAIVISSF TDDRYGDDFH YMGGCLLSDN LAEAGTMFSA
GTCPPDPVTV GDDWRRMWHE RLEATEPWVL EWLRHQRRDD YWRHASVSED YSKVRCPVLA
SSGWADGYSN AVFRLLENLE APRRGLIGPW SHRYPHMGSP GPAIGYLQEV VRWLDRWLKD
RENGVDEGPS LWAWMQDSVL PSTAYTERPG RWVREDVWPS PSVEYRGYPL ARYRIGRPGE
EVHSESLTVR SPLTVGQFAG KWASYNAPPD LPYDQREEDG GSLVFDSDVL SEDVEILGAA
EVELDVSVTE PVAMVAARLV DVAPDGSATR VTYGLLNLTH RDGHEHPEKL EPGEIYRVKV
TMNGVAQAFP VGHRIRLSLS TSYWPLAWPP PKPALLTVHP ENSRLLLPVR PHSEADEPHP
EPFGEPEAAP EISTTRREKP EHSWTVYRDL VDTRSALEIV KDGGILHFDD IDLDVGRRAY
EYYESVAGDF TSARGESTWT MRFARDGWRT RTETHTSLEC TETEFRVYAT LDAFENDERV
FSRQWTETLP RDHL