Gene Ndas_2913 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2913 
Symbol 
ID9246765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3483500 
End bp3485533 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content73% 
IMG OID 
Producthydrolase CocE/NonD family protein 
Protein accessionYP_003680829 
Protein GI297561855 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.355315 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.116315 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGACCG TCAGCGACCT GCCCCACGAG GTCCGCGAGG ACGAGTACGT CCTGATCCCG 
ATCAGCGACG GGGTCCGGCT GGCCGCGCGG ATCTGGCGTC CGGTGGGGAG CGAGGAGGCC
CCGGTGCCGG CGGTCCTGGA GTTCATCCCG TACCGCAGGC GCGACCTGAC CGCGCAGCGC
GACTCGGTGC ACCACCCCTA CATGGCCGGG CACGGGTACG CGTGCGCCCG CGTGGACCTG
CGCGGCAGCG GCGACTCCGA GGGGGTGCTC ACCGACGAGT ACCTCGAACG CGAGCTCCTG
GACGCGGAGG AGGTGCTGGC CTGGCTCGCC GAGCAGCCCT GGTGCGACGG CCGCACCGGG
ATGATGGGTA TCTCCTGGGG CGGGTTCAAC GCCCTCCAGG TCGCCGCGCG GCGGCCGGAG
AGCCTGCGGG CGATCGTGAC CGCCTGCTCC ACCGACGACC GCTACTCCGA CGACGTGCAC
TACATGGGCG GGTGCCTGCT GGGGGACAAC CTGTCGTGGG CCTCCACGAT GTTCGCCTAC
AACTCCTGCC CGCCCGATCC GGAGCTGGTC GGTGAGCGCT GGCGCGACAT GTGGCACGAG
CGGCTGGAGC ACAGCGGCCT GTGGCTCGAC ACCTGGCTGC GCCACCAGCA CCGGGACGCG
TACTGGAGGC ACGGTTCGGT GGCCGAGGAC CTGGACGCGA TCCGGGTGCC CGTCATGGCC
GTCAGCGGGT GGGCCGACGG TTACTCCAAC TCGGTCTTCC GGCTGCTGGA GGGGTTGAGC
GTCCCCCGTC TGGGCCTGCT GGGCCCCTGG TCGCACAAGT ACCCGCACCT GGGCCAGCCC
GGCCCCGCCA TCGGCTTCCT CCAGGAGGTG GTGCGCTGGT GGGACCGCTG GCTCAAGGGC
GTGGACAACG ACGTGATGGA CGCGCCCGTC CTGCGGGCCT GGATGCAGGA GAGCGTGGCG
CCCTCCACCT CCTACGAGGC CCGGCCGGGG CGCTGGGTCG GTGAGCGGGA GTGGCCCTCG
CCGGAGGTCG CGCTGGTACC GCGCGACCTG GGCGCGGGCC GGGTGCTCGC GGAGGGGGAG
CCCTCGGGGC GGGAGGACGT GCTGACCCTG TCCTCCCCGC TGTCCACCGG ACAGCACGCG
GGCAAGTGGT GTTCGTACAA CGCCCCCCCG GACCTGCCCT ACGACCAGCG CGAGGACGAC
GGCGGGTCCA TCGTCTTCGA CAGCGTGCCG CTTCCCCGGC GCTTGGAGAT CCTCGGCTCC
GCGGTGGTCG AACTCGAACT GGCGGTGGAC CGGCCCGACG CGATGGTCGC GGTGCGGTTG
TGCGACGTCG CGCCCCAGGG GCAGGCCACG CGGGTGACCT ACGGGCTGCT CAACCTCACC
CACGCCGACG GCCACGAGAG GCCGCGCAAG CTCGTGCCCG GGCGCCGGTA CCGCGTGTCG
GTCCCCCTCA ACGGTGTGGC CCAGGCATTC CCGGCCGGGC ACCGGGTGCG GGTCTCGGTC
TCCACCTCCT ACTGGCCGCT GGTGTGGCCC TCGCCCGAGC CGGTGACCCT CTCGGTGTTC
CAGGGGGAGC ACACCCGTGT GCTGCTTCCG GTGCGTCCGG TCGAGGGCGG TGGTGACGGG
CGGGGTGTGG CCGCTTTCGG GGAGCCCGAG GGCACCGCCC CGATCGCGAC GAGCCGGATC
GCTCCGGGCG AGGAGCGGTG GGACCTGACC CAGGACCTGG TGCGCTACGG GGCCGCGCTG
GAGGTGGTCA AGGACCTGGG GACGGTGCGC TTCGACGACA TCGGCCTGGA GGTGACCCGT
CGGGCGGAGG AGCGCTACAG CAGGGTCGGC GACGACCACG ACTCGGTCCG TGGCGAGGCG
GTGTGGACGA TGGGCTTCGC CCGCGGCGAC TGGTCCGTGC GGACCAGGAC CCACACGGTG
CTCACGTCCA CGGCGACCGA CTTCCACCTG CACGCGACGT TGGACGCCTA CGAGGGCACG
CGGCGCGTGG CCACCAAGAT CTACACCTCG GTGATCCCGC GGGACCACGT CTGA
 
Protein sequence
MRTVSDLPHE VREDEYVLIP ISDGVRLAAR IWRPVGSEEA PVPAVLEFIP YRRRDLTAQR 
DSVHHPYMAG HGYACARVDL RGSGDSEGVL TDEYLERELL DAEEVLAWLA EQPWCDGRTG
MMGISWGGFN ALQVAARRPE SLRAIVTACS TDDRYSDDVH YMGGCLLGDN LSWASTMFAY
NSCPPDPELV GERWRDMWHE RLEHSGLWLD TWLRHQHRDA YWRHGSVAED LDAIRVPVMA
VSGWADGYSN SVFRLLEGLS VPRLGLLGPW SHKYPHLGQP GPAIGFLQEV VRWWDRWLKG
VDNDVMDAPV LRAWMQESVA PSTSYEARPG RWVGEREWPS PEVALVPRDL GAGRVLAEGE
PSGREDVLTL SSPLSTGQHA GKWCSYNAPP DLPYDQREDD GGSIVFDSVP LPRRLEILGS
AVVELELAVD RPDAMVAVRL CDVAPQGQAT RVTYGLLNLT HADGHERPRK LVPGRRYRVS
VPLNGVAQAF PAGHRVRVSV STSYWPLVWP SPEPVTLSVF QGEHTRVLLP VRPVEGGGDG
RGVAAFGEPE GTAPIATSRI APGEERWDLT QDLVRYGAAL EVVKDLGTVR FDDIGLEVTR
RAEERYSRVG DDHDSVRGEA VWTMGFARGD WSVRTRTHTV LTSTATDFHL HATLDAYEGT
RRVATKIYTS VIPRDHV