Gene Information |
Locus tag | Ndas_2913 |
Symbol | |
ID | 9246765 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3483500 |
End bp | 3485533 |
Gene Length | 2034 bp |
Protein Length | 677 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | hydrolase CocE/NonD family protein |
Protein accession | YP_003680829 |
Protein GI | 297561855 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
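The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the Start bp and End bp values are 1-based and inclusive (the record does not say so, but that convention reproduces the listed Gene Length) and that the CDS ends in a stop codon that is not counted in Protein Length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for a CDS of the given length.

    Assumes the CDS is a whole number of codons; the terminal stop codon,
    if present, does not encode a residue.
    """
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Ndas_2913.
length = gene_length_bp(3483500, 3485533)
print(length, protein_length_aa(length))  # prints: 2034 677
```

Both results match the Gene Length (2034 bp) and Protein Length (677 aa) fields in the record.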
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.355315 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.116315 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGACCG TCAGCGACCT GCCCCACGAG GTCCGCGAGG ACGAGTACGT CCTGATCCCG ATCAGCGACG GGGTCCGGCT GGCCGCGCGG ATCTGGCGTC CGGTGGGGAG CGAGGAGGCC CCGGTGCCGG CGGTCCTGGA GTTCATCCCG TACCGCAGGC GCGACCTGAC CGCGCAGCGC GACTCGGTGC ACCACCCCTA CATGGCCGGG CACGGGTACG CGTGCGCCCG CGTGGACCTG CGCGGCAGCG GCGACTCCGA GGGGGTGCTC ACCGACGAGT ACCTCGAACG CGAGCTCCTG GACGCGGAGG AGGTGCTGGC CTGGCTCGCC GAGCAGCCCT GGTGCGACGG CCGCACCGGG ATGATGGGTA TCTCCTGGGG CGGGTTCAAC GCCCTCCAGG TCGCCGCGCG GCGGCCGGAG AGCCTGCGGG CGATCGTGAC CGCCTGCTCC ACCGACGACC GCTACTCCGA CGACGTGCAC TACATGGGCG GGTGCCTGCT GGGGGACAAC CTGTCGTGGG CCTCCACGAT GTTCGCCTAC AACTCCTGCC CGCCCGATCC GGAGCTGGTC GGTGAGCGCT GGCGCGACAT GTGGCACGAG CGGCTGGAGC ACAGCGGCCT GTGGCTCGAC ACCTGGCTGC GCCACCAGCA CCGGGACGCG TACTGGAGGC ACGGTTCGGT GGCCGAGGAC CTGGACGCGA TCCGGGTGCC CGTCATGGCC GTCAGCGGGT GGGCCGACGG TTACTCCAAC TCGGTCTTCC GGCTGCTGGA GGGGTTGAGC GTCCCCCGTC TGGGCCTGCT GGGCCCCTGG TCGCACAAGT ACCCGCACCT GGGCCAGCCC GGCCCCGCCA TCGGCTTCCT CCAGGAGGTG GTGCGCTGGT GGGACCGCTG GCTCAAGGGC GTGGACAACG ACGTGATGGA CGCGCCCGTC CTGCGGGCCT GGATGCAGGA GAGCGTGGCG CCCTCCACCT CCTACGAGGC CCGGCCGGGG CGCTGGGTCG GTGAGCGGGA GTGGCCCTCG CCGGAGGTCG CGCTGGTACC GCGCGACCTG GGCGCGGGCC GGGTGCTCGC GGAGGGGGAG CCCTCGGGGC GGGAGGACGT GCTGACCCTG TCCTCCCCGC TGTCCACCGG ACAGCACGCG GGCAAGTGGT GTTCGTACAA CGCCCCCCCG GACCTGCCCT ACGACCAGCG CGAGGACGAC GGCGGGTCCA TCGTCTTCGA CAGCGTGCCG CTTCCCCGGC GCTTGGAGAT CCTCGGCTCC GCGGTGGTCG AACTCGAACT GGCGGTGGAC CGGCCCGACG CGATGGTCGC GGTGCGGTTG TGCGACGTCG CGCCCCAGGG GCAGGCCACG CGGGTGACCT ACGGGCTGCT CAACCTCACC CACGCCGACG GCCACGAGAG GCCGCGCAAG CTCGTGCCCG GGCGCCGGTA CCGCGTGTCG GTCCCCCTCA ACGGTGTGGC CCAGGCATTC CCGGCCGGGC ACCGGGTGCG GGTCTCGGTC TCCACCTCCT ACTGGCCGCT GGTGTGGCCC TCGCCCGAGC CGGTGACCCT CTCGGTGTTC CAGGGGGAGC ACACCCGTGT GCTGCTTCCG GTGCGTCCGG TCGAGGGCGG TGGTGACGGG CGGGGTGTGG CCGCTTTCGG GGAGCCCGAG GGCACCGCCC CGATCGCGAC GAGCCGGATC GCTCCGGGCG AGGAGCGGTG GGACCTGACC CAGGACCTGG TGCGCTACGG GGCCGCGCTG GAGGTGGTCA AGGACCTGGG GACGGTGCGC TTCGACGACA TCGGCCTGGA GGTGACCCGT 
CGGGCGGAGG AGCGCTACAG CAGGGTCGGC GACGACCACG ACTCGGTCCG TGGCGAGGCG GTGTGGACGA TGGGCTTCGC CCGCGGCGAC TGGTCCGTGC GGACCAGGAC CCACACGGTG CTCACGTCCA CGGCGACCGA CTTCCACCTG CACGCGACGT TGGACGCCTA CGAGGGCACG CGGCGCGTGG CCACCAAGAT CTACACCTCG GTGATCCCGC GGGACCACGT CTGA
|
Protein sequence | MRTVSDLPHE VREDEYVLIP ISDGVRLAAR IWRPVGSEEA PVPAVLEFIP YRRRDLTAQR DSVHHPYMAG HGYACARVDL RGSGDSEGVL TDEYLERELL DAEEVLAWLA EQPWCDGRTG MMGISWGGFN ALQVAARRPE SLRAIVTACS TDDRYSDDVH YMGGCLLGDN LSWASTMFAY NSCPPDPELV GERWRDMWHE RLEHSGLWLD TWLRHQHRDA YWRHGSVAED LDAIRVPVMA VSGWADGYSN SVFRLLEGLS VPRLGLLGPW SHKYPHLGQP GPAIGFLQEV VRWWDRWLKG VDNDVMDAPV LRAWMQESVA PSTSYEARPG RWVGEREWPS PEVALVPRDL GAGRVLAEGE PSGREDVLTL SSPLSTGQHA GKWCSYNAPP DLPYDQREDD GGSIVFDSVP LPRRLEILGS AVVELELAVD RPDAMVAVRL CDVAPQGQAT RVTYGLLNLT HADGHERPRK LVPGRRYRVS VPLNGVAQAF PAGHRVRVSV STSYWPLVWP SPEPVTLSVF QGEHTRVLLP VRPVEGGGDG RGVAAFGEPE GTAPIATSRI APGEERWDLT QDLVRYGAAL EVVKDLGTVR FDDIGLEVTR RAEERYSRVG DDHDSVRGEA VWTMGFARGD WSVRTRTHTV LTSTATDFHL HATLDAYEGT RRVATKIYTS VIPRDHV
|
| |
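The sequence fields above are printed in space-separated blocks of ten characters. The sketch below shows one way to normalize such a field and run basic sanity checks on it (GC fraction, codon framing, start and stop codons). It is an illustration only: the helper names are hypothetical, and the start-codon check accepts only ATG for simplicity, although translation table 11 also permits alternative start codons such as GTG and TTG.

```python
def normalize(seq_field: str) -> str:
    """Remove the block spacing and line breaks from a sequence field."""
    return "".join(seq_field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str,
                            stop_codons=("TAA", "TAG", "TGA")) -> bool:
    """Simplified check: whole codons, ATG start, recognized stop codon.

    Assumption: only ATG is treated as a start codon here.
    """
    return (len(seq) % 3 == 0
            and seq.startswith("ATG")
            and seq[-3:] in stop_codons)


# The first two blocks of the gene sequence in this record.
prefix = normalize("ATGAGGACCG TCAGCGACCT")
print(len(prefix), gc_fraction(prefix))  # prints: 20 0.6
```

Applied to the full Gene sequence field, `normalize` should yield a string whose length equals the Gene Length field, and `gc_fraction` can be compared against the listed GC content.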