Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3729 |
Symbol | |
ID | 9247598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4477940 |
End bp | 4479616 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003681633 |
Protein GI | 297562659 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTGT CCGTTCCGCA CGCCCTGGCC CCGGCCTGCG TGGGACTCGC ACTGATCGCC ACCGCATGCA CCACAGAGGA GGTGGGTTCC TCGCAGGAGG ACCGCGCAAC CCAGCCGTCG GCCGAGGCCG AGCCCTTCGA GCCCGTGCTC GCACCCCGGC TGCTGTCCGA GATGGACCTG GACGACAAGA TCGGCCAGCT CCTCGTCCTG ACCGCGCAGG GCACCTCGGC CGCCGAGAAC GCAGCCCAGA TCGAGGCCTA CCGGCCCGGC GGCCTCATCT ACTTCGACGC CAACCTCACC GACGCCGAGC AGATCGCCAC CATGTCGGCG GGCGTGCAGG ACCTCGCCGC CGACCAGGGG CGGGGCGTTC CGCTGTTCGT CGGCATCGAC CAGGAGCAGG GCCTGGTGGC CCGGCTGCCC GTGGGCACCC GCTTCCCCGA CGCCATGGCC GTCGGCGCCA CCCGCGACAC CGAGCTGGCC GAGCTGCGCG CCTCCACCAC CGCCGAGGAG CTCACCGCCC TGGGAGTCAA CCTCAACTAC GCGCCCGACG CCGACGTCAA CACCGACCCC GGCAACCCGG TCATCGGCAT CCGCTCCTTC GGATCCGACC CCGATCTGGT CGCGCAGATG GCCGTCGCCG AGTCGGACGC CTACTCCGGC GCCGGTGTGG TGTCGGTGGT CAAGCACTTC CCCGGGCACG GCGACACCGA CGTGGACAGC CACAGCGGCC TGCCCGTCAT CGACATGCCG CGCGAGCAGT GGGAGGCCGG GCACCTGCCG CCGTTCCGGG CGGCCATCGA CGCCGACGTG GACGCCATCA TGACCGCCCA CGTGCTCATG CCGCAGCTGG ACGGGAGCGA GGACCCCGAG CCCGCCACCA TCTCCCCGGA GCTGATCGAC GGCATCCTCC GCGACGAGCT GGGCTACGAC GGCGTGGTGA CCACCGACGC CCTCAACATG GAGGGTGTGC GCCAGCGCCA CTCCGACGGC GAGATCGCCG TGCGCGTGCT GGAGGCGGGC GTGGACCAGC TGCTCATGCC GCCGGACCCC GCCGCCGCGG TGTCCGCGAT CCGCGAGGCC GTCGAGCAGG GCCGCCTGAC CGAGGAGCGC ATCGACGAGT CCGTGCTGCG CGTCCTGGCG CTCAAGGAGA AGCGCGGGAT CCTGGAGGCC GAACCGGTGG ACGCGCAGGG CGCCGCGGCG GTCCTGGAGG ACCCCGAGCA CGCCGAGGCC GCCCAGCGCG TGGCGGACGC CTCCGCGACC CTGCTGCGCA ACGAGGGCGA CCTGCTGCCC CTGGCCGAGG GCAGCGGAGT CCGCGTCCAG GGGGTGGGCG CGGAGCAGAT CGCGGCCGCG CTCACCGAGG CCGGGATCGA CGTGGTGGAG AGCGGGGCCG ACGCCGTGGT CGTGGGCACC GGGGGCGGCA GCGGGGCCTC GGAGCAGAGC GGCCTGGTCC AGGCCGCGCG CGCCGAGGGG CTGCCGGTGG TCGTGGTGTC CCAGGGCACG CCCTACGACC TGGCGGCCTT CCCGGAGGCG GAGGCGTTCG TGGCGGTGTA CTCGTCCATG GACGTGTCGC GGGCGGCCGC CGCGCGGGTC GTCGCCGGAC AGGTGAAGCC CTCCGGCAAG CTGCCGGTGG ACATCCCCGC GGCCGACGTG GAGATCGGGA CCGGGCTGAC CACCTGA
|
Protein sequence | MKLSVPHALA PACVGLALIA TACTTEEVGS SQEDRATQPS AEAEPFEPVL APRLLSEMDL DDKIGQLLVL TAQGTSAAEN AAQIEAYRPG GLIYFDANLT DAEQIATMSA GVQDLAADQG RGVPLFVGID QEQGLVARLP VGTRFPDAMA VGATRDTELA ELRASTTAEE LTALGVNLNY APDADVNTDP GNPVIGIRSF GSDPDLVAQM AVAESDAYSG AGVVSVVKHF PGHGDTDVDS HSGLPVIDMP REQWEAGHLP PFRAAIDADV DAIMTAHVLM PQLDGSEDPE PATISPELID GILRDELGYD GVVTTDALNM EGVRQRHSDG EIAVRVLEAG VDQLLMPPDP AAAVSAIREA VEQGRLTEER IDESVLRVLA LKEKRGILEA EPVDAQGAAA VLEDPEHAEA AQRVADASAT LLRNEGDLLP LAEGSGVRVQ GVGAEQIAAA LTEAGIDVVE SGADAVVVGT GGGSGASEQS GLVQAARAEG LPVVVVSQGT PYDLAAFPEA EAFVAVYSSM DVSRAAAARV VAGQVKPSGK LPVDIPAADV EIGTGLTT
|
| |