Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1008 |
Symbol | |
ID | 9244854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1231574 |
End bp | 1233196 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | Pectate disaccharide-lyase |
Protein accession | YP_003678957 |
Protein GI | 297559983 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAACCA GACCCGCCCT CGCGTGCGCC GCCCTACTGG CAGGCACGCT CGTCACGCTT TCCGGCACCG CGGCGCACGC CGCCACCACC CGCTACGAGG CCGAGGGCAC CTCGGCCTCC TGCAACGGCA GCATCGAGAC CGAGTGGCCG GGCTACTCCG GCGACGGGTT CTGCGACACC GAGAACGAGG AGGGCGCCCA CGTGCAGTTC AGCGTGAACG CCCCCGCCTC GGGAACGGCC ACGCTGACCG TGCGCTTCGC CAACGGCACC TCCTCCGCCC GCCCCGCCGA CGTCCTCGTG AACGGCTCCC GGACGGCGTC GGTCTCCTTC GAGGGCACCG GCGACTGGGA CGCGTGGACG ACCAAGACGC TCACCGTCCA GCTGGCCGAG GGCGCCAACA CGATCCGGTT CGACCCGACC GGTTCCCAGG GCATGCCCAA CGTCGACTAC CTCGACGTGG AGACCAGCGG CGGTGGCGAC GACGGCGGAG GCGATGACGG CGGCGGTGAC GACGGGGGCG GCGACGACGG CAACCCGCCG ACCTCGGACG CGTTGTACGT GGCCCCGGGC GGCAGCGCGG GCGCGGACGG CACCGAGTCC GACCCCACCA CGCTCACCTC GGCCATCGAC CGGATCGAGC CCGGCGGGAC GATCCACATG CGCGGCGGGA CCTACTCCTT CTCGGACACG GTCACCATCC CGCTGGGCGA GGACGGCACC TCCGGAAACC GCACGGAGCT GTCCGCCTAC CCGGGTGAGA CCCCGGTGCT GGACTTCTCC GCCCAGAGCG AGGACTCCGC CAACCGCGGC CTGGCGCTGG AGGCGTCCTA CTGGCACGTC GAGGGCATCG TCGTCGAGCA CGCCGGGGAC AACGGCATCT TCGTCAGCGG CAGCCACAAC GTCATCGAGC GCACGGTGAC CCGCTTCAAC CGCGACACCG GCCTCCAGCT CTCGCGCCGC GTCTCCAGCA CCCCCGAGAG CGACTGGCCC GCCCACAACC TCATCCTGAG CGCGGAGTCG CACGACAACG CCGACTCCGA CGGCGAGGAC GCCGACGGCT TCGCCGCCAA GCTCACCTCC GGCCCCGGCA ACGTCTTCCG CTACGCCGTG GCCCACAACA ACATCGACGA CGGCTGGGAC CTCTACACCA AGGACGACAC CGGCCCCATC GGCACCGTGA CCATCGAGGA CTCCCTGGCC TACGAGAACG GCATCCTCAG CGACGGCTCC CAGGCCGGCA ACGGCGACCG CAACGGCTTC AAGCTCGGCG GCGAGGACAT CGGGGTCGAC CACGTCATCA CGGGCAACAT CGCCTACGAC AACGGCAAGC ACGGGTTCAC CTACAACAGC AACCCGGGCT CGATGACGGT GTCGGACAAC GTCAGCATCG GCAACGAGGA GCGCAACTTC AACTTCGACG ACGGCTCCTC GGTGTTCCGC GGCAACACCT CGTGCGACAG CGGTTCCAAC GACCGGATCA TCGGCAACGC CGACAGCTCC AACCAGTTCT GGTCCGGCTC GAACGGGTCG CGGTGCTCCT CCTACGACGG CGGCCTGGAC TGGTCCTACG CCGCCGACGG CACCCTGGTC GTGACCTTCG GGGGCCAGCG GGTCACGCCG TAA
|
Protein sequence | MRTRPALACA ALLAGTLVTL SGTAAHAATT RYEAEGTSAS CNGSIETEWP GYSGDGFCDT ENEEGAHVQF SVNAPASGTA TLTVRFANGT SSARPADVLV NGSRTASVSF EGTGDWDAWT TKTLTVQLAE GANTIRFDPT GSQGMPNVDY LDVETSGGGD DGGGDDGGGD DGGGDDGNPP TSDALYVAPG GSAGADGTES DPTTLTSAID RIEPGGTIHM RGGTYSFSDT VTIPLGEDGT SGNRTELSAY PGETPVLDFS AQSEDSANRG LALEASYWHV EGIVVEHAGD NGIFVSGSHN VIERTVTRFN RDTGLQLSRR VSSTPESDWP AHNLILSAES HDNADSDGED ADGFAAKLTS GPGNVFRYAV AHNNIDDGWD LYTKDDTGPI GTVTIEDSLA YENGILSDGS QAGNGDRNGF KLGGEDIGVD HVITGNIAYD NGKHGFTYNS NPGSMTVSDN VSIGNEERNF NFDDGSSVFR GNTSCDSGSN DRIIGNADSS NQFWSGSNGS RCSSYDGGLD WSYAADGTLV VTFGGQRVTP
|
| |