Gene Information |
Locus tag | Ndas_3706 |
Symbol | |
ID | 9247575 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4445755 |
End bp | 4450275 |
Gene Length | 4521 bp |
Protein Length | 1506 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | coagulation factor 5/8 type domain protein |
Protein accession | YP_003681610 |
Protein GI | 297562636 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACACCGA CGGAATCCAC CGCCAGGCGC GAACCGGAGG ATTCCCGCAC ACCACCTTCC ACCGGCCCCG GCGGTGCCGA TCCCCGCCTC CTGGCCCGGC TGAAGGTCCT CACGGTGTGC CTGCTCCTGG GAGCGCTGGC CGCCAGCATC GACCCCGGCA GGATCGTCAG CGACACCAAG CTCGACCTGA CGGCCGACCC CCTGGGGTTC ATGGAGCGCG CCCTGCACCT GTGGGACGCC TCCTACTTCG GCCAGATCCA GAACCAGGCC TACGGGTACT TCTTCCCCAA CGGTCCCTTC CACCTGCTGT TCGACCTGCT CGGTATGCCC GACTGGCTCA TCCAGCGCCT GTGGATGGCC GTCCTGCTCG TCGCCGCGTT CACCGGCGTG TACAAGGTCG CCGGGGCGCT GGGCATCGGC ACCGTCAACA CCCGCATCCT GGCCGGGGTC GCCTACGCGC TGGCCCCCCG CGTGCTGACG CTGCTGTCGT ACAACTCCGC CGAGTTGCAG CCCATGCTGC TCATGCCGTG GATCCTGCTG CCCCTCGTGC TCGGGGCCAG GCACGGCCGC TCGCCCGCCC GCATGGCCCT GCTGTCGGGC CTGGCCTTCC TGCTGTGCGG CGGCACCAAC GCCGCCTCCG AACTGGCCGT CCTGGTCGTG CCAGGGCTCT ACCTGCTCAC CCGGGCGAAC GGTCCGCGCA AGTGGCGCCT GACGGCCTGG TGGACGGTCG CGCTGGTCCT GGCCTCGTTC TGGTACGTCG CGCCGCTGCT CCTCATGGCG CGGTACGTGT TCTCGTTCAT GCCCTACACC GAGGACGCCG CCGTCACCAC CGGCGTCACC TCGCTGCTCA ACGCGCTGCG CGGCACCTCC AACTGGATGG GCTTCCTGCC CGACCAGGGA AACACCGCGC TGCCCTCGGG GGCGGAGCTG TCCACCACCC CCTGGCTGAT CGCGGCCACC GCGCTGGTGG CGGGACTGGG CCTGGCCGGG CTGGTCAACC GGCGCACCCC CGAACGGCTC TTCCTCATCG CCAGCCTCCT CACCGGCACC GCGGTCATCG TCGCGGGCTT CACCGGCGAC CTCACGGGCC CCTTCGCGGG GACCATGCGC GAGCTGTTCG ACGGTCTCCT CTCCCCGTTC CGCAACGTGC ACAAGTTCGA CGCGCTCGTG CGCCTGCCCC TGGTGCTGGG CCTGGCCCAC CTGCCCGTGG TGGTGGCCCG GGACCTGGCC GACCGCAGGG GCACGCCCGT GTCCGGACGC GTGCGCCGCA CCGTCGCGGG CGCCACCGCC GCCGTCCTCG CGGCCACCCT CGTGCCCGTC GGCACCGTCG GGATCGCCCC CGCGGGCGGC TTCGCGCAGA TCCCCGACTA CTGGTACGAG GCCACCGACT GGCTGGAGGC GCGCGCCGAC GAGCGCGGTA TGACCATGGC CCTTCCCGGT TCGGCGCGCG GCGAGTACGA GTGGGGCCGC CCCATGGACG AACCCCTGCA ACCGCTGTTC GAGGGGGCCT GGACCAACCA CCAGATCATC CCGTGGGGCT CGGCGGGCGT CTCCCGCGTC ACCCACGAGA TCGACCAGCG CGTCTCCTCG GGACGCGGCT CCGCCGGGCT CGCCGACACC TTCGCGCGCA TGGGCGTCAC CCACCTCCTC GTGCGCAACG ACCTCCAGCG CACCGGCAAC AACGGCGGCT GGCCCGCCCG CGTCCACCAG GCCCTGACCG ACTCGCCGGG CATCACCCGG GCCGCCGAGT TCGGCCCGGT CATCGGCTCC CTGGACCACC AGTCCGCCTC CCAGTGGTTC 
GACCAGCCCT ACCGGGCGCT GAGCGTCTAC GAGGTGGAGG ACGCCGCGCC CACCGTCGGC ACCGTGCCCG CCGACGAGGT GCTGCGGGTC ACCGGCGGCC CCGAGTCCGT GCTCCACCTG GCCGAACAGG GGGTGGTCGA CGACGACCGC CCGATCCTGC TCGGGGACGA CCCCGGAGCC GGGGAGGTCG CCGCCCAGGA CACGGTCGTC ACCGACACCG CCCGCCGCCG CGAGGTGGTC TACTCCGATG TGCGCCGCAA CGTGTCGGCC ACCCTCACCG GGGACCAGGA GCTCGAACGC GACGTCCCCG CCCCCGACGT CCTCGACCCG GCGTGGGAGG ACCACGTCGC CCACGCCGAG GACGTCGGGA TCGCCTCGGT GCGCGCCTCC TCCGCGGAGA GCGGGGCGGG GGCGCGCGCC GCCGACCGCG ACCCGGGGCA CGCCCCGCAC GCCGCGCTCG ACGACGACCT CACCACCTCC TGGCGGTCCA GCGCCTTCAC CGGCGCCCTG GGGGAGTGGA TCGAGGTCGA GTTCGAGGAG CCCCAGGACC TGACCGGGCT CAGCGTCGCC TTCGAGCACC TGCCCGGCGA GCCGCCGCCC TCCCGGGTCA CCCTCGTCAC CGACGGCGGC GAGGCGCAGG TGCCCGTCGC CGAGACCGAG GAGCCCCAGG AGCTGGCGGC ACCGCCCGGC GCGACCACCA CACTGCGCGT GCGCGTGGAC GAGCTGGCCT GGGAGCCGGA GTACCGGTTC GGCACCCGGG TGGGTGTCGC CTCGATCTCC GTGCCCGGCC TGGAACCGGC CCGCACGCTG CGGGTCCCCG GCCCCGCCGA CGCCGGAACG CTGCTGTTCA CCGGCTCCAC CGGTACCGCG CCCGGTTGCA TGGAGGGCTC GCACGTGTGG GTGTGCAACC CCGACCTCCA GGTGCGGGGC GAGGACGCGC GCCGCCTGGA CCGCACGTTC GAGCTGTCGG CGGAGTCGGC CTCCGCCCCG CACACGGTCT CGGGCGAGGT CGTGCTCACC GACCCCCGGG AGGCGGAGAA CGCCGCCAAC CGGGCCAGCC CCCACCCGCA CGTGACGGCC TCCTCCACTG CCGTGCAGCA CCCCGCCGCG ATGGGCCGGG GCGCGCTGGA CGACGACGAG AGCACCGTCT GGTACCCGGA CCCGGAGGAG AAGAACCCCT GGCTCGACAT CGAGCTGGGC GCGCCCACCG AGATCGGGCA CCTGGAGGTG GAGTTCCCGC GCGCCGACAG CGTGTTGCGG CCGATCCGGG TGACCGTCGA GGGCGGCGGC ACCGTGCGCG AGGGGTGGCT GGACGGCAGC GGCCGGGTGG ACTTCGCCGA GTTCACCGCG GAGTCGCTGC GGGTGACCTT CGAACGCCCC GAGGGGCAGG CCCTGGAGAT CGGCACCGTC ACCCTGCCCG GGGTGGAGCC GGTGGAGCCG CTGCCCGAGG GCGACGCCTC CACCGCGTGC GGCCTGGGCC CCACGCTGCG GGTCAACGAC CAGCGGGTGG AGACGCGGAT CAGCCGGGGC ACCCTCGCCG ACCAGCTCAC CGGCCGCCCG CTGCGCTACG AGAGCTGCAC CGACCTGGAC CTGGTCGGGG GCGGGAACCG GATCGTGGTC GATCCGGGCA ACCGCTACGA GGTGCGCTCG GCGCTGGTGG AGTCCGCCGA CCCCGTCTCG GACCGCCCCG AGGTCACCAT GGCCGAGGTG GAGCGGGTGC ACGCCTGGGG TCCGGGCGAG CGCCGCTTCG ACGTGGACGT CGCCGAGGAC AGCCTGCTGG TGGTCAACGA GAACTTCAAC GAGGGCTGGC 
GGGCCCGCCT GGAGGGGGCT GACGCCGCAC TGGAGCCGAT CCGGCTGGAG GGCTGGAAGC AGGCCTGGGT GCTGCCCGCG GGCAGCGCGG GCACCGTCAC CCTGACCTAC GCGCCCGACA CCGCCTACCA CCGGGCCCTG GCCGTGGGCG CCGCGCTGGC CGCGGTGCTG GTGGTCGCCG CCCTGTGGCC CAGGCGCCTG CTGCCCGGGG GCGCGGCGGC CGGGGCCGCG GCCCCGCGGG GCGCCCGCGC CCTGCCCGAC GCCGGGCCCG GATGGCTGGG GCGGCGGGTC GTGCTGCCGC TGGGCCTGGC CTACGGGGTG TGGGTGGCCG GTGCGGTCGG TGCCGCCCTG GTCGCGGTGA TCCTGGTGTG CATGTGGTGG CTGGGGCGCC GGGCGCCCCG GAGGCTGCGG CACGCCAAGC CGGGACGGCC CTCCATGGGC GGGCGGACGC TGCCGCTTCT GGCGGGGCCC TGGCCGGTGG CGGTGTCGTT GGCGCTGGCC GGGCTGGCCA TGGGCGCGGG TACCCACCTG GCGCTGTACA TGCCCTTCCA CGAGGTGACC GAGGTGTTCG GGGGAGCGCT GCGCGGCTGG GTCTCCCAGC TGCTGTGCCT GCCGGCGCTG GTCCGGCTCG TGCTGGCCCT GGGCCAGCCG GGCGACGGCG AGGACGCCGA CGCCGATCCG CCGTCGGTCC GCGTCCGAGC GGCCGCTGCG GACGCCGGTG GCGGGGCGGA CGGGCCCGCT CCGCCCGGCG GTGCGGCCGG TCTGGCCGAT ACTGCCGGGG TCAGTGGGTC TAACGGTCCT ACCGGCACCC CGGCGACTCC TGTGAGCGAT GAGGCCGGCG GCGCTCCCCA CAACGGGGGC CCCTCGGACG ACGACCGGTG GGAGAACGAC CCCGAGGAGG CCCGGACATG A
|
Protein sequence | MTPTESTARR EPEDSRTPPS TGPGGADPRL LARLKVLTVC LLLGALAASI DPGRIVSDTK LDLTADPLGF MERALHLWDA SYFGQIQNQA YGYFFPNGPF HLLFDLLGMP DWLIQRLWMA VLLVAAFTGV YKVAGALGIG TVNTRILAGV AYALAPRVLT LLSYNSAELQ PMLLMPWILL PLVLGARHGR SPARMALLSG LAFLLCGGTN AASELAVLVV PGLYLLTRAN GPRKWRLTAW WTVALVLASF WYVAPLLLMA RYVFSFMPYT EDAAVTTGVT SLLNALRGTS NWMGFLPDQG NTALPSGAEL STTPWLIAAT ALVAGLGLAG LVNRRTPERL FLIASLLTGT AVIVAGFTGD LTGPFAGTMR ELFDGLLSPF RNVHKFDALV RLPLVLGLAH LPVVVARDLA DRRGTPVSGR VRRTVAGATA AVLAATLVPV GTVGIAPAGG FAQIPDYWYE ATDWLEARAD ERGMTMALPG SARGEYEWGR PMDEPLQPLF EGAWTNHQII PWGSAGVSRV THEIDQRVSS GRGSAGLADT FARMGVTHLL VRNDLQRTGN NGGWPARVHQ ALTDSPGITR AAEFGPVIGS LDHQSASQWF DQPYRALSVY EVEDAAPTVG TVPADEVLRV TGGPESVLHL AEQGVVDDDR PILLGDDPGA GEVAAQDTVV TDTARRREVV YSDVRRNVSA TLTGDQELER DVPAPDVLDP AWEDHVAHAE DVGIASVRAS SAESGAGARA ADRDPGHAPH AALDDDLTTS WRSSAFTGAL GEWIEVEFEE PQDLTGLSVA FEHLPGEPPP SRVTLVTDGG EAQVPVAETE EPQELAAPPG ATTTLRVRVD ELAWEPEYRF GTRVGVASIS VPGLEPARTL RVPGPADAGT LLFTGSTGTA PGCMEGSHVW VCNPDLQVRG EDARRLDRTF ELSAESASAP HTVSGEVVLT DPREAENAAN RASPHPHVTA SSTAVQHPAA MGRGALDDDE STVWYPDPEE KNPWLDIELG APTEIGHLEV EFPRADSVLR PIRVTVEGGG TVREGWLDGS GRVDFAEFTA ESLRVTFERP EGQALEIGTV TLPGVEPVEP LPEGDASTAC GLGPTLRVND QRVETRISRG TLADQLTGRP LRYESCTDLD LVGGGNRIVV DPGNRYEVRS ALVESADPVS DRPEVTMAEV ERVHAWGPGE RRFDVDVAED SLLVVNENFN EGWRARLEGA DAALEPIRLE GWKQAWVLPA GSAGTVTLTY APDTAYHRAL AVGAALAAVL VVAALWPRRL LPGGAAAGAA APRGARALPD AGPGWLGRRV VLPLGLAYGV WVAGAVGAAL VAVILVCMWW LGRRAPRRLR HAKPGRPSMG GRTLPLLAGP WPVAVSLALA GLAMGAGTHL ALYMPFHEVT EVFGGALRGW VSQLLCLPAL VRLVLALGQP GDGEDADADP PSVRVRAAAA DAGGGADGPA PPGGAAGLAD TAGVSGSNGP TGTPATPVSD EAGGAPHNGG PSDDDRWEND PEEART
|
| |
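The coordinate and length fields in the Gene Information table can be cross-checked against each other with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that Gene Length counts the terminal stop codon (the gene sequence above ends in TGA); the record itself does not state either convention. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the Gene Information table of this record.
start_bp = 4445755
end_bp = 4450275
gene_length_bp = 4521
protein_length_aa = 1506


def span_length(start, end):
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks compare a derived value with the value listed in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions both comparisons hold: 4450275 - 4445755 + 1 = 4521 bp, and 4521 / 3 = 1507 codons, one of which is the stop codon, leaving 1506 residues.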