Gene Information |
Locus tag | Ndas_0543 |
Symbol | |
ID | 9244384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 668808 |
End bp | 671594 |
Gene Length | 2787 bp |
Protein Length | 928 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | Fibronectin type III domain protein |
Protein accession | YP_003678496 |
Protein GI | 297559522 |
COG category | |
COG ID | |
TIGRFAM ID | |
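The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(668808, 671594)
print(length_bp)                  # 2787
print(protein_length(length_bp))  # 928
```

Both values agree with the Gene Length and Protein Length fields of this record.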
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTGGG ACGCACAGGA ACGCACACGC TACGAGGACG AGGTGCTGGT GGCCGCGCGC GCCCACGGGC TGCCCGCGGA CCTGTTCACG CGCTACGGTG TCGAACCCCG CCTGGAGCTG CGGCTGTGCG CCGATCCGGC CGCGTTCACC GAGCACGTCA ACGAGGTGTG CGCCCACTGG CGCGCGCTCC GGAGCAGCCG CCGCTCCCTG GGGGAGGTGC TGGACGAGCT CATCGCCGAG CACGAGCGGC TGGAGGGCGA GGGCGCGCTC ACGCACGCCC ACTTCCGGCA GGTCCGCGCC GACGAGGCCC GACGGGTCCT GAGCGCGTGG TCCGAGATCG CCGGGGGTCT GACCACCTCC CTCCTGGACC GCGACACCCT CCGGTCGATG ATCGCGCCGG TGGGCGTCAC CGAGGCCGAC GCCGAGCGCG TCCTGCTCGA ACACGACGTG CGCGTCGTGG ACCGGCTTCC CGAACTGGAC GACGCCCCGT CCGAGGACAC CCTGCGCGCC CTGCGCGGAC ACCTGCGCGC CCGGGGCGTG CCCTTCTCCC CCATGGAGGT CTTCGGGGAG CAGCGCCTGG CCGAGGGGTT CACCGTGCTG GACGGCTTCC GGCTCCACGA CGGCACCGCC CTGGACAAGG CCGCCCTCGA CGAGGCGGTC CAGCGCGTAC GGCTGGAACC GGAGTCCGAG GGCAGGGCGG CGGCCGAGAA CGTGCTGGAG ATCCTGCGCG CCGAGGGCGA CCGGGCCGAC GCCCTCGTGC TGGGGGAGAT CGTCGGCGAC CTGCGGGAGC GCCCGGAGTC CCTGAACGAG TCCGGGCTGG CCCGGCACTG GGTCAGCCGG GGGCTGGCCG AGGAGGAGGC GCTGCTGCTG TCCGCGGCCG TGCGGCGGTC CGCCTCCGGC GCCGACGCGC CCGCACGGGC GGAGCGCGAC GTGCACGACC TGCTCGCGGA CAACCGGCTG CGCCAGGCGC AGGAGGCCGC CGAGGACCTG CCCGAGGACC ACGACCTGCA CGAGCGCCTG CGCGCGCGGA TGCAGCAGGT GGAGGAGCTG ACCGCGCAGG CGGAGCGGGC GCTGCGCGCG GGCGACCGGG AGGAGGCCGC CCGGGCCCTG GCCGCGGCGG CGGACGCCGC GGCCGACGAC GAGGCGCTCG CCGCCCGGCT CGGCGAGGTG GCGACGCCGC CGCCGCGCGG TGTGGAGGCC CGGGTCGACG GCCGGAACGT GGTGGTCTCC TGGCAGCCCG GCCCCGCTCC GGACGAGTCG GTCACCTACC GGGTGACGCG GCGCACCGGA CGGGACGGGA GCGCCCGCGA GGCGCTGGTG GGCGAGTCGT CGGGAAACGA GGTCACCGAC ACCGGAGCAC CCGTGGGAGC TGAGGCGCGC TACGCGGTGG TGGCGGTGCG CGACGGCCTC GGGGTGTCGG AGGAGGCGTC GTCCGCGCCG GTGATGATCG CGCCGGAGGT CTCCTCGCTG CGGGTGCGCG CCGGGGAGCG GTCGGTGTCC GGCTCCTGGC AGGCCCCGGC GGAGGCGGTG CGCGTGGAGG TGCTGCGCGG TGTGGGGGCG CCGCCGCGCG GCGCGGGCGA CGGCGTGCGC GTGGAGACGG ACGGGTCGGG GTTCTGCGAC ACCGACGTCG AGATGGACGT GGAGTACCAC TACCGGATCC GCGCGGTCTA CGTGACCTCG AACGGGCACG CGCGGGGGTC GGCGGGGCTC GTGCGCCGGG CCAGCCCGGG TCCGTGCCCG GAAGCGGTGC GGGACCTGTC CGTGGTGCCG GACGGCGGCG CGGACTTCCG GGCCTCGTGG 
ACACGGCCCT CCCGGGGGAG GGTCGTGCTG CGGGTCGGGG AGGAGCCGCC CGAGTGGCCT CTGGGCACGG TGCTCGGCCC CGACGACCTG GAGTCCTACG GCCGTGAGGT GGCCCAGACC CCGGTGGCCG ACGACGAGGG GTCCTCTGAG GGCAGGGAAG CCGACAGGAG CAGCGGAACG GGTGTTCGCG CGGGCGCGGG CCGTGCCTCG GGGCCGCCCC CGGGGTCGGC GAACGGCCAC GCGGACCCCG GGCGCGGGGG GACGCCCGGC GAGGAGAGGG AGGGCGTGCG CGTACTCGGG CCCGGGGAGA GAAGCACGGG CCACACCCGC ACCGCCCCCG TCCAGGACGC CCGCGCCCCG CACGCGCCGC GCGCCGGCGC CGCGGCGCCC GAGGGGGGCA GGGCCCGTCC CCGTGAGGGC GCGTTCCGCG ACACGGGCCA CCCGCGCACC ATGCCATCCG AGGAGACAGG GGCCAGCCCC CGTGAGGGCG CGTCCCACGT AACGGACCAG CCGAACACAG TGCCCTCGGA GGAGGGCGGG ATCCGCTCCT CTGAGGGGGC GTTCCACGCC GCGGGCCACC CGAGCACAGC GCCCTCCGAG GAGGACGGGC ACGGGGGGAC CGAGGGCATC CGCGTCCTGG GTCCGGGCGA GGGGAGCCCC GCCCGGAGCC GGGACGCCGA CCACGGCACG GGTGCCCGAC GCGGAGGCTT CGCGGACGAG GGGGGAGAGG GGGTCCGGAT CCTGGCCCCC GTTGAGCGGC CCCCGGGGCG GACCCGGACC GTTCCCGGCC AGGACGCCCG CACACCGCAC GCGCCCTCCA GCGGTTCCGC GGCGCCCGGA GGTACCGCGG GGGACCGTGC TCCCGCCCCC GCCGAGGGGC CCGGTGAGCG CATCCGCAGT GTCGGAAGGC ACGCCAGGCT CTCCCCCGCC GCGGAGTCCC CGGCACCGGC GGCCACCGGG GTGGGGGACG GTGCCCTCGG CACTTAG
|
Protein sequence | MPWDAQERTR YEDEVLVAAR AHGLPADLFT RYGVEPRLEL RLCADPAAFT EHVNEVCAHW RALRSSRRSL GEVLDELIAE HERLEGEGAL THAHFRQVRA DEARRVLSAW SEIAGGLTTS LLDRDTLRSM IAPVGVTEAD AERVLLEHDV RVVDRLPELD DAPSEDTLRA LRGHLRARGV PFSPMEVFGE QRLAEGFTVL DGFRLHDGTA LDKAALDEAV QRVRLEPESE GRAAAENVLE ILRAEGDRAD ALVLGEIVGD LRERPESLNE SGLARHWVSR GLAEEEALLL SAAVRRSASG ADAPARAERD VHDLLADNRL RQAQEAAEDL PEDHDLHERL RARMQQVEEL TAQAERALRA GDREEAARAL AAAADAAADD EALAARLGEV ATPPPRGVEA RVDGRNVVVS WQPGPAPDES VTYRVTRRTG RDGSAREALV GESSGNEVTD TGAPVGAEAR YAVVAVRDGL GVSEEASSAP VMIAPEVSSL RVRAGERSVS GSWQAPAEAV RVEVLRGVGA PPRGAGDGVR VETDGSGFCD TDVEMDVEYH YRIRAVYVTS NGHARGSAGL VRRASPGPCP EAVRDLSVVP DGGADFRASW TRPSRGRVVL RVGEEPPEWP LGTVLGPDDL ESYGREVAQT PVADDEGSSE GREADRSSGT GVRAGAGRAS GPPPGSANGH ADPGRGGTPG EEREGVRVLG PGERSTGHTR TAPVQDARAP HAPRAGAAAP EGGRARPREG AFRDTGHPRT MPSEETGASP REGASHVTDQ PNTVPSEEGG IRSSEGAFHA AGHPSTAPSE EDGHGGTEGI RVLGPGEGSP ARSRDADHGT GARRGGFADE GGEGVRILAP VERPPGRTRT VPGQDARTPH APSSGSAAPG GTAGDRAPAP AEGPGERIRS VGRHARLSPA AESPAPAATG VGDGALGT
|
| |
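The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any calculation. The following is a rough sketch of how the blocks could be joined and the GC fraction computed; the helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Join space- or newline-separated blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence as displayed in the record.
sample = "ATGCCCTGGG ACGCACAGGA"
print(clean_sequence(sample))  # ATGCCCTGGGACGCACAGGA
print(gc_fraction(sample))     # 0.65
```

Applying `gc_fraction` to the complete gene sequence would give a value to compare against the GC content field.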