Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5537 |
Symbol | |
ID | 9249440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | - |
Start bp | 731025 |
End bp | 733259 |
Gene Length | 2235 bp |
Protein Length | 744 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | Fibronectin type III domain protein |
Protein accession | YP_003683422 |
Protein GI | 297564449 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.320231 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGACC AGCCCACCGC ACCCCCGACG GATCCCGCGC CGAAGCCCCG CCCCTCCCCC GTGCGGCGCG CGGCCTCGGC GCTCTGGCGC CGCACCCGCT CCAGCGCCCC GGGCCTGATG ATCTCGCTCC TGGCCGCGGG CCTGCTCAGC ACCGCGCTCG GCGCCGGGGC CATGGGCCGT GCGGACGAGA TGTCCGACGG CGCCGTGTGG CTCTGGGACA GCCCCGCGGG CGAGAGCTTC CGCGTCAACG GCGACAACGC GCGGATCGAC CTCGTCGCCG CGCTGCCCGG CTCCGCCGGA CGCCCCGTCC AGGTCACCCA GAACGACGAC TACCTCCTGC TCCACGACCC CGAGACCGGC CGGGTGACTT CGGTGGACCT GCGGGAGATG GGGTTCTCCG GTGTGCTCGA ACTCGGCACC GGCGGCGACT TCGGCCTCGC GCTGGGCGAG GAGGCCGCGG TGGTCATCGA CCGCGCCAGC GGTGAGGTCA AGGCGGTGGA CCCCGCGACC CTCCAGCCGA CCGGACCGTC CCTGAGGATC CCGGCCCCGC TGGTCGGCGG GGCCTTCGAC GACTCCGACA CCCTCTGGCT GGGGGTGCCC ACCCAGGGCA CGGTCGTCGG AATCCGGGTC GAGGCCGAGG AGGCCGTCAT CACGCAGACC GCGTCGGTGG CCGACCCCGG CGCCGACATC GCCGTCACGG TCCTCGACGA CGGCGTGCTC GCGGTCGACC GGAACGGCGA CCGCATGGTC GCCGTCCGCA ACGGTGGCGA GTCCCGGACC ATCACCTCTC CCGTCCCGCT GGAGGGCGCC GAGGTACCGC CGCGCACCCG GGGCGACCTG GCCGCCGTGA CGCTGCCCGG CTCCGGCGAC GTCGTCACCG TGTCCGACCC CACCGGGTCG GCGGGCGTGG ACCACTTCTC CACCGGCCGT GAGGGCGGCG GCACCGCCGT CCCCTACGAG GGCCGCTTCT ACGTGCCCTT CCCCGAGGAG GGCGCGGTGC GCGTCTTCGG CCCCTCCGGG GACGAGCTCA ACCCCATCAC CCTGCCCGGT GCCGAGGGGC CGCTGGAGCT GGAGGCGCGC GAGGGCAGCC TGTACATCAA CTCACCCGAC ACCGGGGTGG CGGCGGTCGT CGACCCCGGG GGCCGGGCCA CCGTCATCGA CAAGACCGCC CCGCCGCCCG GCCCCGGCGA GACCGACGAG GACGAGCCCG CGCCCCGGGA ACCGGCCCCG GACGGGACCA CCGCGCCCGA GCCCGGTGAC GCCCCCGTGC CCGACGCGGG TGCGCCCGAC GCCGGTGACA CCACCGCCCC GGAGTCCGGC GGCACCGAGG GCGGCGCCGC CCCCAGGGCC CCGGCCGTGG AGAGCCCCCG GGACGACGGC GAGGAGGAGG ACGAGGGCAC GGCGCCGGGC GCTCCGACAC CCGTCTCCTT CACCGCCGGG GACGGATCGG TCACGCTGTC CTGGCCGGAG GCCTACTCCC CGGACTCCCC CGTGGAGACC TACGACATCA CGTGGCAGGG CGGCAGCACG ACCGTCGACG GCTCGGAGCT GGAGGCCACG ATCACCGGAC TGGAGAACGG CACCTCCTAC CGCTTCCGGG TGCGGGCATC CAACGCCTTC GGCACCGGTC CCGCGGCGCA GACCGAGGAG GTCACTCCCA GCCCCCGGGC CCCCGGCGCG CCGAGCGGCG TCGCCGTCGC CGCGGCGGGA AGCGACAGCG TGACCGTGTC CTGGGAGGCC GCCGAGGGGG CCGCGGACTA CCTCGTCTCC GCCTCCTCCG ACTCCGACCC GGTGAGCGAC CGCACGTCCA CGGGCACGTC GGTGGAGGTC GCCGGGCTGG CGCCGGGCGG CACCTACACC TTCACCGTGA CGGCGCGCGG CGCGGGGGGC GTCAGCGGCG AGTCCGCCAC GAGCGCCCCG TTCACCATGC CCACCCAGGA GATCGGAGCG CCCGCCGGCG TCTCCTTCAG CGCTTCCGGA GACACGGTGA CGGTCACCTG GTCACAGGTG GAGGGGGCGA CCCAGTACAC CATCACGCCG CACGGCGACG GCTCCAGGTC GCTGAGCGAG GTGAGCACGC CCGGCACCGC GGCCGGGGGC GGACAGCTCT CCTACACCTA CCAGCCGCGC GGCTCGGGGC GCTGCTACTC CTTCACCGTG CTGGCGGCGT CCGAGAGCGC CACCGCCGAC AGCGGCACGA CCAGTACCGC GAGCTGCTCA CGGGAGTTCA GATGA
|
Protein sequence | MVDQPTAPPT DPAPKPRPSP VRRAASALWR RTRSSAPGLM ISLLAAGLLS TALGAGAMGR ADEMSDGAVW LWDSPAGESF RVNGDNARID LVAALPGSAG RPVQVTQNDD YLLLHDPETG RVTSVDLREM GFSGVLELGT GGDFGLALGE EAAVVIDRAS GEVKAVDPAT LQPTGPSLRI PAPLVGGAFD DSDTLWLGVP TQGTVVGIRV EAEEAVITQT ASVADPGADI AVTVLDDGVL AVDRNGDRMV AVRNGGESRT ITSPVPLEGA EVPPRTRGDL AAVTLPGSGD VVTVSDPTGS AGVDHFSTGR EGGGTAVPYE GRFYVPFPEE GAVRVFGPSG DELNPITLPG AEGPLELEAR EGSLYINSPD TGVAAVVDPG GRATVIDKTA PPPGPGETDE DEPAPREPAP DGTTAPEPGD APVPDAGAPD AGDTTAPESG GTEGGAAPRA PAVESPRDDG EEEDEGTAPG APTPVSFTAG DGSVTLSWPE AYSPDSPVET YDITWQGGST TVDGSELEAT ITGLENGTSY RFRVRASNAF GTGPAAQTEE VTPSPRAPGA PSGVAVAAAG SDSVTVSWEA AEGAADYLVS ASSDSDPVSD RTSTGTSVEV AGLAPGGTYT FTVTARGAGG VSGESATSAP FTMPTQEIGA PAGVSFSASG DTVTVTWSQV EGATQYTITP HGDGSRSLSE VSTPGTAAGG GQLSYTYQPR GSGRCYSFTV LAASESATAD SGTTSTASCS REFR
|
| |