Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0957 |
Symbol | |
ID | 9244802 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1172670 |
End bp | 1173917 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | Extracellular ligand-binding receptor |
Protein accession | YP_003678907 |
Protein GI | 297559933 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.408649 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAGCA AGAAGGTCCT AGGCCTCACT GCCGCAACCG CGGCCCTCGT CCTCGGCCTG ACCGCGTGTG GCAGCGACGG GGGAGAGGGA GGCGGCGGCG GTGAGGGCGG TGAAGAGTTC ACCTACGGCA TCCTCTACCC GCAGACCGGC AACCTCGCCT TCCTCGGCCC GCCCCAGATC ACGGCCGCCG AGTACGCGAT CTCCGAGATC AACGCCGCGG GCGGCATCCT CGGCACCGAG GTCCCCGCCA TCGTCGAGGG CGACGAGGCG GGCGACAACG CCCAGGCCAA CGAGGCCGCC AACAACCTCG TCTCCGACCA GGTGAACGCC GTCATCGGCG CCGCGGCCTC CGGCATGACC CAGGCGACCT ACGACACCAT CACCGGTGCC GAGATCGTCC AGTGCTCGGG CTCCAACACC GCCGCCGAGC TGAGCGAGAT CGAGGACAAC GGCTACTACT TCCGCACCGC GCCGAGCGAC CTGCTCTCCG CGGTCGTGAT GGCCCGCACG ATGGTCGAGA ACGGCAACCA GAACATCGCC ATCGTCGCGC GCGCCGACGA CTACGGCGGC GGCTACGCGG GCGCCCTCCA GACGGAGCTG GAGAACCTGG GCGCCCAGGT CGTGGTCAAC GAGACCTACG ACCCGCTGGC CACCACCTTC GACTCGGTCG TCAACAGCGT CACCACCGAG GAGCCGGACG CCGTCGCGCT CATCGCCTTC GAGGAGGGCG CGCAGGTCAT CGCCCAGCTC CTGGAGGGCG GCACCGAGGG CGAGCAGCTC TACGTCACCG ACGGCCTCAA CGACCCGAAC CTGGGCGAGA CCGTCAGCGC CGACAGCCCC GAGAGCGTCA CCGGCATCAC CGGTATCGCC CCGAGCGCGG ACAACCCCGA GTTCACCGAG GGCCTGACCA GCTTCAACGA GGAGCTGGAG GTCTTCCAGT TCGCCCCGCA GGTCTACGAC TGCGTCACCG TGATCGCCCT GGCCGCCGAG GCCGCGGGTA GCGTGAACCC GTCCGAGTAC GTCGCCGAGC TGCCCAACGT CAGCCGTCCC GAGGGCACCG AGTGCGGCAC CTTCGAGGAG TGCCGCGACC TGCTGGCCGA CGGTGAGGAG ATCAACTACC AGGGCGTCAG CGGCAACATC GACTTCAACG ACAACGGCGA CCCGACCGCC GCCACCTTCG AGATCTTCCA CTACGGGGAG GACGGCCACG AGATCCTGGC CTACGAGGAG CACTCCCTGG AGGAGTAG
|
Protein sequence | MASKKVLGLT AATAALVLGL TACGSDGGEG GGGGEGGEEF TYGILYPQTG NLAFLGPPQI TAAEYAISEI NAAGGILGTE VPAIVEGDEA GDNAQANEAA NNLVSDQVNA VIGAAASGMT QATYDTITGA EIVQCSGSNT AAELSEIEDN GYYFRTAPSD LLSAVVMART MVENGNQNIA IVARADDYGG GYAGALQTEL ENLGAQVVVN ETYDPLATTF DSVVNSVTTE EPDAVALIAF EEGAQVIAQL LEGGTEGEQL YVTDGLNDPN LGETVSADSP ESVTGITGIA PSADNPEFTE GLTSFNEELE VFQFAPQVYD CVTVIALAAE AAGSVNPSEY VAELPNVSRP EGTECGTFEE CRDLLADGEE INYQGVSGNI DFNDNGDPTA ATFEIFHYGE DGHEILAYEE HSLEE
|
| |