Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0038 |
Symbol | |
ID | 9243865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 49274 |
End bp | 50449 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Extracellular ligand-binding receptor |
Protein accession | YP_003677996 |
Protein GI | 297559022 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0144084 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.178822 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAAGA CCCCCCTGAC CGCCGCCTCC TGCGCGGTCG TCCTCGCCGT GACCGCCTGC GGAACCCCCG GCGAGGCCGC CGGGGGAGGC GAGGACGCGC CCGTCAGGGT CGGCATCGTC TACTCCGCCA CCGGCCCCCT GGCCACCTAC GGCGAGCAGT ACCGGCAGGG CTTCGAGGCC GGACTCGACC ACGCCACCGG CGGGACGATG GAGATCGACG GCCGCCCGAT CGAGGTCGAG TACATGGACG ACGCAGGCGA CCCCACGAAG GCGGTCACGG CCACCCGCGA CCTCATCGGC ACCGGCCACG ACATCATCGC CGGGTCCACC GCCTCCGGCA TCGCCGTCCA GGTCGCCCCG CTCGCCGAGC AGAACGACAT CCTCTTCATC TCCGGATCCG CCGCCACCGA CGCCGTCACC GGCGTCAACG ACCACACCTT CCGCTCCGGG CGCCAGACCT ACCAGGACAT CCTCACCGCC GGAACCTTCA TGGACGACCC CGAGGGCGCG GACGTCCTGG TCCTGGCCCA GCAGAACGCC TTCGGCCAGG ACAACGTCGC CGCCGTCACC GACGTCCTCG GGGCCGAGGG GGCCGACGTC GACAGCGTCC TGGCCCCGCC GGAGACCACC GACCTCACCC CGTTCGCCGA GCAGGTCAGC CAGGCCGAAC CCGACCTGGT CTTCGTCGCC TGGGCGGGCG AGACCGCCTC CGCCATGTGG CGGGCACTGG ACCAGCAGGG CATCCTCGAC TCCACCGAGG TCGTCACCGG ACTGGACATC AAGCCGTCCT ACCCGGTCTT CGGCGAGGCG GGCGGCCGCA TCTCGTTCCT CTCCCACTAC TTCGACGGCG CCTCCGACAC CGAACTGGCC CGGACCATGA AGGAGTCCGT CGAGGAGGCG GGCGGCACCG TGGACCTCTT CACCCCCGAC GGGTTCACCG CCGCGCAGAT GGTCGTGCAC GCCGCCGGGG CCGGGGACGA AGTCCGGGAG CGCATCGACG CCCTGGAGGG CTGGACCTTC GACGGCGTCA AGGGTGAGCT CACCATCCGC GCCGAGGACC ACGCCCTCCT CCAGCCCATG TACCAGGTCG AACTGGTCGG CGAGGGCGAG GACGCCCACC CCGAACTGGT CGCGGAGATC CCCGCCGCGG ACGTCGACCC CGCGGTCGCG GAGTAG
|
Protein sequence | MRKTPLTAAS CAVVLAVTAC GTPGEAAGGG EDAPVRVGIV YSATGPLATY GEQYRQGFEA GLDHATGGTM EIDGRPIEVE YMDDAGDPTK AVTATRDLIG TGHDIIAGST ASGIAVQVAP LAEQNDILFI SGSAATDAVT GVNDHTFRSG RQTYQDILTA GTFMDDPEGA DVLVLAQQNA FGQDNVAAVT DVLGAEGADV DSVLAPPETT DLTPFAEQVS QAEPDLVFVA WAGETASAMW RALDQQGILD STEVVTGLDI KPSYPVFGEA GGRISFLSHY FDGASDTELA RTMKESVEEA GGTVDLFTPD GFTAAQMVVH AAGAGDEVRE RIDALEGWTF DGVKGELTIR AEDHALLQPM YQVELVGEGE DAHPELVAEI PAADVDPAVA E
|
| |