Gene Information |
Locus tag | Ndas_1487 |
Symbol | |
ID | 9245337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1820670 |
End bp | 1821935 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | protein of unknown function DUF21 |
Protein accession | YP_003679423 |
Protein GI | 297560449 |
COG category | |
COG ID | |
TIGRFAM ID | |
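
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Ndas_1487).
length = gene_length_bp(1820670, 1821935)
print(length, protein_length_aa(length))  # prints "1266 421"
```

Under these assumptions the computed values agree with the Gene Length (1266 bp) and Protein Length (421 aa) fields.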
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.121973 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.308123 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAGCT ACGGGGTTCA ACTCGGCCTC GTCCTTGTCC TCGTTCTCGT CAACGCCCTG TTCGCGGGGA GCGAGATCGC CCTGATCACC CTCCGGGAGG GGCAGATCAA ACAGTTGGCG GCGCGGGGTC CCGGCGGCCG GGCGGTGGCT CGCCTGGCAC GGGACCCCAA CCGTTTCCTG GCCACCATCC AGATCGGCAT CACCCTCGCG GGCTTCCTGG CCTCGGCCAC CGCGGCCGTG TCCCTGGCGC AGCCCCTCAT CGAGCCCCTG GGCTTCCTGG GCTCGGCCGC CAGCCCGGTG GCGATCGTCC TGGTGACCGT GCTCCTGTCC TTCGTCACGC TGGTCTTCGG AGAGCTGGCG CCCAAGCGCA TCGCCATGCA GCGGGCCGAG ACGTGGGCGG TGCTGGTCTC CCGACCGTTG GACCTGCTCG CCATGCTCTC CCGCCCCGTG GTGTGGCTGT TGAGCGTTTC CACCAACCTC GTGGTGCGCC TGACGGGCGG TGACCCCTCC GCGGCCAAGG AGGAGGTCAG CGAGGAGGAG CTGCGCGACA TGCTCGCCAC CCAGCGGGGC ATGACCCGGG AGCAGCGCAC CATCATCTCC GGAGCCTTCG AGATCGACGA CCGGCGCCTG CGCCAGGTCG TCGTTCCCCG TGGTGAGGTG TTCACCATCC CCGCCCGCAC GCCCGCGGCC CAGGCGGCGC AGATGCTCGC CGAACACGGG CACTCCCGGG CGCCGGTGGT CAACGACGAC GACCTGGACG ACGTGCTCGG TGTCGTGCAC TGGTCCGACC TGGTGCGCGG TGGGGCCGAC GCCGGAGAAC TGGCCCGCGA ACCGCTGCTC CTGCCCGATT CCCTGGTGGT GTCGTTGGCC CTGCGCCGCA TGATCGCCGA GCACCAGCAG CTGGGCGTGG TCATCAACGA GGTCGGTGGC GTCGACGGCA TCGTGAGCCT GGAGGACCTG CTGGAGGAGA TCGTCGGGGA GATCTACGAC GAGACCGATT CCGACATACG CACCGTGACC CGCAACGCCG ACGGGTCCTT CACCCTGCCC GGGACCTATC CCGTGCACGA CCTGCCCGAC ATCGACATTC ATCTGGACGA TCTGCCCGAA GGGGACTACG TCACCGTCGC CGGGCTGGTC ATCGCGGTGC TGGGCCACAT CCCCCAGGAG CCGGGGGAAG AGGTGGTGCT GGACTCCTGG AAGGCCAGGA TCGACCAGGC CAACGGGCGC ACCGTCACCC AGGTGACCAT GTCCCCCGCG GCGTAA
|
Protein sequence | MESYGVQLGL VLVLVLVNAL FAGSEIALIT LREGQIKQLA ARGPGGRAVA RLARDPNRFL ATIQIGITLA GFLASATAAV SLAQPLIEPL GFLGSAASPV AIVLVTVLLS FVTLVFGELA PKRIAMQRAE TWAVLVSRPL DLLAMLSRPV VWLLSVSTNL VVRLTGGDPS AAKEEVSEEE LRDMLATQRG MTREQRTIIS GAFEIDDRRL RQVVVPRGEV FTIPARTPAA QAAQMLAEHG HSRAPVVNDD DLDDVLGVVH WSDLVRGGAD AGELAREPLL LPDSLVVSLA LRRMIAEHQQ LGVVINEVGG VDGIVSLEDL LEEIVGEIYD ETDSDIRTVT RNADGSFTLP GTYPVHDLPD IDIHLDDLPE GDYVTVAGLV IAVLGHIPQE PGEEVVLDSW KARIDQANGR TVTQVTMSPA A
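
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup and of a GC-percentage calculation. The helper names are hypothetical, and the way the record's own GC content value was derived or rounded is not stated in the source.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used for block formatting and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence above, used as a short example.
example = "ATGGAAAGCT ACGGGGTTCA"
print(len(clean_sequence(example)), gc_percent(example))
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the GC content field.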