Gene Information |
Locus tag | Ndas_2240 |
Symbol | |
ID | 9246090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2680860 |
End bp | 2682455 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 80% |
IMG OID | |
Product | transcriptional regulator, PucR family |
Protein accession | YP_003680168 |
Protein GI | 297561194 |
COG category | |
COG ID | |
TIGRFAM ID | |
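
The length fields above follow from the coordinates. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 2680860
end_bp = 2682455

# Inclusive interval length: both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which contributes no residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
```

Under these assumptions the result agrees with the listed gene length of 1596 bp and protein length of 531 aa.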
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0227847 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.453495 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGGGAC AGAGCGGGGC GCAGGGCCAC ACGGTCCCGT TGAGCACCGT CGTGGGCCGC CGCGACCTGG GGCTGACCAC CCTCGTGGCC GCCGGGAACC CGGGGATCGG CTGGGCGGTG GCCAGCGAGC TGGCCGACCC GGCCGCCTAC CTGCGCGGCG GTGAACTGCT GCTCACCGCG GGCGTCAACC TGCCCGCCGC CCCCGCCGGG CTGCGCGGCT ACGTGGACTC CCTGGTCGGC GCGGGGGTCA GCGCCCTCGG TTTCGGTGTG ACCCCGGTGC ACGACACCGT CCCCGCCGGG CTGGTCGAGC AGTGCCGCGC ACGGGGGCTG CCCCTGGTGG AGGTGCCCCG CCCCACACCC TTCGCCGCCG TCAGCCAGGC CGTGGGCGCC GAACTCCAGG AGCTGCACCT GCGCGACCTG CGCCGCCTGG GCGAGGCGCA CCAGGCCCTG GCCCTGGCCG TCACCGCCGA CGCCCCCGTG GACCGGGTCC TGCGGGTCCT GGCCGACGCC CTGGACGGCT GGGCGGTCCT GGCCCGCCCG TCGCCCGCCG TGCCGGGCGG CGCCCACCGC ACGCCGGGCG CCCCGGCGGA GCTGGACCCC GAACTGCGCG GGCTCGCGGA CCGGCTCACC CGGCCGCGCG GCCCGCGCGG CGCCAAGGCC CGTGTGCGGG GGGACGAGGT CTTCCTGCAC ACCGTCGGCA CCCCGCCGCA GGAGCACGGA GTGGTCCTGG TCGGCCGCCC CGAGCCGCTG GACGTCACCG ACCGGGCCGT CCTGCGCACC GCCACCGCCC TGCTGGACCT GCTCGCCCGC GCCTCCCGGG GCGCCCCGCC CGCCCCGGGC CGCCTGATCA CCGGTCTGCT CCTGGACGGC GGGCTCACCG GCGCGGCCGT GCCGCTGCTG GCCGAACTCA CCGCCCCGAC GGACGCCTTC GTGGGGTCGG CGGCGACCGG GGCCTCCGTG AGGGCCGGCG CACCGGGGGA CCCCGCCGCC GCGGGCGCGC CGGGGGACCC CGCCGCCTAC CGGGTCCTGC GCGCCCGGCC CGCCGGGCGG GGCCGCCACA CCGCCCCCGC CGCGCTGCCC CTGGGCACCC GGCTGCTGGA CGCGGGCCCC GGCGAGGACC TGCGCGCCGT CCTCGCCGAC CGGGGCGAGG CCGCGCACCT GGCCCACCTG GACCGGCTGC TCGACCACGG CTGGATCGGC GCGCTGAGCG GGCCGGTGGA CCCGGCGGAG CTGGCCGCCG CGGACCGCCG GGCCGCCGCC CTGCTCACCC GTGCCCGCGC GGTCGGCGGG CCGCTGCTGG AGGAGCCCGC CGACCCCTTC GACGCCCTCC TGGGGCCGGG GGGAGGAGAG GACCTGGCCC GGCGTGTCCT GGGACCGCTG GCCGAGGACA CCGACTCCGC GCGCCTGCTG CGCCGCACCC TGCGGGTCTG GCTCACCCGG CACGGCAACT GGGACCGGGC CGCCGCCGAC CTGGGCGCCC ACCGCAACAG CGTCCGCTAC CGGATCGGGC GGATCGAGCG CGATCTGGGC GTGGACCTGG CCGACGCCGA GCAGCGCATG CGGCTGTGGT TCGCGCTCAC CCGCTGGCGG CACTGA
Protein sequence | MSGQSGAQGH TVPLSTVVGR RDLGLTTLVA AGNPGIGWAV ASELADPAAY LRGGELLLTA GVNLPAAPAG LRGYVDSLVG AGVSALGFGV TPVHDTVPAG LVEQCRARGL PLVEVPRPTP FAAVSQAVGA ELQELHLRDL RRLGEAHQAL ALAVTADAPV DRVLRVLADA LDGWAVLARP SPAVPGGAHR TPGAPAELDP ELRGLADRLT RPRGPRGAKA RVRGDEVFLH TVGTPPQEHG VVLVGRPEPL DVTDRAVLRT ATALLDLLAR ASRGAPPAPG RLITGLLLDG GLTGAAVPLL AELTAPTDAF VGSAATGASV RAGAPGDPAA AGAPGDPAAY RVLRARPAGR GRHTAPAALP LGTRLLDAGP GEDLRAVLAD RGEAAHLAHL DRLLDHGWIG ALSGPVDPAE LAAADRRAAA LLTRARAVGG PLLEEPADPF DALLGPGGGE DLARRVLGPL AEDTDSARLL RRTLRVWLTR HGNWDRAAAD LGAHRNSVRY RIGRIERDLG VDLADAEQRM RLWFALTRWR H
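
The gene sequence starts with GTG rather than ATG, yet the protein starts with M. This is consistent with translation table 11, where GTG is an accepted initiation codon and is read as methionine at the start position. The sketch below illustrates this on the first 30 bases and the last 6 bases of the gene sequence. It is not the pipeline used to produce this record: the function name `translate_cds` is hypothetical, and the `START_CODONS` set is a deliberately partial list of table 11 start codons.

```python
# Codon table built from the usual TCAG ordering. Table 11 assigns the same
# amino acids as the standard code; it differs in which codons may initiate.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a partial set of table 11 start codons, enough for this example.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence, reading a valid start codon as M
    and stopping at the first stop codon (hypothetical helper)."""
    cds = cds.replace(" ", "").upper()
    residues = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        amino = CODON_TABLE[codon]
        if amino == "*":  # stop codon ends translation
            break
        residues.append(amino)
    return "".join(residues)


# First three 10-base groups and the final 6 bases from the record above.
first_30 = "GTGTCGGGAC AGAGCGGGGC GCAGGGCCAC"
last_6 = "CACTGA"

n_terminus = translate_cds(first_30)
tail_codons = [CODON_TABLE[last_6[:3]], CODON_TABLE[last_6[3:]]]
print(n_terminus, tail_codons)
```

The N-terminal residues match the first ten residues of the listed protein sequence, and the last two codons decode to H followed by a stop, matching the final residue of the protein.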