Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2214 |
Symbol | |
ID | 9246064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2644103 |
End bp | 2645098 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | transcriptional regulator, LysR family |
Protein accession | YP_003680142 |
Protein GI | 297561168 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00637243 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.0000100352 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAACGCA GTGAACACGA CGACACCGAC GCCCTGGCCC AGATCCTGGC GCCCCGCCTG CGCATGGTGG CCGCGGTGGC CGCCACCGAG CACATCACCC AGGCCGCCGA AGCGCTGGGC ACCCCCCAAC CGACCCTGAG CCGGGCCCTG GCCCGAACAC AGGAGGAACT GGGCATGGCA CTGGTCGAAC GCACCGGACG CGGAGTACGA CTGACCCGCG CCGGAAGACT GCTACTGCCG CACGTGGAAC GAGCCCTGGC GGACCTGCGC CAGGGCCTGG CCGAACTGAC CGGAGCCGAG GAGGGCCGGG TCGCGCTGAC CTTCCTGCCC ACACTGGGCG TGGAAGTGGT CCCGGCACTC CTGCGGGACT TTCGCGCCCG GCACCCGGGG GTGCGCTTCT CCCTCACCCA GGAGCCCTGG TCGGAGTCCC TGCACCGGCT CACCGCCGGC GGCGCGGACC TGGCACTGAC CTCACCGCTG CCCTCGGGTC AGGGACTGGC GGCGGCCACA CTGCACACCC AGGTACTGCG CCTGGTGGTG CCCGAACAAC ACCCGCTCGC ACAGGAACAC CAGACACCGG ACGGCGGGAC GGCGGGCCAA GAAGCGGGCG CGGCACCGCC CGGGGTGACG GTGACCGCCG CGGCACACGA GGAGTTCATC CTGCTCAAAC CCGGACGGGG CGTACGCCAC CTGACCGACC GCATCGTGGA ACAGGCGGGA TTCACCCCCC GGGTGGCCTT CGAGGCCGAC GACATCGCCA CCGCACGCGG ACTGGTCGCG GCGGGACTGG GGGTGTCCGT GCTCCCGGCC CGCCCCAAGG GACCGCTGAG CGGAACGGTG GAACTGGGCA TCGAGGGCGT GGACGCCCGC CGCCCCATCG GCGTGGTCTG GCCCCAGCAG GGGTCGGGCG GCGGCTACGA ACCCCCCGCG GTGGCACTGT TCCGCGACCA CGTGCGGCGG GTGGGCCCGC GCCTGATCCC CGACCTGACC GGCTGA
|
Protein sequence | MERSEHDDTD ALAQILAPRL RMVAAVAATE HITQAAEALG TPQPTLSRAL ARTQEELGMA LVERTGRGVR LTRAGRLLLP HVERALADLR QGLAELTGAE EGRVALTFLP TLGVEVVPAL LRDFRARHPG VRFSLTQEPW SESLHRLTAG GADLALTSPL PSGQGLAAAT LHTQVLRLVV PEQHPLAQEH QTPDGGTAGQ EAGAAPPGVT VTAAAHEEFI LLKPGRGVRH LTDRIVEQAG FTPRVAFEAD DIATARGLVA AGLGVSVLPA RPKGPLSGTV ELGIEGVDAR RPIGVVWPQQ GSGGGYEPPA VALFRDHVRR VGPRLIPDLT G
|
| |