Gene Information |
Locus tag | Tcur_1672 |
Symbol | |
ID | 8602995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 1960254 |
End bp | 1961582 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | transcriptional regulator, XRE family |
Protein accession | YP_003299285 |
Protein GI | 269125915 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
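The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon; the record does not state either, but both are consistent with the values it lists. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1960254
end_bp = 1961582
listed_gene_length_bp = 1329
listed_protein_length_aa = 442

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop,
# so the protein has one residue fewer than the number of codons.
codons = gene_length_bp // 3
expected_protein_length_aa = codons - 1

print(gene_length_bp)              # 1329
print(expected_protein_length_aa)  # 442
```

Both computed values match the Gene Length and Protein Length rows of the record.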
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000955969 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGAGC AGGTGCGCCC ATGGTGGGCG GTCCGGTTGC GGAACGAACG CCGGGCCCGC GGCTGGACCC AGCGGGACCT GGCCGACAGG CTCGCCCAGC AGGTGACCGA GCAGTGTCCC GAGCTCGACT GCTTGATCAC CTACATCAAG CGCTGGGAGT CGGGTAAGAG CACAGTCAGC AAGCGTTACC GGCTCGCCCT GGCCGCCGCG TTCGAGGTGC GGCATGACGA ACTGTTCGCT CCCCCGGCGT CCGTCCGGGA CGAGACCATG CGGCAGATTC CGGCCACGCC ATGGGACGGC CCGGCGCGTA GCGTGGAGAA CGCGGACTCC TCCAGCGATT GGGACGAGAT GAAGCGTCGT CTGCTGCTGC AACTGGCCGC GGTGGGGGGC GCCGGCGCGC TCGCCGCCAC CGGCGAACCA GTCCGGCTGC TCCTTGAGCA GATCCTGGCC CGTGCTCCCC GGTCGGTCGG CGACTGGGAG ATCGCCTGCG CCGACCACCT GCATGCGATC CGCACCCGCC CCCCGGCCCA GATCCGCGAC GATCTGCCGG CCGACCTGCT GGTCCTGCAG CGGCAGATCG CCGACCCCGG CGACCAGGAC GTCACCGAGC TGCAGCGGAT AGCCGCCGCG CTGTCCACCC TGTACGCCAA CGTGCTGACC CGCCTGGGCG AGCACGGCGC GGCCCTGCGC TGGTGGCGCA CCGCCCGCGA CGCCGCCGAC GCCTCCGGGG ATCTGGAGCT GCGGGTGCAG GTCCGTTCCG AGGAGGCCGG GTTCGGTCTT TACGGTCAGC GGGATCCGGT GACCGTGCTG CGCCTCACCG AGGAGGCACG GCGGCTCGCC GCAGGCCGGC CGTTCGCCGG GGTCGCCTCC CTGGCCGGCA CCCGGGCGAA GGCGTTCAGT CTGCTCGGCA GGCACGCGGA GGCGAAACGA GAGCTGCTGG TCCTCGCCGG CAGCACACCG CACACCTCCT CGTCCGGCCC GATCCCCAAC CTGTGGCGGG AAGATCAGGT GCACTTCGCC GAAAGCTGGG TCCACGCCGC CGCCGGGAAC GAGGCCGAGG CGGACGAGGC CCGGGAGCGG GTGCTGGCGT TCAAGGGCGA CTACCAGTAC GACGTCAACG TCCGGCTGCA CGAGGCGCTG TGCACGGTCG CCCAGGGCGG CTTCGAGGCG GGCGCACGGC ACGCGGCGAC GATCTTCGAC ACCATGCCCG CCGCCCGCCA CAGCCGAATG ATCACCGAGA CGGGCCGGAT GATCCTGCGC GCCGTCCCGG CCGAGCACCG CGACCACCCG GCGGTCGCCG ACTTCCGCGC CCTCCTGGCC GCCGCCTGA
|
Protein sequence | MAEQVRPWWA VRLRNERRAR GWTQRDLADR LAQQVTEQCP ELDCLITYIK RWESGKSTVS KRYRLALAAA FEVRHDELFA PPASVRDETM RQIPATPWDG PARSVENADS SSDWDEMKRR LLLQLAAVGG AGALAATGEP VRLLLEQILA RAPRSVGDWE IACADHLHAI RTRPPAQIRD DLPADLLVLQ RQIADPGDQD VTELQRIAAA LSTLYANVLT RLGEHGAALR WWRTARDAAD ASGDLELRVQ VRSEEAGFGL YGQRDPVTVL RLTEEARRLA AGRPFAGVAS LAGTRAKAFS LLGRHAEAKR ELLVLAGSTP HTSSSGPIPN LWREDQVHFA ESWVHAAAGN EAEADEARER VLAFKGDYQY DVNVRLHEAL CTVAQGGFEA GARHAATIFD TMPAARHSRM ITETGRMILR AVPAEHRDHP AVADFRALLA AA
|
| |
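The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own tooling: `clean_sequence`, `gc_fraction`, and `translate` are hypothetical helper names. The codon table uses the amino acid assignments of translation table 11, which are the same as those of the standard code; the alternative start codons that table 11 permits are not handled, which is sufficient here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
fragment = clean_sequence("ATGGCCGAGC AGGTGCGCCC ATGGTGGGCG")
print(len(fragment))               # 30
print(translate(fragment))         # MAEQVRPWWA
print(gc_fraction(fragment[:20]))  # 0.75
```

The translated fragment matches the first block of the protein sequence in the record. Passing the full gene sequence through the same helpers would allow the listed protein sequence and GC content to be checked in the same way.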