Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_1803 |
Symbol | |
ID | 8603130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 2111502 |
End bp | 2114423 |
Gene Length | 2922 bp |
Protein Length | 973 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | putative sensor with HAMP domain |
Protein accession | YP_003299415 |
Protein GI | 269126045 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0110319 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGCAGC GGCTGGTCGT CCTGGTCCTG ATCCCGACCG CCGCGGCCAC GGTGCTCGGC GGGCTGCGCA TCACCGAGTC CACCGTCAGC GCCGACGCCT ACGGGCGGGT GGAACGGATG GCGCGGCTGG GCAACGGCAT CGCCGCGTTC GCCCAGGAGT TCGCCGCCGA GCGTGACATG GCCGCCGGAT ACATCAGCGC CGGCCGCTCC GGAAGGGGGG CCGATGAACT CCAGGCGCAG TACCGCAAGA CCGACGCCGC CGCCGACCAG GTCCGCCAGG TCGCCCTCGG CATCGACGAC TCCTTCTCCG CCGAGGCGGT GCGCGACGCC CGCAGCGTGC TCAACCGCCT CGACACCATC GGCTCGCTGC GCTCCACCGC CACCCAGAGC AGGGCGCCCG CCCTGGTCGT GGTGGAGAAG TACTCCGAGG TCATCAACGA GCTGATCGGT GTGCTGGACG GCGTCGCCCA GGGCGTGGCC GACGAGCAGC TGGCCGAGAC GACCCGGGCG ATGGCCGCCC TCTCCCGCGC CAAGGAGCAG GTCTCCCGGC AGCGCGCCCT GCTGACCATC GGCGCCGTCC AGGGCCGGCT CAGCGTCGAG GAGCTGGCCG CGCTGGAGGC CTCCCGCGAC CGGGAGAACA GCGAGCTGAA CGCCTTCCTG CAGACGGCCA CCCTGCCGCA GCGGCAGATG TTCGAAGACA CCGTGGTCGG CCCCCAGATC GACCGGGCCC GGGTCATCCG GCAGCAGGCC ATCGCCTCGG CCAACGCCAC CGGCGGCCGG CTGCCGCGCT CGCTGCGCAC CGCGGAGGCC ATCGAGACGC TGACCGCCTC GATGACCGCG ATGGTCGACC AGATGCGCAC CGCCGAACGC AACCTCGGCG AGGAGCTGCT GAAGCGGGCC GGCGACGAAA AGAGCTCCGC GCAGACCTCG GCCCTCATCG ACGGGGCCAT CACCGCCGTG ATCGTGCTGC TGGTGCTGCT GATCACCTCG ATCATGGCCC GCTCCCTGGT GCGCCCGCTG CGGCGGCTGC GCGACAGCGC CCTGGAGGTG GCCGGCACCC GCCTGCCCGG CCTGGTGGAG CGGCTGCGCG ACCCGCAGGC GGCGGCCGGC GGCATCGAGG TCGAGCCGAT CGACATCGAC AGCACCGACG AGATCGGCCA GGTGGCCCGC GCCTTCGACG AGGTCCACCG CGAGGCGGTG CGGCTGGCGG CCGACGAGGC GGTGCTGCGC GGCAACATCA ACGCCATGTT CGTCAACCTC TCCCGGCGCA TGCAGTCGCT CATCGAGCGC CAGCTGCGCC TGATCGACGA ACTGGAGCAG AACGAGCAGG ACTCCGAGCA GCTGGCCAAC CTGTTCCAGC TGGACCACCT GGCCACCCGC ATGCGCCGCA ACTGCGAGAA CCTGCTGGTT CTGGGCGGCC AGGAGCAGGT GCGGCGGTGG AACCAGCCGG TGCCGCTGAT CGACATCGTG CGGGCCTCGC TGTCGGAGGT CGAGCAGTAC GAGCGGGTCA CCCTGCGGGT GCAAAGCGAT GTGTCGGTCA CCGGCCAGGT GGTCAACGAC CTGGTCCACC TGGTGGCCGA GCTGGTGGAG AACGCCACCG TCTTCTCCCC GCAGCACACC AAGGTCACCG TCTCCGGGCA CCTGCTGTCC GGCGGCGGGG CGATGCTGCA GATCACCGAC AACGGCGTCG GGATCTCCCC CGAGGACCTG GAGCAGGCCA ACTGGCGGCT GGCCAACCCG CCGGTCATCG ACTTCTCCGC GGCCCGCCGC ATGGGACTGT TCGTGGTCGG CCGGCTGGCG ATGCGCCACA ACATCCGGGT CGAGCTGCGG CCCGCGCTGT CGGGCGGGAT CACCGCGTTC GTGCTGCTGC CGTCCTCGGC CATCGCCCAC GACGAGGAGT CCACCGGCCA GGACGCGCCG CTGAGCCCCG ACCCGGCCGT CGACCAGCCG GTGACCAGCA CGGCCTGGAC GCCGACCCCG GTGGCCGCCG GAGTGGCCTC CTCCTCCTCC TCCGGCACCG GCCCGCTGCC CGCCGTGGGC GGGCGGCCGT CCTCGCCGCT GGGCCGCCGC GACGGACGCG GCACCGGACC CCAGCCCACC CTGCGCGACT CCGGCCCGCT GCCGACCGTC GGCGATGGGG CCGTGCCGCC GCCTCGCACC CCCTCCTCCA CCGGAAACAC CGGCGCCCAG CCCTCGGTGC CCGAGCGTGA GACCGGCCCG CTGCCCGTCA CCGGCTCGGG CGGTCCGCGC CGGCCGCCGG CAGACCAGAC GGGACCACAG CCCATCGTGT CGGATACCCC TCAGCCTCTC GGACAGCCCA CCCTCGCCGC CCGCCTGCCC GGCTTCCCCT CGCCCGGCCG GGACGGCGCC GCCGGCTCAC CGCCCGCGTC GGCGACCGGC CCGGGCGACG ATGTCCTCTC CGGATCGCCG TCGGGCGCTT CCGGCTCCGA ACGCACGGTC TCCTGGGGGA ACGGCGAGGA GACGCGTCCG GTGGAGACGG GCGGACGCTC GCCGATCTTC GATGCGATGG AGTCGGAGTG GTTCCAGCGT CGCTCCGGCG GGTCCGGCGG CCAGAGCTGG AAGTCTCCCG GCGACGCCGG CTGGCAGGCC GCGCAGGTCG TCCGGCAGCC CGCCTCGGGC GGGGTGACCA AGGCGGGGCT GCCCAAGCGG GTGCCCGGCC GCAACCGGGT GCCCGGAGCG GTGAGCCAGC AGCCGGTCTC CCGGCCGACC CATATCGGGC CGAATCAGTC CGCCGACCGG ATGCGTGATC GCTTCGCTAG CCTGCAACGT GGTGTGCACC GCGGCCGATC GGAGGCCCGC TCGGGGCTCG GCCAGGCCGG TGCTTCGGAT GAGATCGTCC CGCCGGACAC CGGCGGCGCC AGGCAGGCGC AAGGGGACAC AGAAAGGGGA GAATCCGCGT GA
|
Protein sequence | MAQRLVVLVL IPTAAATVLG GLRITESTVS ADAYGRVERM ARLGNGIAAF AQEFAAERDM AAGYISAGRS GRGADELQAQ YRKTDAAADQ VRQVALGIDD SFSAEAVRDA RSVLNRLDTI GSLRSTATQS RAPALVVVEK YSEVINELIG VLDGVAQGVA DEQLAETTRA MAALSRAKEQ VSRQRALLTI GAVQGRLSVE ELAALEASRD RENSELNAFL QTATLPQRQM FEDTVVGPQI DRARVIRQQA IASANATGGR LPRSLRTAEA IETLTASMTA MVDQMRTAER NLGEELLKRA GDEKSSAQTS ALIDGAITAV IVLLVLLITS IMARSLVRPL RRLRDSALEV AGTRLPGLVE RLRDPQAAAG GIEVEPIDID STDEIGQVAR AFDEVHREAV RLAADEAVLR GNINAMFVNL SRRMQSLIER QLRLIDELEQ NEQDSEQLAN LFQLDHLATR MRRNCENLLV LGGQEQVRRW NQPVPLIDIV RASLSEVEQY ERVTLRVQSD VSVTGQVVND LVHLVAELVE NATVFSPQHT KVTVSGHLLS GGGAMLQITD NGVGISPEDL EQANWRLANP PVIDFSAARR MGLFVVGRLA MRHNIRVELR PALSGGITAF VLLPSSAIAH DEESTGQDAP LSPDPAVDQP VTSTAWTPTP VAAGVASSSS SGTGPLPAVG GRPSSPLGRR DGRGTGPQPT LRDSGPLPTV GDGAVPPPRT PSSTGNTGAQ PSVPERETGP LPVTGSGGPR RPPADQTGPQ PIVSDTPQPL GQPTLAARLP GFPSPGRDGA AGSPPASATG PGDDVLSGSP SGASGSERTV SWGNGEETRP VETGGRSPIF DAMESEWFQR RSGGSGGQSW KSPGDAGWQA AQVVRQPASG GVTKAGLPKR VPGRNRVPGA VSQQPVSRPT HIGPNQSADR MRDRFASLQR GVHRGRSEAR SGLGQAGASD EIVPPDTGGA RQAQGDTERG ESA
|
| |