Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_1670 |
Symbol | |
ID | 8602993 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 1955621 |
End bp | 1958587 |
Gene Length | 2967 bp |
Protein Length | 988 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003299283 |
Protein GI | 269125913 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.596851 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTGCGCGG GCGTGAACAA CCGCCCTCTG CCCGATGAGC ACACCTTCGA GATCCCGGCG TCCTGGCGCT CCCAGTTGCA TCCCAGGCGC GACGGCACGC CGGCGCCCCC CATTCACCCC GATCCGCAGG CGGCCGGCAC CGTGCGTGCG CTCATCGAGG AGCACGCCGA ACTGATCGAC CTGCTGGTGA CGGGGGGCGA CTGCGATCCT GAGCTCGTCG CGGCCGTCGG GCGGCATCTG GACGGCGATC CCGACCCGCT GGGCGCGGCG GCGATCGCGG CCGTCATCAC GCAGCGGTCC GTGATGCGGA AGATGGAGGA GAGCCAGGCC TTCTTCGACT CCTGGGTCGT CACGCACGGG CTCGGTTTCG CCGCCTGCGC GGTGGTGGAG CTGAGCGGGC TCTACGCCGT CCACTGGAGA GGCGGCTGGG ATGAAAAGCC GCATGTCCTG GTCGAGCCGA AGGGGGACAT CCACCAGCAC TGTCCCGTCA CCCTGCGCCG CGCCCGCAGC CTGCTGGCAT CCGCCGCGGA CGGCGTCTAC CCGGAGGCCG AAGCGGCCCT GGCCGGCCAC CGGCGCTCCG CGACGCAGCG AGGGCTGGTG TCCTACCTGG TGCCCACCCG GCAGGACTGG GTGGAGGAGT GCTGCGCCGA CCGGTCCATG GGGCCGAACG TCTCCCACTT GCTGTATTAC TCGCTGCGCA CCAGGAAGCA ATATCTCACC CTCCACAACA GGGTGGGTTT CGACGCCGGG CGCGTCTCCA TTTACAGCGT GCTCCCCACG CTGCTTGAAG GCATGGGCCC GGCCGCCCTT CCCCTGCTCT TGGACGTGCT CGACCGGGGC GATCTGGGCG AAGAAAAGCG CCAGTACGTC CTCGGCCTGA TCGCCGAGCT GCCCACGGAC GAGGCGTTCA AGGGGCTGCT GGGCCGGCTC GACCGCCGGG GCGTCCGGCC GGTGCTGCAG GCGATGGCGC GGCGCTTCCC CGTCCGGGCG CTGCGGTTGC TGGCCGAGGC CGCCCCGGTC TCCTTCGACG CCGCCCTGCT GCTGGCCGAT CACCTGTCGG CCGACGCCGA ACTGGCCGCG GCCGCGCTCC GCAGGCTCCC GCCCGGGACG CGCACGGCGG CGGAGTCGGC GATGGCGTCC CTCGCCCGGG TCCCGGAGGC GCCGGCGGAG GCGGTGCCCG CGGTCCTGGC CGGGCCCGCC GGCCCGCGGT CCCGGCCCGT CCTGGAAGAG CTGCAGGCGC CCGCGCCGCG CGTCGCGTGG GGTGCGGGCG AGCGGCGGCA GTGGCTGGAG GACGTTCCGG GCCTGCTCCC CGTGCCGCCG GATGCGGACT GGAAGGCGCT CATCGGGGAG TTCCGGTCCG GCACCCCGTC GATCAGCATC GCCCAGCTCG TGGTGTACGG GCCCGAGGAA CTGGCCTGGC CGCCGGACGA AGAGCGGGAC CGCCGGGCGC GGGAGGAGGT CCCCTGGCTC AAACCGCTCG TCGCCCGCCA CGGGCTGCGG GCGCTGCCGG TGGCGGTGGA GATGGCCGAG GCGGATCCCG CGCACTGCGG GACGGCGCTG CTGCCCTTCT GGCACGCCGA TGTGGCGCTC ATGGCCGCCG GCTGGCTGAC CCGCGGCGGC GACCGCGCCG ACACGGCCCG CCGGTGGCTG GACCGGCACG GCCCGGCGGC GGCGCCGCTG CTGGCTCCCG CGGCGCTCGG CAAAACCGCC GAACGGCGTC CGGCCGAATG GGCACTGCGG TACCTGCTGC TCCGGCACGA CCTGGAGGAG ATCGTCCGGG CCGCGCAGGC GGTTCACGGC CGGCGGGCCG CCGACGCGCT GGAGGCGTTG CTGAGCGCGC ACCCGGTGGA GACCGGCCTG TGCCCGACGC CGAAGATTGG CGACTGGCTG GACCCGGCGT CCCTCCCCCA GGTGCTGCTG CGCGACCGGC GGCTGGCCCT GCCCGCAGCA GCGGCCGAGC GGCTGGTCGC CCTGCTGGCG CTGCCGTTCC CCCACGGGGT GCGGGAGATC AGGCGGGCCT GTGACCCGCG GTCGCTGGCG AGGTTCGGCT GGGCGCTGCT GCGGCAGTGG CGGGAAGGCG GCGCACCGGC CCAAGACGAC TGGGCCCTGA CGCAGCTGGC CTGGACCGGC AACGAGGAGA CCGTGGAGCA GCTGGCGGCG CTCATCCCGC TGTGGGCCGA GGAGGGCAGG CACAAGGACG CGGCCAAAGC GCTGGGCGTG CTCGCCGACA TCGGCTCCGA TGCGGCCCTG AAGCACCTGC ACGACGTTCC CCGCAAGGCC ACGTCCAAGA AGCTGCGGCA GGAGGCGCAA AGGGGAATGC GCCGGATCGC CGGCCGCAGG GGGCTGTCCG CCGAGCAGCT GGCCGACCGG ACCGTCCCCG ACCTGGGGTT GGCGGTCGAT GGCGGTCTGG TGCTGGACTA TGGGCCGCGA CGCTTCACGG TCGATCTCGA CGAGCGTTTC AAGCCGGTCG TGACCGACCA GAAGGGAAGG ACGCGCAAGA CCCCTCCCAG CCCCGGCGCC AAGGACGACC TGCTGCTGGC GCCCGCCGCC CACCAGAGGT TCACCGCCTT CCGGAAAGAG GCGCGCGCGG CCGTCGCCGA CCAGATCCGG CGGCTGGAGA CGGCCATGGC GTCCGGGCGG CGGTGGACGG CCGACCAGTT CGGCGACTTC ATCCTCGGTC ATCCGCTGAT GTGCCGGCTC GCCCGAAGCC TGGTGTGGGT GAGCGAGGAC GGCGACGCGG TGACCGCGTT CCGCATCACC GAGGACCGCA CGCTCGCCGA CGTCGAGGGC GAGAAGTTCA CCCTGTCCCC CTCGGCCCGC GTCGGCGTCG CCTCTCCGGA GCTGCTGGGG GACGCCGTGG CCGCCTGGTC GCAGACGTTC GCCGCCCACG GGATCCACCA GCCGTTCCCC CAGCTCGCCC GGCCGCTGCC CGCACCGGGC GAGCCGAAGG CGGCACTCGG CGGCTGA
|
Protein sequence | MCAGVNNRPL PDEHTFEIPA SWRSQLHPRR DGTPAPPIHP DPQAAGTVRA LIEEHAELID LLVTGGDCDP ELVAAVGRHL DGDPDPLGAA AIAAVITQRS VMRKMEESQA FFDSWVVTHG LGFAACAVVE LSGLYAVHWR GGWDEKPHVL VEPKGDIHQH CPVTLRRARS LLASAADGVY PEAEAALAGH RRSATQRGLV SYLVPTRQDW VEECCADRSM GPNVSHLLYY SLRTRKQYLT LHNRVGFDAG RVSIYSVLPT LLEGMGPAAL PLLLDVLDRG DLGEEKRQYV LGLIAELPTD EAFKGLLGRL DRRGVRPVLQ AMARRFPVRA LRLLAEAAPV SFDAALLLAD HLSADAELAA AALRRLPPGT RTAAESAMAS LARVPEAPAE AVPAVLAGPA GPRSRPVLEE LQAPAPRVAW GAGERRQWLE DVPGLLPVPP DADWKALIGE FRSGTPSISI AQLVVYGPEE LAWPPDEERD RRAREEVPWL KPLVARHGLR ALPVAVEMAE ADPAHCGTAL LPFWHADVAL MAAGWLTRGG DRADTARRWL DRHGPAAAPL LAPAALGKTA ERRPAEWALR YLLLRHDLEE IVRAAQAVHG RRAADALEAL LSAHPVETGL CPTPKIGDWL DPASLPQVLL RDRRLALPAA AAERLVALLA LPFPHGVREI RRACDPRSLA RFGWALLRQW REGGAPAQDD WALTQLAWTG NEETVEQLAA LIPLWAEEGR HKDAAKALGV LADIGSDAAL KHLHDVPRKA TSKKLRQEAQ RGMRRIAGRR GLSAEQLADR TVPDLGLAVD GGLVLDYGPR RFTVDLDERF KPVVTDQKGR TRKTPPSPGA KDDLLLAPAA HQRFTAFRKE ARAAVADQIR RLETAMASGR RWTADQFGDF ILGHPLMCRL ARSLVWVSED GDAVTAFRIT EDRTLADVEG EKFTLSPSAR VGVASPELLG DAVAAWSQTF AAHGIHQPFP QLARPLPAPG EPKAALGG
|
| |