Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_4025 |
Symbol | |
ID | 8605381 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 4597117 |
End bp | 4598748 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003301592 |
Protein GI | 269128222 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00621584 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGCGCAA AGCGCTCGCT GGCGCTGGGC GCGGCGGCGC TGATGCTGTC GCTGGCCGGA TGCAGCGGAG CGGGAACGGG AAAGGCCGTG CCGGAGAACG CCGGGGAGGC GACCACCGCC GCGCCGAGCG CGGACCGGAC CGCCTGGGTC TTCGACGTCG TCGCGAAGAT GAGCCTGGAG GAGAAGGTCG GCCAGCTGTT CGTGCCCACC TTCGCCAGCC GGCAGGACGC CGAGCAGAAG ATCAAGAAAT ACCACGTCGG CGGGCTCATC TACTTCCCCG ACAACGCCCG CAGCCCACAG CAGACGGCCC GGCTGTCCAA CGCCCTGCAG CGGGCCTCCA AGATCCCGCT GCTGCTCGCC GTGGACGAAG AACAGGGCCT GGTCACCAGG CTGTCGTATG TGACCCGCTT CCCCGGCAAC ATGGCGCTGG GCGCCACCGC CAAACCGCAG ACCGCCCGCG AGGCCGCCAA GGTGATCGGC ACCGAGCTGC GGGCGGTGGG GATCAACCAG AACTACGCCC CGGTCGCCGA CGTCAACGTC AACCCGGCCA ACCCGGTGAT CGGCGTCCGC TCCTTCGGCT CGGACCCGGG ACTGGTGTCG CTGATGCTGG GCGGCGCCCT GCAGGGGTAC CGGGACGCGG GCGTGGCCGC CACCGTCAAG CACTTCCCCG GCCACGGCGA CACCGACACC GACAGCCACA CCGGGCTGCC GGTGATCAGG CACTCGCGGG CCGAGTGGGA GCGACTGGAC GCGCCGCCGT TCCGGGCCGC CATCGCCCAG GGCGTGGACG CCATCATGAC CGCGCACATC GTGATGCCCG GGCTGGACGA CTCCGGCGAC CCGGCGACCT TGTCGCAGGC GGTGCTGACC GGGCTGCTGC GCAAGGAGCT GGGCTACCGG GGCGTGGTGG TCACCGACTC CCTGAGCATG GCGGGCGCCC GCACCCGGTA CGGAGCCGAG CAGGCCGCGG TGCGGGCCGT GCAGGCGGGC GCCGACCAGC TGCTCATGCC GCCGGATCTG GCGGGCGCGC ACGCCGCGGT GCTGGCGGCG GTGCGCGACG GCCGGATCTC CCAGCGGCGG CTGGAGGAGT CGGTGACCCG CATCCTGCGC CTGAAGGCCG AGCGCGGGCT GTTCGGCGAC GTCCAGGTCG ATCCCGCCCG GGCCGGGCAG GTCATCGGCT CGGCCGCGCA CCGGGCGGTG GCCCGCCGGG TCGCCGAGCA GTCGATCACG CTGGTGCGCA ACCGGGGCGG CCTGCTGCCG CTGGCCGGCA GGCGGGTGCA CGTCACCGGC CCGCACGCCC AGGCCCTGGC GGCGGCGCTG CGCAGGCGGG GAGTGGAGAC GGCCGCCTCC CCGGCCGCGG CCGATGTGAC GGTGCTGACC GCGGTGGACG GCGGATCGGG CGTCGCCGCT CAGGTCGCCG CGCTGGCGGG CCGGCCGCTG GTGTTCGCGG CGCTGGGCAG CCCTTATGAC CTGGCGTATG CGACCCGGGC CCAGGCGGCG CTGGCCGCCT ACTCCTCGAG CGCGCCGTCC CTGGAGGCGC TGGCGGAGGT GATGACCGGC CGGGTGAAGC CCACCGGCAG GCTCCCGGTG GAGGTGCGCG GCTACCGATT CGGGCACGGC CTGACCTTCT GA
|
Protein sequence | MRAKRSLALG AAALMLSLAG CSGAGTGKAV PENAGEATTA APSADRTAWV FDVVAKMSLE EKVGQLFVPT FASRQDAEQK IKKYHVGGLI YFPDNARSPQ QTARLSNALQ RASKIPLLLA VDEEQGLVTR LSYVTRFPGN MALGATAKPQ TAREAAKVIG TELRAVGINQ NYAPVADVNV NPANPVIGVR SFGSDPGLVS LMLGGALQGY RDAGVAATVK HFPGHGDTDT DSHTGLPVIR HSRAEWERLD APPFRAAIAQ GVDAIMTAHI VMPGLDDSGD PATLSQAVLT GLLRKELGYR GVVVTDSLSM AGARTRYGAE QAAVRAVQAG ADQLLMPPDL AGAHAAVLAA VRDGRISQRR LEESVTRILR LKAERGLFGD VQVDPARAGQ VIGSAAHRAV ARRVAEQSIT LVRNRGGLLP LAGRRVHVTG PHAQALAAAL RRRGVETAAS PAAADVTVLT AVDGGSGVAA QVAALAGRPL VFAALGSPYD LAYATRAQAA LAAYSSSAPS LEALAEVMTG RVKPTGRLPV EVRGYRFGHG LTF
|
| |