Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_1364 |
Symbol | |
ID | 8602677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 1551666 |
End bp | 1553522 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | glycoside hydrolase family 5 |
Protein accession | YP_003298984 |
Protein GI | 269125614 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00194405 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCTTC TGCTCGCCGC GTCCCTGCTC GGCGCGCCGC CGCCGGCGAA CGCGGCCGCG CCGCCCGCCG CCCAGCGGCA GGCCCTGTCG GTCGCCCCCA AGGCGGGGGA GTGGGTGTTC CGCGACCGGA CGGGCCGTGA GGTGGTGCTG CGGGGCTTCA ACGTCTCCGG CAGCGCCAAG CTGCGGGAGA CCGGCCTGCT GCCGTTCCGC GACGCGGCGG ACGCGGCCCG CTCGGCGCAG GCCATGCGGG ACCTCACCGG CGCCAACGCG ATCCGCTTCC TGATCACCTG GGAGGGCGTG CAGCCGGCCC CCGGCCGCAT CGATCACGCC TACCTGGACC GGGCGGCCGA GCAGATCCGC GCTTTCATCT CCCGCGGCTT TTACGTCCTG CTCGACTATC ACCAGGACCT GTACTCGGCG CACCTGTTCC ACGCCGGCAG CTGGTACACC GGGGACGGCG CCCCCAGGTG GGCGGTCGAG GGCGGCGGCT ATCCCAAGGA GTTCTGCGGC CTGTGCATCG TCTGGGGCCA GAACATGCTG ACCAACCAGG CGGTGCGCCG GGCCGCCCGC GACTTTTGGC ACAACCGGGC GATGCCGACC GCCGTGGGGC CGGTCCGCGT CCAGGACGCC TTCGTCGAGC AGGCGACCGC GGCCATGGCC CACCTCAAGC GGAGGCTGAC CGCCGGGGAG TTCCGATCGA TCCTCGGCCT GGACCCCTTC AACGAGCCCT TCGACGGCGG CCTGGACGAC GTCCGCGGCG CCGACTGGGA ACGGGAGTAC CTGCTGCCGT TCTACCGGCG GATGCGCGCC GCGATGGACG CGGCCGGCTG GGACGCCAAG CCCATCTTCG TGGAGCCGCT GCCGTTCTGG AACGTCGGCT TCTTCGAACA GGGCGGCCTG TCGGCGGTCG GGAAACTGGG CGCCCGCTAC GTGTTCAACA GCCACTTCTA CGACGGCGCC CGCATGACCA TCGACCCCCG CCCCGCCGGG GACGGCGCTT ATGCCGCGGC CATGAACGAG ATCCGCGACC GGGCGCGCAC CCTGGCCACC GCCCCGTTCC TTTCTGAGTT CGGCAACCGC ATGTCCGGGT TCGGCAGCGA CCGCACCCCG TGGATGGTGC GCGCGATGTA CCAGGGGGCG GACCACGGCG TCCGCGGCGC CGACTGGTGG CGGCGGGCCG CCTCCGGCGG CACCGTGCTG TCGGCCACCC ACTGGCACTG GGACATCTAC AGCGGACGCC ACCACGAGCC GATGAACGGC AACCCCCGCA AGGTCCGAAC CGAAGGCGAC GCCTGGAACG ATGAGGACTT CTCGGTGGTG CGCACCGACG AGACCGGCCG GGTGAGGCTC CGGCTCGACC GGCGCGTCCT GGACCGCCTC TATCCGAGCG CGGTGGCCGG CGACATCCTG GCGTTCGCCT ACGAGGACCT GGCCCGCTCC GGCTTCGCCG GGCGGGGGAG CCAGGCCGCA TGGCTGGCGG CCCCGGCCGC GATGCCCAAC GTGGCGGCCC TGGTCCACGG CCGCCAGTTC GGCGTGCTGG TCTGGCGCGC CCCCGCCGCC GCGCCGCAGG CGCCGACCGA ACTGCACCTG CCCGGCTCGT TCACCCCCGG GCGCACGGTC GTCGTCGGCG ATCTGACCGC CCGCCGCGGC CTGTCGTCCT CCGGCCCGGT CCGCATCGCT CCCGAGCCGG GTTCCTCCTC CGCCCGCCGC CTGCTGGTCG ACCACGGCGG CGGCGGTGCG GTCCACGTGA TGCTGGTCGT CAACGCCGCG ACCGGCCCGC CCGTCACCGC CGCCCAGTTG GCCGCCGCCC GCGCCGAACT GACCGCTTGG GCGGCCCGGT ACTTTCCGGC GCGCTGA
|
Protein sequence | MPLLLAASLL GAPPPANAAA PPAAQRQALS VAPKAGEWVF RDRTGREVVL RGFNVSGSAK LRETGLLPFR DAADAARSAQ AMRDLTGANA IRFLITWEGV QPAPGRIDHA YLDRAAEQIR AFISRGFYVL LDYHQDLYSA HLFHAGSWYT GDGAPRWAVE GGGYPKEFCG LCIVWGQNML TNQAVRRAAR DFWHNRAMPT AVGPVRVQDA FVEQATAAMA HLKRRLTAGE FRSILGLDPF NEPFDGGLDD VRGADWEREY LLPFYRRMRA AMDAAGWDAK PIFVEPLPFW NVGFFEQGGL SAVGKLGARY VFNSHFYDGA RMTIDPRPAG DGAYAAAMNE IRDRARTLAT APFLSEFGNR MSGFGSDRTP WMVRAMYQGA DHGVRGADWW RRAASGGTVL SATHWHWDIY SGRHHEPMNG NPRKVRTEGD AWNDEDFSVV RTDETGRVRL RLDRRVLDRL YPSAVAGDIL AFAYEDLARS GFAGRGSQAA WLAAPAAMPN VAALVHGRQF GVLVWRAPAA APQAPTELHL PGSFTPGRTV VVGDLTARRG LSSSGPVRIA PEPGSSSARR LLVDHGGGGA VHVMLVVNAA TGPPVTAAQL AAARAELTAW AARYFPAR
|
| |