Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_3572 |
Symbol | |
ID | 8604923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 4102902 |
End bp | 4105850 |
Gene Length | 2949 bp |
Protein Length | 982 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | transglutaminase domain protein |
Protein accession | YP_003301145 |
Protein GI | 269127775 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0384986 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAGCGC TGATCTGGCT GAGCTGCGCC GGCGCATTGG CCCTGGCCGC CGGGGCGCTG CCGTCGCTGG CGCTGTTCGC GCTGGCCGTG GGAATGGTGA TCATGGTGCT GTGGGCGGCC GTCACGGTGC TGGCGGCGCG CTCGCGCCTG AACGCCACCC GGCTGATCGC CGAACGGGAG ATCGGTGAGG ACCGGCCGCT GCGGATGCGC CTGGACCTCG GCCTGCCCGA GCGGCTGCCG GTCCGCGTCC AGGTGCGCAT CACCCGGCGC GCCTGGGCGG AGCTGGAGCC CGCCGGCGGC GTCGTGGAGG TGCCCATCGG CCGGCGCGGC GCCTATCGCA TCCAGCCCAC CCGGCTGCGG ATCAGCGACG CGCTGGGCAT CTTCCGCCTC ACCGTCGAGG TCGGCGAGCG CGAGAAGGTG CTGGTCCTGC CCGTCCCCGA CGGCGGCGGG CAGCCCGCGC CCGCGCCCGG CCGGGCCGTT GAGCACACCG AGCCCGACGG GCTGCGGCCG TATGTGCCGG GCACCCCGGT CAGCCGCATC CACTGGCTGT CCCTGGCCCG CGGCATGGAG CTGCACGAGC GGCGCATGGG GCCCGCCGCC GTCCGGGCTG CCGCTGGTGG TCGTCGACAC CCACGGCGCC GGCGAGGCGG AGGTGGACTG GGTGGCCCGG GCCGCCGCCG GGCTGGTGCG CACCCTGTCC AGGTCCGGCG GGTGCGCGGT GCTGCTGCCC GGGGAGAGCG CCGCGACGCC GGTCGAGGAC GAAGAGTCCT GGCGGGCGCT GCATCGCCGC CTGGCCCTGC TGGAGGCCGG CGACCGGCCC GCCCGTCCGC CGGCTGGCGC CATCGTCGTC CGCTTCCCGC CGGACCGGAC GGTGGATCCG CCGCCGCCGC TGCCGCCGGG CGTGGTGCCG CTGCCCGCCG GGAAGGGACG GCCGCGATGA TCCGGGCGGC CGGACTGCTG GCGGCCGCCG CGGCGGCGAT GGGGGCCTGG TCGGCGCTGC TGGGCACGGC GGTGCTGTGG GGCTGCCTGG TCGCCGCCGT CGCCGCCGCC GTGGTGGCCT GGGGGCGCAC CGCCACCCGG GCGGTGGGCG CGCTGCTGCT GCTCTGGCCG CCGATGGCGC TGCTGGCCGG CGGCGTCCAG CCGGACATGC TGTGGCCCCG CCGCTGGCCG CAGCTGCTGC TGTCGCTGTC GGACGGGCTG CAGCGGCTGT CGTCGCTGGG GCCCGGCCGC GCCGCGGGCG ACCCGTGGCC GGCGGCGGTG TGGCTGCTGC TGGTCGGGCT GCTGTGGCTG TCGGCGGCCG GCCTGTCGGT CACCGCGCCG CGCTCGCGGC CCCGGCAGGC GCTGTCGCTG GCGCTGGTGG CGCTGCCATG GGTGATCGCG GTGGCGCTGC GGCAAAGCGA CGAGGTCTCC TGGCAGGGGG CGGCGATGGT GCTGGCCGTG CCGCTGTGGC TGGCGCCGCC CGGCGGCCGG GCCGCCCGCC CGATGGTCGC GCTCGGCACG GCCGCCGCGC TGGGCGCGGC CGCGCTGGCG CACACCCTGG GACCGCGCGG CCAGTGGCTG GCCATCGACG ACCTGATCGA TCGGGAACCG CAGTTCACCA CGCTGGACAC CACCCAGACC TACGGGCCGC TGTATGGGCG CCGCACCGGC GCGGCCATGC TGGAGATCTC TTCGCCCCGC CCGGCGCTGT GGCGGATGCA GGTGCTGGAG CGGATCGGCT GGCGCGGCTG GGAGACCGGG GGCCTGCCGG ATGAGGACCT GCCCGAGCCG GCCGCCCGTC CCGTCGAGAT CGAGGTGCGG GTCCGGGGCC TGCGCAACGA CATGGTGGTC TCCCCGGGCC GGATCATCAG CCTGCAGGCC GACGGGCGGG TCGACTCGGG GGCGGGGGAG TCCTGGCGGA TCACCCCGGC GCCGGACGCC GGGGACGTCT ACCGGGTGCG GGCCGCGGTG GTCACCGCCG ACCCCGCTAC CTTGCGCACC GTGCCCTGGC CGAACTCCGA CCCCCGGCTG GAGGAGTACA CCCGCGTCCG GGAACGCCGG CCCGGCGGGA TGTGGATGGG CGGACGGCAG GGCGAGTTCG GCCCCTATGC CGATCCCGGC TTCGGAGGGC ACGGCTACCT GGGGTACTCG CTGTACGACG AGGTGGCCGT GATGGCGAGG GCGGTGACGG CCGGGGCCCG CAACCAGTTC GAGGTCGTCG AGCGGGTGCA GCGCTACCTG ACCGAGGGCG GCCGGTTCCG CTACGACACC GACGTGGAGC GCACCTCCCG CGTCCCGCTG GTGGACTTCC TGCTGCGCAC CCGCACCGGC TACTGCCAGC ACTTCGCCGG GGCGGCGGCG CTGCTGCTGC GGCTGGCCGG GGTGCCCGCC CGGGTGGTCG CCGGCTTCGC CACCGGGCTG GAGCAAAACG GCCGGTACGT GGTGCGCGAC GCCGACGCCC ACGCCTGGAT CGAGGTGTAC TTCTCCGGTG TCGGCTGGGT GCCGTTCAAC CCCACTCCGG CCGATGCCGA TGCCGTGGTG GACCCCTCCC TGGACCCGTT CGCCCCGCCC GCCGCCGGCG GCGGGCGGCA GGGCCCGGTG CTGCTGCCCG CCGTGCTGCT GGGGCTGGTG CCGGCCGTGG TGCTGCTGGT CACCGTCCGC CGGCGCGGCC CGGGTGCCTC CGGTCTGCGC GGCGGGCGCG CCGACCGGCT GCTGGAGCGG CTGGCCGGCC ATGGCGGCGA GCCGGTGACG CCCGGCACCA CCTGGGGCGG GCTGCGCGTC CGCCTGGCCC GCCTGGGGCC GAACATCGCC GCCGTGGCCG CCGAGCTGGA ACGCGCCCGC TACGCCCCCG GCCCGCGGGC GCCGGTCCGC CGTCTCGGCC TGCGCATCGT CCGGGCCCTG GTCGCCGACC TGGGCCCGCT GCGGGCGGCC CGCGTGCTGG TGGCCGCCGT CGTCCGCACC GGCCCGTGA
|
Protein sequence | MRALIWLSCA GALALAAGAL PSLALFALAV GMVIMVLWAA VTVLAARSRL NATRLIAERE IGEDRPLRMR LDLGLPERLP VRVQVRITRR AWAELEPAGG VVEVPIGRRG AYRIQPTRLR ISDALGIFRL TVEVGEREKV LVLPVPDGGG QPAPAPGRAV EHTEPDGLRP YVPGTPVSRI HWLSLARGME LHERRMGPAA VRAAAGGRRH PRRRRGGGGL GGPGRRRAGA HPVQVRRVRG AAARGERRDA GRGRRVLAGA ASPPGPAGGR RPARPSAGWR HRRPLPAGPD GGSAAAAAAG RGAAARREGT AAMIRAAGLL AAAAAAMGAW SALLGTAVLW GCLVAAVAAA VVAWGRTATR AVGALLLLWP PMALLAGGVQ PDMLWPRRWP QLLLSLSDGL QRLSSLGPGR AAGDPWPAAV WLLLVGLLWL SAAGLSVTAP RSRPRQALSL ALVALPWVIA VALRQSDEVS WQGAAMVLAV PLWLAPPGGR AARPMVALGT AAALGAAALA HTLGPRGQWL AIDDLIDREP QFTTLDTTQT YGPLYGRRTG AAMLEISSPR PALWRMQVLE RIGWRGWETG GLPDEDLPEP AARPVEIEVR VRGLRNDMVV SPGRIISLQA DGRVDSGAGE SWRITPAPDA GDVYRVRAAV VTADPATLRT VPWPNSDPRL EEYTRVRERR PGGMWMGGRQ GEFGPYADPG FGGHGYLGYS LYDEVAVMAR AVTAGARNQF EVVERVQRYL TEGGRFRYDT DVERTSRVPL VDFLLRTRTG YCQHFAGAAA LLLRLAGVPA RVVAGFATGL EQNGRYVVRD ADAHAWIEVY FSGVGWVPFN PTPADADAVV DPSLDPFAPP AAGGGRQGPV LLPAVLLGLV PAVVLLVTVR RRGPGASGLR GGRADRLLER LAGHGGEPVT PGTTWGGLRV RLARLGPNIA AVAAELERAR YAPGPRAPVR RLGLRIVRAL VADLGPLRAA RVLVAAVVRT GP
|
| |