Gene Tcur_3572 details


Gene Information

Locus tag: Tcur_3572
Symbol:
ID: 8604923
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermomonospora curvata DSM 43183
Kingdom: Bacteria
Replicon accession: NC_013510
Strand:
Start bp: 4102902
End bp: 4105850
Gene Length: 2949 bp
Protein Length: 982 aa
Translation table: 11
GC content: 78%
IMG OID:
Product: transglutaminase domain protein
Protein accession: YP_003301145
Protein GI: 269127775
COG category:
COG ID:
TIGRFAM ID:
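
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4102902
end_bp = 4105850

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A coding sequence is read in codons of three bases.
codon_count, leftover = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codon_count - 1

print(gene_length, leftover, protein_length)
```

Under these assumptions the script prints `2949 0 982`, matching the Gene Length and Protein Length fields.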


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0384986
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAGCGC TGATCTGGCT GAGCTGCGCC GGCGCATTGG CCCTGGCCGC CGGGGCGCTG 
CCGTCGCTGG CGCTGTTCGC GCTGGCCGTG GGAATGGTGA TCATGGTGCT GTGGGCGGCC
GTCACGGTGC TGGCGGCGCG CTCGCGCCTG AACGCCACCC GGCTGATCGC CGAACGGGAG
ATCGGTGAGG ACCGGCCGCT GCGGATGCGC CTGGACCTCG GCCTGCCCGA GCGGCTGCCG
GTCCGCGTCC AGGTGCGCAT CACCCGGCGC GCCTGGGCGG AGCTGGAGCC CGCCGGCGGC
GTCGTGGAGG TGCCCATCGG CCGGCGCGGC GCCTATCGCA TCCAGCCCAC CCGGCTGCGG
ATCAGCGACG CGCTGGGCAT CTTCCGCCTC ACCGTCGAGG TCGGCGAGCG CGAGAAGGTG
CTGGTCCTGC CCGTCCCCGA CGGCGGCGGG CAGCCCGCGC CCGCGCCCGG CCGGGCCGTT
GAGCACACCG AGCCCGACGG GCTGCGGCCG TATGTGCCGG GCACCCCGGT CAGCCGCATC
CACTGGCTGT CCCTGGCCCG CGGCATGGAG CTGCACGAGC GGCGCATGGG GCCCGCCGCC
GTCCGGGCTG CCGCTGGTGG TCGTCGACAC CCACGGCGCC GGCGAGGCGG AGGTGGACTG
GGTGGCCCGG GCCGCCGCCG GGCTGGTGCG CACCCTGTCC AGGTCCGGCG GGTGCGCGGT
GCTGCTGCCC GGGGAGAGCG CCGCGACGCC GGTCGAGGAC GAAGAGTCCT GGCGGGCGCT
GCATCGCCGC CTGGCCCTGC TGGAGGCCGG CGACCGGCCC GCCCGTCCGC CGGCTGGCGC
CATCGTCGTC CGCTTCCCGC CGGACCGGAC GGTGGATCCG CCGCCGCCGC TGCCGCCGGG
CGTGGTGCCG CTGCCCGCCG GGAAGGGACG GCCGCGATGA TCCGGGCGGC CGGACTGCTG
GCGGCCGCCG CGGCGGCGAT GGGGGCCTGG TCGGCGCTGC TGGGCACGGC GGTGCTGTGG
GGCTGCCTGG TCGCCGCCGT CGCCGCCGCC GTGGTGGCCT GGGGGCGCAC CGCCACCCGG
GCGGTGGGCG CGCTGCTGCT GCTCTGGCCG CCGATGGCGC TGCTGGCCGG CGGCGTCCAG
CCGGACATGC TGTGGCCCCG CCGCTGGCCG CAGCTGCTGC TGTCGCTGTC GGACGGGCTG
CAGCGGCTGT CGTCGCTGGG GCCCGGCCGC GCCGCGGGCG ACCCGTGGCC GGCGGCGGTG
TGGCTGCTGC TGGTCGGGCT GCTGTGGCTG TCGGCGGCCG GCCTGTCGGT CACCGCGCCG
CGCTCGCGGC CCCGGCAGGC GCTGTCGCTG GCGCTGGTGG CGCTGCCATG GGTGATCGCG
GTGGCGCTGC GGCAAAGCGA CGAGGTCTCC TGGCAGGGGG CGGCGATGGT GCTGGCCGTG
CCGCTGTGGC TGGCGCCGCC CGGCGGCCGG GCCGCCCGCC CGATGGTCGC GCTCGGCACG
GCCGCCGCGC TGGGCGCGGC CGCGCTGGCG CACACCCTGG GACCGCGCGG CCAGTGGCTG
GCCATCGACG ACCTGATCGA TCGGGAACCG CAGTTCACCA CGCTGGACAC CACCCAGACC
TACGGGCCGC TGTATGGGCG CCGCACCGGC GCGGCCATGC TGGAGATCTC TTCGCCCCGC
CCGGCGCTGT GGCGGATGCA GGTGCTGGAG CGGATCGGCT GGCGCGGCTG GGAGACCGGG
GGCCTGCCGG ATGAGGACCT GCCCGAGCCG GCCGCCCGTC CCGTCGAGAT CGAGGTGCGG
GTCCGGGGCC TGCGCAACGA CATGGTGGTC TCCCCGGGCC GGATCATCAG CCTGCAGGCC
GACGGGCGGG TCGACTCGGG GGCGGGGGAG TCCTGGCGGA TCACCCCGGC GCCGGACGCC
GGGGACGTCT ACCGGGTGCG GGCCGCGGTG GTCACCGCCG ACCCCGCTAC CTTGCGCACC
GTGCCCTGGC CGAACTCCGA CCCCCGGCTG GAGGAGTACA CCCGCGTCCG GGAACGCCGG
CCCGGCGGGA TGTGGATGGG CGGACGGCAG GGCGAGTTCG GCCCCTATGC CGATCCCGGC
TTCGGAGGGC ACGGCTACCT GGGGTACTCG CTGTACGACG AGGTGGCCGT GATGGCGAGG
GCGGTGACGG CCGGGGCCCG CAACCAGTTC GAGGTCGTCG AGCGGGTGCA GCGCTACCTG
ACCGAGGGCG GCCGGTTCCG CTACGACACC GACGTGGAGC GCACCTCCCG CGTCCCGCTG
GTGGACTTCC TGCTGCGCAC CCGCACCGGC TACTGCCAGC ACTTCGCCGG GGCGGCGGCG
CTGCTGCTGC GGCTGGCCGG GGTGCCCGCC CGGGTGGTCG CCGGCTTCGC CACCGGGCTG
GAGCAAAACG GCCGGTACGT GGTGCGCGAC GCCGACGCCC ACGCCTGGAT CGAGGTGTAC
TTCTCCGGTG TCGGCTGGGT GCCGTTCAAC CCCACTCCGG CCGATGCCGA TGCCGTGGTG
GACCCCTCCC TGGACCCGTT CGCCCCGCCC GCCGCCGGCG GCGGGCGGCA GGGCCCGGTG
CTGCTGCCCG CCGTGCTGCT GGGGCTGGTG CCGGCCGTGG TGCTGCTGGT CACCGTCCGC
CGGCGCGGCC CGGGTGCCTC CGGTCTGCGC GGCGGGCGCG CCGACCGGCT GCTGGAGCGG
CTGGCCGGCC ATGGCGGCGA GCCGGTGACG CCCGGCACCA CCTGGGGCGG GCTGCGCGTC
CGCCTGGCCC GCCTGGGGCC GAACATCGCC GCCGTGGCCG CCGAGCTGGA ACGCGCCCGC
TACGCCCCCG GCCCGCGGGC GCCGGTCCGC CGTCTCGGCC TGCGCATCGT CCGGGCCCTG
GTCGCCGACC TGGGCCCGCT GCGGGCGGCC CGCGTGCTGG TGGCCGCCGT CGTCCGCACC
GGCCCGTGA
 
Protein sequence
MRALIWLSCA GALALAAGAL PSLALFALAV GMVIMVLWAA VTVLAARSRL NATRLIAERE 
IGEDRPLRMR LDLGLPERLP VRVQVRITRR AWAELEPAGG VVEVPIGRRG AYRIQPTRLR
ISDALGIFRL TVEVGEREKV LVLPVPDGGG QPAPAPGRAV EHTEPDGLRP YVPGTPVSRI
HWLSLARGME LHERRMGPAA VRAAAGGRRH PRRRRGGGGL GGPGRRRAGA HPVQVRRVRG
AAARGERRDA GRGRRVLAGA ASPPGPAGGR RPARPSAGWR HRRPLPAGPD GGSAAAAAAG
RGAAARREGT AAMIRAAGLL AAAAAAMGAW SALLGTAVLW GCLVAAVAAA VVAWGRTATR
AVGALLLLWP PMALLAGGVQ PDMLWPRRWP QLLLSLSDGL QRLSSLGPGR AAGDPWPAAV
WLLLVGLLWL SAAGLSVTAP RSRPRQALSL ALVALPWVIA VALRQSDEVS WQGAAMVLAV
PLWLAPPGGR AARPMVALGT AAALGAAALA HTLGPRGQWL AIDDLIDREP QFTTLDTTQT
YGPLYGRRTG AAMLEISSPR PALWRMQVLE RIGWRGWETG GLPDEDLPEP AARPVEIEVR
VRGLRNDMVV SPGRIISLQA DGRVDSGAGE SWRITPAPDA GDVYRVRAAV VTADPATLRT
VPWPNSDPRL EEYTRVRERR PGGMWMGGRQ GEFGPYADPG FGGHGYLGYS LYDEVAVMAR
AVTAGARNQF EVVERVQRYL TEGGRFRYDT DVERTSRVPL VDFLLRTRTG YCQHFAGAAA
LLLRLAGVPA RVVAGFATGL EQNGRYVVRD ADAHAWIEVY FSGVGWVPFN PTPADADAVV
DPSLDPFAPP AAGGGRQGPV LLPAVLLGLV PAVVLLVTVR RRGPGASGLR GGRADRLLER
LAGHGGEPVT PGTTWGGLRV RLARLGPNIA AVAAELERAR YAPGPRAPVR RLGLRIVRAL
VADLGPLRAA RVLVAAVVRT GP
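
The relationship between the two sequences can be illustrated with a minimal translation sketch. It is not the database's own method. It builds a codon lookup from the standard codon assignments, which translation table 11 shares with the standard code for elongation codons. Alternative start codons are not handled, which is harmless here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
from itertools import product

# Codons in TCAG order paired with the standard amino-acid assignments.
# Table 11 uses the same assignments; it differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGCGAGCGC TGATCTGGCT GAGCTGCGCC GGCGCATTGG CCCTGGCCGC CGGGGCGCTG"
print(translate(first_line))
```

For the first line of the gene sequence this prints `MRALIWLSCAGALALAAGAL`, the first 20 residues of the protein sequence. The final nine bases, `GGCCCGTGA`, translate to `GP` followed by a stop, which agrees with the end of the protein.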