Gene Tcur_1053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTcur_1053 
Symbol 
ID8602363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermomonospora curvata DSM 43183 
KingdomBacteria 
Replicon accessionNC_013510 
Strand
Start bp1202047 
End bp1204014 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content73% 
IMG OID 
Productvon Willebrand factor type A 
Protein accessionYP_003298677 
Protein GI269125307 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.463528 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCGAT ACCGATACGG CGCCTACCAA GGCGGCCCGG ACCCACTGGA GCCGCCCTAT 
GACATCCGCT CGGCGCTGGA CGCCATGGGC GATTCGGTGC TGGAGGGCTC CAGCCCCGGC
GAGGCGCTGC GCGCGCTGCT GCGCCGCGGA CTGCCGGGCG GCCGGGACCG GCGGGCCCGG
CGCGGCCTGG ATGAGCTGCT GCGGCAGGTG CGGCAGCGGC GGCGGGAGCT GCGCGAGCGG
GGACGGCTGG ACGGCGTCCT GGAGCAGGCG CGGGCGCTGC TGGACACCGC CATCGGGCAG
GAACGGGCCG AGCTGTTCCC CGACCCCAGC GACGAGGCGC GCATGCGCGA GGCCGAACTG
GACGCGCTGC CGTCCGACAC CGCCCAGGCG ATCCGGCGGC TGAGCGACTA CGACTGGCGA
TCGCCGGCCG CCCGCGCCAC CTTCGAGCAG CTGAAGGACC TGCTGCGGCG CGACGTGCTG
GACGCCCAGT TCCAGGGCAT GCGGCAGGCG CTGGCCAACC CCGACCCGCA GGCCATGCAG
CGGGTCCGGG ACATGATGGC CGCGCTGAAC GACATGCTCG ACGCCGACGC CCGCGGCGAG
CACACCCAGG AGGACTTTGC GAACTTCATG CGCGAGTACG GGGACTTCTT CCCCGACAAC
CCGCGCAACC TGGAGGAGCT GGTCGACTCG CTGGCCCGGC GGGCGGCGGC GATGGACCGG
CTGCTGGCCT CGCTGAGCCC CGAGCAGCGC CAGGAGCTGG CCGCGCTGAT GGCGCAGGTG
ATGGAGGACG CCGGGCTGGC GATGGAGATG ACCCGGCTGG GCGAGGCGCT GCGGGCCCGC
CGCCCCGACC TGGGCTGGGG GACCCCGGAG CGGATGAGCG GCTCGGACCC GCTGAGCGTC
AGCGACGCCA CCGCCGCGCT GGCGGAGCTG GCGGACCTGG CCGAGCTGGA GGCTGCGCTG
GCGCAGGACT ATCCGGGCGC CAGCCTGGAC GACATCGACG AGGAGGCGGT GCGCCGGGCG
CTGGGCCGCC AGGCGGTCGA CGACCTGGCC GAGCTGCGCC GCATCGAAAA AGAGCTGGAA
CGCCAGGGCT ACCTGCAGCG CAGTGGCGGC CGGCTGGAGC TGACCCCCAA GGCGGTGCGC
CGGCTGGGCG AGACCGCGCT GCGCCGGGTG TTCTCCCACC TGGAGGGGGG CCGGCGCGGC
GACCACGACC AGCGCGACGC CGGGCAGGCC GGGGAGCTGA CCGGCTCCTC ACGTCCCTGG
CGGTTCGGCG ACGAGCAGCC GCTGGATGTG GTCCGCACGG TCGGCAACGC CATCCGGCGC
AACGCCCAGA ACCCCACCGG CGACCGGTCG GTCAAGCTCA GCGTGGACGA TTTCGAGGTG
CTGGAGACCG AGCGGCGCAC CGCGGCGGCG GTGTGCCTGC TGGTGGACCT GTCGTACTCG
ATGGTGCTGC GCGGCGCGTG GGGGGCGGCC AAGCAGACGG CGCTGGCGCT GCACTCGCTG
GTCACCGGCA AGTACCCGCA GGACGCCATC CAGATCATCG GGTTCTCCAA TTACGCCCGG
GTGCTGCGTC CCACCGAGAT GGCCGCGCTG GACTGGGACA TGGTGCAGGG CACCAATCTG
CACCACGCGC TGATGCTGGC CGGACGGCAC CTGGACCGGC ACCCGGACTT CGAGCCGATC
GTGCTGGTGG TCACCGACGG CGAGCCCACC GCCCACCTGC AGCCCAACGG CCGTTCGCTG
TTCGACTACC CGCCCTCCCG CCAGACGCTG ACGCTGACGC TGGCCGAGAT CGACAAGATG
ACCCGGCGCG GCGCCACCTT GAACGTGTTC ATGCTGGCCG ACGACCCCCG GCTGGTGTCG
TTCGTGGAGG AGGTCGCCCG GCGCAACGGA GGCCGGGTGT TCGCCCCCGA GGCCGGCCGG
CTCGGCGAGT ACGTGGTCAG CGACTACCTG CGGATGCGCC GGGGATGA
 
Protein sequence
MSRYRYGAYQ GGPDPLEPPY DIRSALDAMG DSVLEGSSPG EALRALLRRG LPGGRDRRAR 
RGLDELLRQV RQRRRELRER GRLDGVLEQA RALLDTAIGQ ERAELFPDPS DEARMREAEL
DALPSDTAQA IRRLSDYDWR SPAARATFEQ LKDLLRRDVL DAQFQGMRQA LANPDPQAMQ
RVRDMMAALN DMLDADARGE HTQEDFANFM REYGDFFPDN PRNLEELVDS LARRAAAMDR
LLASLSPEQR QELAALMAQV MEDAGLAMEM TRLGEALRAR RPDLGWGTPE RMSGSDPLSV
SDATAALAEL ADLAELEAAL AQDYPGASLD DIDEEAVRRA LGRQAVDDLA ELRRIEKELE
RQGYLQRSGG RLELTPKAVR RLGETALRRV FSHLEGGRRG DHDQRDAGQA GELTGSSRPW
RFGDEQPLDV VRTVGNAIRR NAQNPTGDRS VKLSVDDFEV LETERRTAAA VCLLVDLSYS
MVLRGAWGAA KQTALALHSL VTGKYPQDAI QIIGFSNYAR VLRPTEMAAL DWDMVQGTNL
HHALMLAGRH LDRHPDFEPI VLVVTDGEPT AHLQPNGRSL FDYPPSRQTL TLTLAEIDKM
TRRGATLNVF MLADDPRLVS FVEEVARRNG GRVFAPEAGR LGEYVVSDYL RMRRG