Gene Tcur_2023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTcur_2023 
Symbol 
ID8603350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermomonospora curvata DSM 43183 
KingdomBacteria 
Replicon accessionNC_013510 
Strand
Start bp2385919 
End bp2387679 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content75% 
IMG OID 
Productvon Willebrand factor type A 
Protein accessionYP_003299628 
Protein GI269126258 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value9.27527e-06 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGCGGAA CCGGCTCTGG TGACATCTTC GGGCGCGCGG CCGCGCCGCC CCGGCGCCGC 
ATCCCACGGC CGGGCCTGCG CACCGGTGTG GCGCTCCTCG CCGCCGCGCT GGCGGTGGCC
ATCGGCTGCC GCACGGTGCT GAACGCGGCC GGCTGCCGCG GCGAGGGCGC GGGCGGGCCG
CTGACCGTCG CCGCTGCCCC CGACATCGCT CCCGCCCTCA CCAGGGCGAT CGACCGCTTC
AACGAGGCCC AGGACACCTG CGTGCACGCC CTCGTGCGCC CCGTCGACCC GGCCGCGATA
GCGGTGCTGC TGTCCGGCCA GGGCGCCTCC GGCTCCCTGC AGCGGCCGGA TGTGTGGATC
CCCGACTCCT CGCTGTGGAC GTCCCTGGTG AACACCTCAC CGGAGCGGAT CGGCCCGCTG
CAGGTCAGCC ACGGCCCGCT GGCCTACAGC CCGGTCGTGC TGGGCCTGCC GCAGGGGCTG
GTGGACGAGC TGAGAAACCG CGGCGTCACC GCCGACCCCT CCTGGAACCT GCTGCTCGGC
GCCGTCCCCG GCGTCCCGGG CGGCGCCTCG GCGGTGCCGC CCGGCCTGGT GCGCCCGCAG
GTGCCCGACC CCACCCGCAG CGCCACCGGG ATGAGCGCCC TGGTCGTGGC CGACCGGCTG
CTGACCGGCC GGTCCGGGCG GCAGGAGATC TTCACCGCGC TGGTGCGCGC CGTGCGCGAG
AACACCGTTC CCTCGGTGGA GGCGGAGTTC CGGTCGCTGG ACGGCCGGGA GCGCAAGCGC
CACCCCGTGC TGCTGGTCCC CGAACAGGCG ATCTTCTCCC ACAACGGCAA CCGCCCGGCC
GAACGGATCA TCGCCCTCTA CCCGCGCGAG GGCACGCTGT CGCTGGACTA CCCGTTCGCC
ATGACCACCG GCGAGGCCGC CCGGCTGGAG GCGGCCCGGG CGCTGGAGCG GGCACTGCGC
AGCCCGGTCG CCGCCGCCGA GCTGCGCCGG GCCGGGTTCC GCGGCCCCGA TGGCCGCAAC
GTCCCGCACT TCGGCCCGTT CACCGGGGTC AGGCTGGACC CGCCCCGCCG GCTGCCCGCT
CCGCCGCCGC AGGCCGTGCG GGATCTGATG CAGACCTGGT CCAAGCTGAC CTTGAGCACG
CGGATGCTGG TGCTGTTCGA CGTCTCCGGC TCCATGCGGC GGCGGGTCGC CCCCGGCCTG
AGCCGGCTGC AGGCCACCGC CCGCGTCGCC CAAAGCGGGC TGCCGCTGCT GCCCGACGAC
AGTGAGCTGG GCATCTGGCT GTTCTCCACC GACCTGGAGG GCGGGCGGGA CTGGCGCGAG
GTGGTCCCGG TGGGGCCGCT GGGCGAGCGG GTCGGCTCGG TCACCCGCCG CCAGCTGATC
TTGTCGGAGC TGGGCCGCAT CCGGGCCGAG CGCAAGGGGC GCACCGGGCT GCACGAGTCG
GTGCTGGCGG CCGTCCGCCG GATGCGGGAG GGCTACAAAC CCGAGATGGT CAACACCGTG
CTGGTCTTCA CCGACGGCCG CAACCAGGAC GCCGACGGCC CCACCCTGGC GCAGACCGTG
GCGGCGTTGC GCCGCGAGCA CGACCCCAAC CGCCCGGTCC AGCTCATCAT CCAGGGCTAC
GGCCCCGACG TGTCGGTCCC CGAGCTGCGC GCCCTCACCG AGGCCACCGG CGGCCTGGTG
CAGATCGCCC GCACTCCCGA GGACGCCGGC AGGCTCTTGC TGCAGGCCAT GTCCCGCCGC
ATCTGCTCAC CCGAGTGCTG A
 
Protein sequence
MGGTGSGDIF GRAAAPPRRR IPRPGLRTGV ALLAAALAVA IGCRTVLNAA GCRGEGAGGP 
LTVAAAPDIA PALTRAIDRF NEAQDTCVHA LVRPVDPAAI AVLLSGQGAS GSLQRPDVWI
PDSSLWTSLV NTSPERIGPL QVSHGPLAYS PVVLGLPQGL VDELRNRGVT ADPSWNLLLG
AVPGVPGGAS AVPPGLVRPQ VPDPTRSATG MSALVVADRL LTGRSGRQEI FTALVRAVRE
NTVPSVEAEF RSLDGRERKR HPVLLVPEQA IFSHNGNRPA ERIIALYPRE GTLSLDYPFA
MTTGEAARLE AARALERALR SPVAAAELRR AGFRGPDGRN VPHFGPFTGV RLDPPRRLPA
PPPQAVRDLM QTWSKLTLST RMLVLFDVSG SMRRRVAPGL SRLQATARVA QSGLPLLPDD
SELGIWLFST DLEGGRDWRE VVPVGPLGER VGSVTRRQLI LSELGRIRAE RKGRTGLHES
VLAAVRRMRE GYKPEMVNTV LVFTDGRNQD ADGPTLAQTV AALRREHDPN RPVQLIIQGY
GPDVSVPELR ALTEATGGLV QIARTPEDAG RLLLQAMSRR ICSPEC