Gene Tcur_3539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTcur_3539 
Symbol 
ID8604890 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermomonospora curvata DSM 43183 
KingdomBacteria 
Replicon accessionNC_013510 
Strand
Start bp4060432 
End bp4061766 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content78% 
IMG OID 
Productcellulose-binding family II 
Protein accessionYP_003301112 
Protein GI269127742 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0054545 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACGGAGT TGAGCACCGA CGAACCGGGC TACGTCCCGC CCGACCACGA GACGACCGTG 
GAGATCCCGC TGCCCGGCAA GCCGCCGGCC GGACATCCCG GCCAGGCCGA GGACACCGTG
GCCGACCACG GGCTCCCCGA TGGCCGGGAC GAGGCGGAGC AGACCCTCGC AGACCCGCCC
GGCCCGCGGG CCGGCGGCGC CGCGCCGGAT GCGACGGTGC GGGATCCGCA GGACCCCTGG
GCGCCTGCGC GCCATGACCT CCCAGGCGAT TCCGGGGAAC GCAATGCGGA GGCCGGACAG
GGAGAGTGGA CCGAGCTGTT CGGCAGCGAG GACGCCCGCC GGGAGGCGGC TGCGCCCCAG
CCCTCGGAAC GGCCCGCGGA ACCGGCCGCC ACGGCCTCAC TCAGCCCGCT CGCCGCGGCG
GCCGTGCCTC CCGGTGCCAC CGTGCCCGAC CGCGCGGACG CCTCCCGCCC GGAGCCGCAG
CCGGATGCAC CGGATTCCAC CGGTGCCCGG CCGTTGCCGC CGCCCATCGC CTCCCATGAA
CAGGCTTCCC CCGCGCCGGA CGCGCAGGCC CCGGCGACCG ATACGGCCCC CGCCGTGCTT
CCGCTTCCCG ACCGCCCCGT CGCCTCGGCG AGGCCGGTGA ACGCCCCCGG CGCGCAGCGG
CCGCCCGCGG GCGGCCGGCG CGGCTCACGG GCACCGCTGG CCGTGGCGGC GGCCCTGGTG
CTGCTCTTCG CCGTGGTCGC CGGGGTGTCG GCGCTCACCC TGATGCGCGG CGGGAAGGAC
GGCGAGGCGA CCACCGCCAA GCCCCCGGCC GGCGGCGCCT CCAGCGCGCC CGGTGAGGAC
GGGTCCGGAG GGACCGGCGC GCCGGCGGGA GAGGCGCCAC CGGGAGCGGG CGGCTCCGGC
GTCCCCGCAC CGGCGCAGGA CGCCCCGGCG CCCGGGGCGT CACCGCAGGA CGGCGCGCCG
GCCCCCGGGC GGGCGCCGGC CGACCCCACG CCGCCGCCTC GCGACCCCAT CGGCCCGGTG
CTGCGCGGCA AAGGGCTGAC CTACCAGCTC GTCCAGCACG ATCCCGGCTA CTACGAGGGA
CTGCTGATCA TCACCAACCA CGGTGCCGAG CCCATGCGGG AGTGGACGAT CACCTTCGAG
ACGCCCGGCG CCGACGTCAA GCACGTCTGG GGCGGTGAGC TGGTGCGCGG CGGCGACCGC
GTGCAGATCC GCAGCCTGGA CGGCGCTCCG CAGATCCCGC CGGGCGGCAC CTGGGAGGTC
CGCTTCGGCG CCGCCGGCAG CCCGGTCGAG CCGAGGAAAT GCCGCTTCAA CGACCGCGAG
TGCGGCCTGG AGTGA
 
Protein sequence
MTELSTDEPG YVPPDHETTV EIPLPGKPPA GHPGQAEDTV ADHGLPDGRD EAEQTLADPP 
GPRAGGAAPD ATVRDPQDPW APARHDLPGD SGERNAEAGQ GEWTELFGSE DARREAAAPQ
PSERPAEPAA TASLSPLAAA AVPPGATVPD RADASRPEPQ PDAPDSTGAR PLPPPIASHE
QASPAPDAQA PATDTAPAVL PLPDRPVASA RPVNAPGAQR PPAGGRRGSR APLAVAAALV
LLFAVVAGVS ALTLMRGGKD GEATTAKPPA GGASSAPGED GSGGTGAPAG EAPPGAGGSG
VPAPAQDAPA PGASPQDGAP APGRAPADPT PPPRDPIGPV LRGKGLTYQL VQHDPGYYEG
LLIITNHGAE PMREWTITFE TPGADVKHVW GGELVRGGDR VQIRSLDGAP QIPPGGTWEV
RFGAAGSPVE PRKCRFNDRE CGLE