Gene Tcur_1736 details

Gene Information

Locus tag: Tcur_1736
Symbol:
ID: 8603063
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermomonospora curvata DSM 43183
Kingdom: Bacteria
Replicon accession: NC_013510
Strand:
Start bp: 2029895
End bp: 2031202
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: YP_003299348
Protein GI: 269125978
COG category:
COG ID:
TIGRFAM ID:
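
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of the source database.

```python
def span_length(start_bp, end_bp):
    """Length in bp of a 1-based, inclusive coordinate span (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp, has_stop=True):
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


# Values taken from the record above.
print(span_length(2029895, 2031202))   # 1308
print(expected_protein_length(1308))   # 435
```

Both results agree with the listed gene length (1308 bp) and protein length (435 aa).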


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.395658
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTTCTC CGAGACGACG CGGACTGACC GCCATCGCGG CGGTGCTGGC CGGCGTCCTG 
GCGGCCACCG CCGCCTGCGG CGGCGGCGAT GACGGTGACG GCGGCTCCGG CGGCACCATC
ACGCTCACCG TCGACACCTT CGGCGAGTTC GGCTACGAGG AGCTGTTCAA GCAGTACGAG
GCGTCCCACC CCAACATCAA GATCAACTCG CGCAAGGTGG CGGACCTGGA CACCTACAAG
CCGCGCCTGC AGCAGTGGAT CGGCACCGGC AAGGGCGCCG GCGACGTGGT CGCCCTGGAG
GAGGGCCTGC TGCCCACCTA CATGCAGCAG CGCGACAAGT TCCTCAACCT CTTCGACTAC
GGCGGCGCCG AGCTGGAGCA GAACTTCCTG CCCTGGAAGT GGCAGATGGG ACTGAGCCCC
GACGGCAAGC AGCTCATGGC GCTGGGCACC GACATCGGCC CGCTCGGCAT GTGCTACCGC
AAGGACCTGT TCGAGAAGGC CGGGCTGCCC ACCGACCGCG ACGAGGTCAC CAAGCTGTGG
CCGACCTGGG AGGAGTTCTT CAAGGTCGGC CAGGACTTCC AGCGCAAGGT CTCCGGCACC
AAGTTCCTCG ACGGCCCGCA GGCGCTGCTG CGCGTCACCG TGCTGCAGGA GGCCGGCAAG
GGCCCGGGCT ACAGCTACTT CGACAAGAGC GACAACTTCG TCTTCGACAC CAACCCGGCG
GTCAAGAACG CCTTCGACAC CGTGCTGAAG TTCCAGGAGG CCGGGCTGAC CGCCAACATG
CAGATCTTCA CCCCGCCGTG GCAGACCGCG CTCAAGCGGG ACACCTTCGC CACCGTGCCG
TGCCCGGCCT GGATGCTGGG CGGCCTGGAG GAGTTCTCCG GTGACTACGG CAAGGGCAAG
TGGGACGTGG CCGGGGTGCC CGGCGGCGGC GGTTACTGGG GCGGCTCCTG GCTGGCGGTG
CCCAAGCAGA CCAAGCACCC CAAGGAGGCC GCCGAGCTGG CCAAGTTCCT GACCAGCCCG
GAGGGCCAGC TGGCCGCCTT CAAGGACAAG AACACCTTCC CGTCCTCGCC CAAGCTCTAC
AGCGACCCGG CCGTCACCGA GGCCAAGAGC GAGTACTTCA ACAACGCCCC GATCGGCAAG
ATCTTCAGTG AGGCCGCCTC CCAGGTGCGG CCGGTCTACC TCGGCCCCAA GAACGAGGAC
GTCCGGCAGA ACGTGGAGAA CGTCCTGGTG GCCGTCGCCG AAGGCAAGAT CAAGCCCGAT
GAGGCCTGGA GCAAGGCGGT GGAGGAGGCC CGCAAGGCCG CCCGGTAA
 
Protein sequence
MGSPRRRGLT AIAAVLAGVL AATAACGGGD DGDGGSGGTI TLTVDTFGEF GYEELFKQYE 
ASHPNIKINS RKVADLDTYK PRLQQWIGTG KGAGDVVALE EGLLPTYMQQ RDKFLNLFDY
GGAELEQNFL PWKWQMGLSP DGKQLMALGT DIGPLGMCYR KDLFEKAGLP TDRDEVTKLW
PTWEEFFKVG QDFQRKVSGT KFLDGPQALL RVTVLQEAGK GPGYSYFDKS DNFVFDTNPA
VKNAFDTVLK FQEAGLTANM QIFTPPWQTA LKRDTFATVP CPAWMLGGLE EFSGDYGKGK
WDVAGVPGGG GYWGGSWLAV PKQTKHPKEA AELAKFLTSP EGQLAAFKDK NTFPSSPKLY
SDPAVTEAKS EYFNNAPIGK IFSEAASQVR PVYLGPKNED VRQNVENVLV AVAEGKIKPD
EAWSKAVEEA RKAAR
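
The gene and protein sequences above are printed in blocks of ten characters, so the spaces and line breaks must be removed before any computation. The following is a minimal sketch, not the database's own tooling, of how the GC content and the translation could be reproduced. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). All function names are hypothetical.

```python
# Compact form of the standard codon table, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def translate(seq):
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGGGTTCTC CGAGACGACG CGGACTGACC")
print(translate(first_blocks))  # MGSPRRRGLT
```

The printed peptide matches the first block of the protein sequence. Passing the full cleaned gene sequence to `gc_content` and `translate` would allow the listed GC content and the complete protein sequence to be checked in the same way.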