Gene Tcur_4973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTcur_4973 
Symbol 
ID8606339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermomonospora curvata DSM 43183 
KingdomBacteria 
Replicon accessionNC_013510 
Strand
Start bp5626566 
End bp5628239 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content73% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003302525 
Protein GI269129155 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones49 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGACG GCTCTTGGAC GGGGTCACGG ATGCGGGCGG CCGGAGGCTG CGCGGCGGCG 
CTCGTGCTGG CCGGTGCGCT GACCGGGTGC GGCGCCGCGA TGGACGAGGG GTACGGGCCG
GGAAAGCCGG GCGGCGTGCT CCGCATCGTC GGTGACGCCG ACGTCGAGCG CCTCGACCCC
GCCGCCGCGA GCGAGCCTCC CGCTTACGCG CTGAACCGCG TCCTGGCGCG GACGCTTTTC
ACCACCAGGG CGTCCAACAA CTTCGAAGAG AGCCTGCCGG TCCAGGCCGA CATGGCCGAG
GCGGTCCCCT CCAAGGAGAA CGGCGGGATC GGCAAGGGCG GCAGGAAGTA CACCGTCCGG
CTGCGCGGGA ACGTCCACTG GGACACCAGC CCGCCCCGGC CGGTCGTGGC CGGGGATTTC
GTGCGCGGTT TCAAGCGGAT GTGCAACCCG GCGGCGCCGT CGCCCCACCG CGCGCACTAC
ATCGCCACGA TCAAGGGGAT GAAGGCGTTC TGCGACGGCT ACGCCAAGGT CGACGCCGAC
GACGCCGAGG CGATGGCCGC CTACCAGAAC GGGCACTCGA TCTCGGGACT GCGGGCCGCC
GACGAGACCA CGCTGGTGTT CGAGCTGACC CACCCGGCCG CCGACTTCCT CAACATCCTC
ACCCTGGCTC CGGCGGTGGC GGCGCCCAAG GAGTACGACC GGTACGTGCC CGACAGCCGC
GAGTTCCGGC AGAACATCGT CTCCAACGGC CCCTACCGGA TCGCCTCTTA TGAGCCGGGC
CGCTCCTACG TCCTGGAGCA CAACCCCGCC TGGCGGGCCG AGACCGACCC GATGCGGGAA
CGGTTCGCCG ACCGCATCCA GATCACCCTC GGCGTGGGCT CCGCGCAGGA GGTGCGGCGC
CGCATCGAGC GGGGCGAGGC CGACCTGTCG TGGGACCGGC CGGTCCCGGC CGCCGACATC
GAGGAGCTGA GCGGCACGGC GGGCTTTGCG CTCCGCCAGC TCCCCGGCCG CGGCCCGTAC
CTGGTGGCGG CCGGCGTGGC GGACCACCGG GTGCGGCGGG CCCTGCAGTA CACGGTGAAC
CGGACGGCCG TGATCGAGGC TCTGGGCGGG CACGAGGCGG CGCGCCCGCA GCACACCCTG
ATCGCCCCGG GCAACGCCGG GTACTTCGAA CACAACGCCT ACCCCACGCC CGACGACGCC
GGAGACCCCG GCAAGTGCCG CGAGCTGCTC GCCGAGGCCG GCCACCGCGG CGGGCTGCGG
CTGACGCTCG ACCCCGGGGA CGAGCCGGAG AAGGTCGTCC GGGCCGTCCG AAAGAGCCTG
TCGGACTGTG ACATCGAAGT CACGATCCGT AAGAGCGGGG CGGAGCTGAA ACTGGTCTCC
CCCGACCCCG AGTGGTTCGG CCTCAACGGC CGCTCGGCCC TGGCGCCGCT GCTCGACGGG
CACGGCGACC CCCAGATCCG GCAGCTCCTC CAGGAGGCCC TGCGAACCCC CGACACCGCC
CGCGCCACCG GTCTGTGGAA CCAGCTCGAC CGCCTGGTCA TGCAGGACGC CTCGATCGTG
CCGCTGGCCG ACCGGGCCTT CCCCATCCAG CACTCCGCCC GGGTGCGCGG CACCCGGTTC
CTGCCGCAGG CCAGGACCTA CGACTACAGC CGCATCTGGC TCACCGAGGA GTGA
 
Protein sequence
MADGSWTGSR MRAAGGCAAA LVLAGALTGC GAAMDEGYGP GKPGGVLRIV GDADVERLDP 
AAASEPPAYA LNRVLARTLF TTRASNNFEE SLPVQADMAE AVPSKENGGI GKGGRKYTVR
LRGNVHWDTS PPRPVVAGDF VRGFKRMCNP AAPSPHRAHY IATIKGMKAF CDGYAKVDAD
DAEAMAAYQN GHSISGLRAA DETTLVFELT HPAADFLNIL TLAPAVAAPK EYDRYVPDSR
EFRQNIVSNG PYRIASYEPG RSYVLEHNPA WRAETDPMRE RFADRIQITL GVGSAQEVRR
RIERGEADLS WDRPVPAADI EELSGTAGFA LRQLPGRGPY LVAAGVADHR VRRALQYTVN
RTAVIEALGG HEAARPQHTL IAPGNAGYFE HNAYPTPDDA GDPGKCRELL AEAGHRGGLR
LTLDPGDEPE KVVRAVRKSL SDCDIEVTIR KSGAELKLVS PDPEWFGLNG RSALAPLLDG
HGDPQIRQLL QEALRTPDTA RATGLWNQLD RLVMQDASIV PLADRAFPIQ HSARVRGTRF
LPQARTYDYS RIWLTEE