Gene Tcur_2296 details


Gene Information

Locus tag: Tcur_2296
Symbol:
ID: 8603633
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermomonospora curvata DSM 43183
Kingdom: Bacteria
Replicon accession: NC_013510
Strand:
Start bp: 2679992
End bp: 2681746
Gene Length: 1755 bp
Protein Length: 584 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: extracellular solute-binding protein family 5
Protein accession: YP_003299900
Protein GI: 269126530
COG category:
COG ID:
TIGRFAM ID:
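
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts one terminal stop codon; neither assumption is stated in the record itself.

```python
# Values copied from the Gene Information fields.
start_bp = 2679992
end_bp = 2681746
gene_length_bp = 1755
protein_length_aa = 584

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the CDS includes exactly one terminal stop codon,
# which is not counted in the protein length.
codons = span // 3
coding_codons = codons - 1

print("span (bp):", span)
print("span divisible by 3:", span % 3 == 0)
print("codons excluding stop:", coding_codons)
```

Under these assumptions the span equals the stated gene length, and the codon count minus the stop equals the stated protein length.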


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000797709
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAACTC CCAAAGTCTG GGCAGCGCTG ACGGCGGGCG CCGTCGCGCT CAGTCTGGGC 
CTGACCGCCT GCGGCGGCGG CGATGACGGC GACAGCGACA GCTCGGGCCT TAAGTTCGAC
GCCGGGACCA ACGGCATCGT CAACCCGTCC GACAAAAAGG GCGGCACGCT GAAGGTGGGG
ATGAGCGACG ACTTCGACTC CACCGACCCC GGCGACACCT ACTACGCCTT CGGCAACAAC
TTCATCCGGC TGTACACCCG GACGCTGATG ACCTACACCT CCCAGCCGGG CGCTGCGGGC
CTCAAGCCCT CGCCCGACCT GGCCGAGGCC CCGGGTGTGC CCAGCGACGA CAACAAGACC
TGGACCTTCA AGCTCAAGAA GGGCCTGAAG TACGAAGACG GCACGGAGAT CAAGGCCGAG
CACATCAAGT ACGCGGTCGC CCGCACCTAC GACCGCGGCG TCCTCGGCCA CGGCCCGGCC
TACTTCCCCC AGCTGCTGGA CGCCGACGGC TACAAGGGCC CCTACAAGGA CAAGAACCTC
GACAACTTCA AGGGCATCGA GACCCCCGAC GACTACACCC TGATCTTCAA GCTCAAGGAG
CCCTTCCCCG AGTTCAACGA GCTGGTGACC TTCTCCGGCC AGACCGCCCC CGTGCCGCCG
GACAAGGACA AGGGCGCCCA GTACCGGCTG CGTCCGCTCT CCTCCGGCCC CTACAAGTGG
GAGGGCAACT ACCAGCCGAA GAAGGGCGGC GTGCTGGTCC GCAACGAGCA CTGGGACCCC
AGCACCGACC CCAACCGCAA GGCGCTGCCG GACCGCATCG AGGTCATCGC CGGCATTGAG
GCCAACGAGG TCGACAACCG CCTGATGAAC GGCGAGCTCC ACGTCGACCT GGCCGGCAGC
GGCGTGCAGG ACGCCGCCCG GCAGAAGATC CTCACCAACC CGGACCTGAA GGCCAAGGCC
GACAACCCGC TGGCCGGCTT CCACTGGTAC ATCCCGATCA ACCTCAAGAC CATCCCCAAC
CTGGAGTGCC GCAAGGCGAT CGTGTACGCC GCCGACCGGG ACGCCATGTG GCGCGCCTAC
GGCGGTGACG TCGGCGGCGA GCGGGCCACC TCCATCCAGC CGCCGAACAT CGCCGGCCGC
CAGAAGGGCA CCGACTTCTA CACCTCCACC GCCCCCGGCT ACAAGGGCGA TGTGGACAAG
GCCAAGGAGG CCCTGCAGAA GTGCGGCAAG CCCGACGGCT TCTCGACCAC CATGGTCTAC
CGCAGCGACC GGCCCAAGGA GAAGGCCGTC GCCGAGGCCC TGGAGCAGTC GCTGGCCCGG
GTCGGCATCA AGCTGACCCT CAAGGGCTAC CCGGCCGGCA CCTACACCGG TGAGCAGCTC
GGCTCCCCGT CCTTCGTCAA GAAGGAGAAC ATCGGCCTGG GCACCTACGG CTGGGCGCCC
GACTGGCCCA CCGGCTACGG CTACCTGCAG GCGCTCACCG ACGGCAAGGC GATCGTCGAG
GCCGGCAACA CCAACGTCTC CGAGCTGGAC GACCCTGAGA TCAACAAGCT CTGGAACGAC
GTGGTGAAGA TCACCGACGC CGCCGAGCGC GAGAAGATCT ACAACCGGAT CGACGAGAAG
GCGCGCGAGC TGGCCGCCAT CCTGCCCAAC GTCTACGCCA AGTCCCTGCT GTACCGGCCG
GAGACGCTGA CCAACGTCTA CTTCCACCAG GGCTTCGGCA TGTACGACTA CGCCAACCTC
GGTGTGACCG GCTGA
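
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any base counting. The following is a minimal sketch of how the GC content figure could be recomputed; the helper names `clean_sequence` and `gc_percent` are illustrative and not part of the record, and only the first line of the sequence is used here as a short demonstration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only; paste the full listing
# to compare against the GC content stated in the record.
first_line = "ATGAAAACTC CCAAAGTCTG GGCAGCGCTG ACGGCGGGCG CCGTCGCGCT CAGTCTGGGC"
seq = clean_sequence(first_line)
print(len(seq), "bases,", round(gc_percent(seq), 1), "% GC")
```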
 
Protein sequence
MKTPKVWAAL TAGAVALSLG LTACGGGDDG DSDSSGLKFD AGTNGIVNPS DKKGGTLKVG 
MSDDFDSTDP GDTYYAFGNN FIRLYTRTLM TYTSQPGAAG LKPSPDLAEA PGVPSDDNKT
WTFKLKKGLK YEDGTEIKAE HIKYAVARTY DRGVLGHGPA YFPQLLDADG YKGPYKDKNL
DNFKGIETPD DYTLIFKLKE PFPEFNELVT FSGQTAPVPP DKDKGAQYRL RPLSSGPYKW
EGNYQPKKGG VLVRNEHWDP STDPNRKALP DRIEVIAGIE ANEVDNRLMN GELHVDLAGS
GVQDAARQKI LTNPDLKAKA DNPLAGFHWY IPINLKTIPN LECRKAIVYA ADRDAMWRAY
GGDVGGERAT SIQPPNIAGR QKGTDFYTST APGYKGDVDK AKEALQKCGK PDGFSTTMVY
RSDRPKEKAV AEALEQSLAR VGIKLTLKGY PAGTYTGEQL GSPSFVKKEN IGLGTYGWAP
DWPTGYGYLQ ALTDGKAIVE AGNTNVSELD DPEINKLWND VVKITDAAER EKIYNRIDEK
ARELAAILPN VYAKSLLYRP ETLTNVYFHQ GFGMYDYANL GVTG
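
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is a self-contained illustration, not the database's own method. It assumes that translation table 11 assigns amino acids to codons exactly as the standard genetic code does (the two tables differ in their permitted start codons), and it builds the codon table from the conventional TCAG base ordering. Only the first and last lines of the gene sequence are translated here; the function name `translate` is illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
# Assumption: table 11 uses the same codon-to-amino-acid mapping as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence; both begin in frame
# because every full line holds 60 bases (20 codons).
gene_start = "ATGAAAACTC CCAAAGTCTG GGCAGCGCTG ACGGCGGGCG CCGTCGCGCT CAGTCTGGGC"
gene_end = "GGTGTGACCG GCTGA"

print(translate(gene_start))  # MKTPKVWAALTAGAVALSLG
print(translate(gene_end))    # GVTG
```

The two outputs match the first twenty and last four residues of the protein sequence listed above.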