Gene Strop_2684 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_2684 
Symbol 
ID5059147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp3018052 
End bp3020028 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content65% 
IMG OID640474940 
Productglycoside hydrolase family protein 
Protein accessionYP_001159506 
Protein GI145595209 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.954713 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0577257 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCTGTGA AACCACGATT GCATGCTGCC GCGGTGGCGG CCTGCGTAGC TCTGGGGCTG 
ACAGCCGCCG CCCCGGCCGC GTTCGCCACC ACGTCACCCG AGCCGGAGGC GCAGGTCGTC
TCGGCCGTAC CGGCGGCGGA CTGGCTACAC ACCGATGGCA ACAAGATTGT GGATGAGGCT
GGTAACCAGG TGTGGTTGAC CGGCGCGAAC TGGTTCGGCT ACAACGCCAC CGAGCGAGTC
TTCCACGGAC TCTGGGCGGG CAACATCGAG ACCATCACCC GGCAGATGGC CGAGCGTGGA
ATCAACATCG TTCGGGTGCC GATCTCCACT GAGTTGCTGC TGGAGTGGAA GGCCGGCCAG
ACGGTGCTGC CGAACGTGAA CCTGTCGGTC AACCCGGAGT TGGCTGGCAT GGACAACCTG
CAGATCTTCG ACTACTGGTT GGCCCTCTGC GAGGAGTACG GCCTGAAGGT CATGCTCGAT
GTGCACAGTG CCGAGGCTGA CAACTCTGGC CATTTCTATC CGATGTGGCA CAAGGGGGCA
ATCACGCCGG AGCTGTTCTA CCAGGGCTGG GAGTGGGTAG CCGCTCGGTA CCAGAACAAC
GACACGATTG TCGCGATGGA TATCCAGAAC GAGCCGCACG GCACTCCGAA CAACCCGCCC
CGGGCCAAGT GGGACGGCAC CAGCGACATC GACAACTTCA AACACGCCTG TGAGACGGCC
GGCAACCGAA TTCTGGCGAT CAACCCGAAC GTGCTGATTC TCTGCGAGGG CGTCGAGGTC
TACCCGCGGC CGGGGGAGAG CTGGGACTCA CCCAACACCG ACCCGGACCA GAGCCCCAAC
TACCACTACA ACTGGTGGGG CGGCAACCTG CGTGGGGTGA AGGACCACCC GATCAACCTG
GGAGCCCACC AGGACCAGCT GGTCTATTCG CCGCACGACT ACGGGCCGCT GGTGCATGAG
CAGCCGTGGT TCCAGAAGGA CTTTGACAAG ACCACGTTGA CCAACGACGT GTGGCGGCCA
AACTGGCTCT ACCTCCACGA GGAAGACACC GCGCCGCTGC TGGTCGGCGA GTGGGGTGGC
CGGTTCGGGC AGGACGATCG GCAGGACAGG TGGCTGAAGG CCCTGCGGGA CCTGATGGCG
GAGATGGTGA TTCACCATAC CTTCTGGTGT CTCAACCCGA ACTCCGGCGA CACCGGCGGC
CTGCTGCAAC ACGATTGGCA GACCTGGGAC GAGGTCAAGT ACGACCAGGT GCTCAAGCCG
GCGCTCTGGC AGCACAACGG CAAGTTCGTC AGCCTTGACC ACCAGGTACG CCTCGGTGGC
GAAGCCTCGA CCACCGGCAT CAGCCTCACC GAGCGCTATG CCGGCGGCGG GAACGACACC
GTCGTGCCGA CCGCCCCGGG TCGTCCCGTG GCCAGTGACC TGACTTCCTC GGCAGTCACC
CTGACCTGGG AAGCGTCCAC TGACAACGTC GGAGTTGTCG CGTACGAGGT GCGGAACGCG
ACAGACGGTG GCCCGCCGAA CACGGTCGCC ACCGTCGCCG GTACCACCTA CCAGGTGACC
AACCTCGCGG CCGAGACCGA ATACACCTTC ACGGTACGGG CCCGGGACGC GGCCGGAAAC
TTCTCCGCCG CCTCCCCGGC CCGTACCGTC ACCACCCCAC CCGGTGGTGG TGGAGGCAGC
GGCTGTAGCG CCACGTACCT GCTGATGAAC ACGTGGTCGG GGGGCTTCCA GGGTGAGATC
ACCGTGGAAA ACACCGGTCC CGCAGCCATC GCCGGCTGGC GGGTCAGCTG GAACGACCCG
GGCGGGACTG CGATCACCTC GCTGTGGAAT GGCAGGTGGA CGGTCACCGA TGGCGCGAAT
GTGGTGATCA ACGAGTCGTA CAACGGTCAG CTGGCGGCCG GGAGTAGCAC CACGTTCGGC
TTCACCGGGA CCGGTCCGGG CACCGCGCCG GGCGGGCTGA CCTGCTCCGC CCCGTGA
 
Protein sequence
MSVKPRLHAA AVAACVALGL TAAAPAAFAT TSPEPEAQVV SAVPAADWLH TDGNKIVDEA 
GNQVWLTGAN WFGYNATERV FHGLWAGNIE TITRQMAERG INIVRVPIST ELLLEWKAGQ
TVLPNVNLSV NPELAGMDNL QIFDYWLALC EEYGLKVMLD VHSAEADNSG HFYPMWHKGA
ITPELFYQGW EWVAARYQNN DTIVAMDIQN EPHGTPNNPP RAKWDGTSDI DNFKHACETA
GNRILAINPN VLILCEGVEV YPRPGESWDS PNTDPDQSPN YHYNWWGGNL RGVKDHPINL
GAHQDQLVYS PHDYGPLVHE QPWFQKDFDK TTLTNDVWRP NWLYLHEEDT APLLVGEWGG
RFGQDDRQDR WLKALRDLMA EMVIHHTFWC LNPNSGDTGG LLQHDWQTWD EVKYDQVLKP
ALWQHNGKFV SLDHQVRLGG EASTTGISLT ERYAGGGNDT VVPTAPGRPV ASDLTSSAVT
LTWEASTDNV GVVAYEVRNA TDGGPPNTVA TVAGTTYQVT NLAAETEYTF TVRARDAAGN
FSAASPARTV TTPPGGGGGS GCSATYLLMN TWSGGFQGEI TVENTGPAAI AGWRVSWNDP
GGTAITSLWN GRWTVTDGAN VVINESYNGQ LAAGSSTTFG FTGTGPGTAP GGLTCSAP