Gene Strop_3602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_3602 
Symbol 
ID5060077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4118974 
End bp4120836 
Gene Length1863 bp 
Protein Length620 aa 
Translation table11 
GC content65% 
IMG OID640475857 
Productglycoside hydrolase family protein 
Protein accessionYP_001160411 
Protein GI145596114 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.438732 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCAT CGACCGGGTC ACTCGGCCGG GGCCGTCCGC GGATCGGACT CTCGCCCAAG 
GAGATCTGTG GCATGAACGT GTGGAGAAGA CTCTCCGTCC CGCGCCCGGC CCTCGCGCTG
ACCGGCGCGG GCGTCCTGGT CGTAGGCGGG TTGGTGACCC TGCCGGTCAC CATGGCCCAC
GCTGCGACCC AGTGTGAGGT GTCGTACACC ACCAATGACT GGCCCGGCGG TTTCACCGCT
TCCCTCAGTA TCAAGAACAC CGGGGAGGCG CTGGATGGCT GGACGCTTCG CTTCACCTTC
CCGAACAGTA GCCAGCAGGT GGTGCACGGC TGGTCGGCTC GGTACAGCCA GTCCGGGCAG
AACGTTGCCG TGCAGAACGA GTCGTACAAC GGTTTGGTGC CCAGTGGCGC CACCATCGAG
ATCGGCTTCA ACGGTTTGTG GAGTGGCAGC AACCCCAAAC CGACGTCGTT CACGCTCAAC
GAGGTGGCCT GCAACGGCGG TGGTGGCCCC ACTACGGCAC CGCCGACAAC TACGCCGCCC
ACCACGGCAC CGCCGACGAC CCCACCTCCC ACCACTCCAC CTCCCACCAC CGCGCCTCCC
GGAGGCCGAG TGGACAACCC GTACCTGAAC GCGGTGGGCT ACGTGAATCC GGAGTGGAAG
GCCAAGGCCG AGTCAGTGTC CGGTGGCAAC CGGGTGTCGA ACACGTCGAC GGCCGTCTGG
ATCGACCGGA TTGCGGCCAT TGAGGGTACC GATGACAGCC AGTCCAATGG CCCGATGGGC
ATACGAGATC ACCTGGATGA GGCGCTGAGT CAGGGGGCGG ACTACATCCA GTTCGTGATC
TACAACCTAC CCGGTCGGGA CTGCGCTGCG CTCGCCTCGA ACGGTGAACT GGCACCGGAC
GAGTTGCCCC GCTACAAGGC CGAGTTCATC GACCCGATAG CGGCGATCCA GAGTGACGCG
GCATACCAGG ATCTCCGGAT CATCAACATC ATCGAGATCG ACTCGTTGCC GAACCTGCAC
GCCAACACCG GCAGCAATCC GGGCGCTACC CCGACCTGTG AGCTTGTCAA GCAGAACGGC
GCCTATGTCA ACGGCATCGG CTACGCGCTG GCCACGTTGG GCGCGATCAG CAACGTCTAC
AACTACGTGG ACGCTGCCCA CCACGGCTGG ATCGGCTGGG ACACCAACTT CAGCCCGGTC
GCCCTGCTCC TGAAGGATGC TGCCACAGCG TCCGGCAGCA CGCTCGACGA CGTGCATGGC
TTCATCGTCA ACACCGCCAA CTACTCGGCA TTGCGCGAGC CCTACTTCCA GATCACCGAC
ACGGTCAACG GCCAGACGAT CCGTCAGTCC ACGTGGGTGG ACTGGAACCA GTACGTCGAC
GAGTTGTCGT TTGCGCAGGA CTTCCGCGAC GAACTAGTTG CCAAGGGATT CGACTCGGGT
GTGGGAATGT TGATCGATAC TTCCCGCAAC GGTTGGGGTG GCAGTGCTCG ACCAACCGCC
CCTGGGCCGA CGACCGATGT GGACAGCTAT GTCGACGGTG GTCGGGTCGA CCGACGAATC
CACGCCGGGA ACTGGTGCAA CCAGTCTGGT GCGGGCCTGG GTGAGCGGCC GCAGGCCGCG
CCGGAGCCCG GTATCGACGC CTATGTCTGG GTAAAGCCGC CGGGCGAGTC GGACGGCTCC
AGCGAGGAGA TCCCGAACAA CGACGGCAAG GGCTTTGACC GGATGTGCGA CCCGACGTAC
GAGGGCAACG CCCGTAACGG CTTCAACCCC AGCGGTGCCC TGCCTGACGC GCCGATCTCC
GGTGCCTGGT TCCCGGCGCA GTTCCAGCAG CTCATGCAGA ACGCCTACCC GCCGCTGTCC
TGA
 
Protein sequence
MSASTGSLGR GRPRIGLSPK EICGMNVWRR LSVPRPALAL TGAGVLVVGG LVTLPVTMAH 
AATQCEVSYT TNDWPGGFTA SLSIKNTGEA LDGWTLRFTF PNSSQQVVHG WSARYSQSGQ
NVAVQNESYN GLVPSGATIE IGFNGLWSGS NPKPTSFTLN EVACNGGGGP TTAPPTTTPP
TTAPPTTPPP TTPPPTTAPP GGRVDNPYLN AVGYVNPEWK AKAESVSGGN RVSNTSTAVW
IDRIAAIEGT DDSQSNGPMG IRDHLDEALS QGADYIQFVI YNLPGRDCAA LASNGELAPD
ELPRYKAEFI DPIAAIQSDA AYQDLRIINI IEIDSLPNLH ANTGSNPGAT PTCELVKQNG
AYVNGIGYAL ATLGAISNVY NYVDAAHHGW IGWDTNFSPV ALLLKDAATA SGSTLDDVHG
FIVNTANYSA LREPYFQITD TVNGQTIRQS TWVDWNQYVD ELSFAQDFRD ELVAKGFDSG
VGMLIDTSRN GWGGSARPTA PGPTTDVDSY VDGGRVDRRI HAGNWCNQSG AGLGERPQAA
PEPGIDAYVW VKPPGESDGS SEEIPNNDGK GFDRMCDPTY EGNARNGFNP SGALPDAPIS
GAWFPAQFQQ LMQNAYPPLS