Gene Strop_2285 details


Gene Information

Locus tag: Strop_2285
Symbol:
ID: 5058748
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 2578492
End bp: 2579880
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 66%
IMG OID: 640474547
Product: glycoside hydrolase family protein
Protein accession: YP_001159113
Protein GI: 145594816
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
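
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumption)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2578492, 2579880)
print(length, protein_length_aa(length))  # prints "1389 462"
```

Both results agree with the Gene Length and Protein Length values listed in the record.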


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCTGG GCATGAGGAC TGTGGCCCTG GCGGGCGTCC TCGCGGTGGT GGCCGCGGGG 
GCCCTGGTGG CCAGCGCCAC CGCGACCGCG GCGGCCGCAC CGGCCGCGAC GTTCGTCAAG
GTTTCCGACT GGGGTAGTGG CTGGGAAGGC CGATACACCA TCACCAACGG GGGAAGCAGC
ACCCTGAACT CCTGGCAGGT CGAGTTCGAC CTACCGGCGG GCACGAGCGT CGGCTCGTAC
TGGAACGCGT TGATGGACCG CGACGGACAG CGCTACCGCT TCACCAACCA GCACTGGAAC
GGCACGATCG CACCGGGCTC CTCGGTGACG TTCGGCTTCC TCGGCGCCGG CCCGGGCAGC
CCGAGCGGTT GCCAGCTCGA CGGCCAGCCG TGCGAGCCGA CAGCTCCTCC GACGACCAGC
CCACCCCCAC CACCACCGAC CACCGCGCCT CCCGGTAGCA CACCGGTTGC GGCGAACGGG
CAACTGCGGG TCTGTGGTGA GCAGCTGTGT AACGAGAACG GCAAGCCGAT CCAACTGCGC
GGCATGAGCA CGCACGGTCT TCAGTGGTAC GCCAACTGCG CGACCAGCGC CTCGCTCGAC
GTCCTCGCCC AGCAGTGGGG CGCCGACGTC CTGCGGATCT CGATGTACAT CCAGGAAGGC
GGCTACGAGA CCGACCCGCG TGGCTTCACC GATCTTGTCC ATGATTACAT CGAGATGGCC
ACCGCTCGCG GTCTCTACGC GATCGTTGAC TGGCACATGC TCACGCCGGG AGATCCGAAC
GACAACCTGT CGCGCGCGCA GACCTTCTTC GCCGAGATCG CCGACCGCCA CCGGGACAAG
GTGAACGTGC TCTACGAGAT CGCGAACGAG CCGAACGGTG TCAACTGGAG CGCGGTCAAG
AGCTACGCCG ACCAGGTCAT CCCGGTCATC CGGGAACGGG ACCCGGAGTC CGTGGTGCTC
GTCGGCACCC CCGACTGGTC GTCGCTCGGT GTCTCCGGAA GTGGCGGCGG CGTTGACGCC
ATCACCGCCA ACCCGGTGTC GGCAAGCAAC CTCATGTACG TCTTCCACTT CTACGCGGCC
TCGCACGGCG ACCTGTACTA CAACACCCTG GCGGCGGCGG CTGACCGAAT CCCGATCTTC
GTGACCGAGT TCGGCACCCA GCAGTACACC GGTGACGGTC CGAACGACTT CACCATGTCC
CAGCGTTACC TCGACCTGAT GGCGAGCAAG AAGATCAGTT GGGTCAACTG GAACTACTCC
GACGATTTCC GCTCCGGCGC GGTCTTCACC ACAGGAACGT GCGCCAGCGG CGACTTCGGC
GGTACGGGCC CGCTCAAGCC GGCCGGTAGC TGGATTCGCG ACCGCATGCG TACCGCTGAC
GACTTCTGA
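
The gene sequence above is laid out in blocks of ten bases, so it needs to be joined before any per-base statistic such as GC content can be computed. The following is a minimal sketch of that step; the helper names are hypothetical, and the example only uses the first line of the sequence, so its result is not the whole-gene GC content.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence shown above (six blocks of ten bases).
first_line = "ATGAAGCTGG GCATGAGGAC TGTGGCCCTG GCGGGCGTCC TCGCGGTGGT GGCCGCGGGG"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content field of the record.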
 
Protein sequence
MKLGMRTVAL AGVLAVVAAG ALVASATATA AAAPAATFVK VSDWGSGWEG RYTITNGGSS 
TLNSWQVEFD LPAGTSVGSY WNALMDRDGQ RYRFTNQHWN GTIAPGSSVT FGFLGAGPGS
PSGCQLDGQP CEPTAPPTTS PPPPPPTTAP PGSTPVAANG QLRVCGEQLC NENGKPIQLR
GMSTHGLQWY ANCATSASLD VLAQQWGADV LRISMYIQEG GYETDPRGFT DLVHDYIEMA
TARGLYAIVD WHMLTPGDPN DNLSRAQTFF AEIADRHRDK VNVLYEIANE PNGVNWSAVK
SYADQVIPVI RERDPESVVL VGTPDWSSLG VSGSGGGVDA ITANPVSASN LMYVFHFYAA
SHGDLYYNTL AAAADRIPIF VTEFGTQQYT GDGPNDFTMS QRYLDLMASK KISWVNWNYS
DDFRSGAVFT TGTCASGDFG GTGPLKPAGS WIRDRMRTAD DF
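
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that translation, assuming the amino-acid assignments of NCBI table 11 (which match the standard code and differ only in the permitted start codons). It does not handle alternative start codons, and the names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments in NCBI order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
print(translate_cds("ATGAAGCTGG GCATGAGGAC TGTGGCCCTG"))  # prints "MKLGMRTVAL"
# Last four codons of the gene sequence, ending in the TGA stop codon.
print(translate_cds("GACGACTTCTGA"))  # prints "DDF"
```

The two fragments reproduce the first ten and the last three residues of the protein sequence listed above.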