Gene Sare_4899 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4899 
Symbol 
ID5707415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5565476 
End bp5566651 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content66% 
IMG OID641274294 
Productglycoside hydrolase family protein 
Protein accessionYP_001539639 
Protein GI159040386 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000537243 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGACGCC TGGCCCAGGC GCTGACCGTA CTCGCGGTCC TCGCCACCGC CACCCTGGTC 
GCCGCAGCAC CGGCACAGGC CGTGACGATC TGCGAGCAGT ACGGCTCCAC CACCGTCCAG
AACACGTACA TCGTCCAGAA CAACCGCTGG GGCAGCGCCG CCCAGCAGTG CATCGACACG
ACCAACAGCG GCTTCCGGAT CACCTCGCAG CAGGGATCCA CCTCGCCCTC CGGGCCGCCG
CTGTCCTACC CGTCGATGGT GCTCGGATGT CACTACCTGA ACTGCTCACC CGGGACCAAC
CTGCCGAAGA AAGTCGGCCA GATCAGCAGC GTCCCATCCT CGATCAGCTA TTCGTACGCC
GGCGGAACCT ACAACGCCGC GTACGACATC TGGCTGGACC CTGCTCCGAA GACCGACGGA
GTGAACCGGA TGGAGATCAT GATCTGGTTC CACCGGCAGG GGCCGATCCA GCCGATCGGC
AGTCCGGTCG GCAACACCTC GGTGGGCGGC CGTAGCTGGC AGGTCTGGCA GGGCAACAAC
GGTGGCAACG ACGTGGTCTC CTACCTGGCA CCCGGGGCCA TCGGAAGCTG GTCGTTCGAC
GTCAAGGACT TCATCAACGA CGTCGTAGCG CGCACCCAGG TCACCAACGA CTGGTACCTG
ACCAGCCTCC AGGCGGGTTT CGAACCGTGG AGCGGCGGTG TCGGGCTGAG CGTCGACAGT
TTCTCCGCCA CGGTGACCGT CGGGACGAAC CCACCGCCCC CACCGGGCAC CAGCGGCACG
ATCGTCGGTC AGGGCAGCGG CCGCTGTCTG GACCTTTTGG ACCTCGGTAC CGCCGACGGT
ACCCCGATCC AGCTGTGGGA CTGCACCGCC AACTGGAACC AGCTCTGGAC CCGCACCGGC
AACACCTTCG TCAACCCACA GACCAGCAAG TGCCTCGATG TCGCCGGCGG TTCCACCGCC
AACGGTGCCC AGGTGCAGCT GTATACCTGC AACGGCACCG GGGCCCAGAA CTGGCAGGTC
AACGGCGATG GCACCATCAC CAACCCGCAG TCGGGCAAGT GCCTCGACGC GATGGAGAGG
GGAACCGCCA ACGGCACCCG GATCCAGATC TGGGACTGCT ACGGCGGCGG CACCCAGGCC
AACCAGGTCT GGACGGTCAA CGGCCGCACC CGTTGA
 
Protein sequence
MRRLAQALTV LAVLATATLV AAAPAQAVTI CEQYGSTTVQ NTYIVQNNRW GSAAQQCIDT 
TNSGFRITSQ QGSTSPSGPP LSYPSMVLGC HYLNCSPGTN LPKKVGQISS VPSSISYSYA
GGTYNAAYDI WLDPAPKTDG VNRMEIMIWF HRQGPIQPIG SPVGNTSVGG RSWQVWQGNN
GGNDVVSYLA PGAIGSWSFD VKDFINDVVA RTQVTNDWYL TSLQAGFEPW SGGVGLSVDS
FSATVTVGTN PPPPPGTSGT IVGQGSGRCL DLLDLGTADG TPIQLWDCTA NWNQLWTRTG
NTFVNPQTSK CLDVAGGSTA NGAQVQLYTC NGTGAQNWQV NGDGTITNPQ SGKCLDAMER
GTANGTRIQI WDCYGGGTQA NQVWTVNGRT R