Gene Sare_1083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1083 
Symbol 
ID5704074 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1215649 
End bp1218072 
Gene Length2424 bp 
Protein Length807 aa 
Translation table11 
GC content69% 
IMG OID641270598 
Productglycoside hydrolase family protein 
Protein accessionYP_001535982 
Protein GI159036729 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4724] Endo-beta-N-acetylglucosaminidase D 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00148049 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGACGCC TGCTCCTGTC CTTGCTGGCC GGCGCCACAG TCGTGGCCGG CAGCACCCTC 
ACCGCGGCTC CCACCGTGGC AGCCGCCACC GAAGCGGTCG ACGGCAGCCA GCCGTACGCC
TCGTACTGGT TCCCGAACGA ACTCCTCGAC TGGGATCCGG AGACGGATCC CGACGCCCGC
TTCAACCGGT CCATGGTTCC GCTCCAGCCC CGGGCCACCG ATCCCGCGCT GAAGGCCAAC
CCGAACGCGC GAGCGGGCGA GGGCCGAGTG GCGTCGCTGG TGTCGTTCGC ACCGACGTCC
GACAACCCGT CGCAGGGCTC GCGCGACGAG GACTACTACG CCTTTGGCCA CTGGCAGTAC
ATCGACACCC TGGTGTTCTG GGGCGGATCG GCCGTCGAGG GCCTGATTCT CGCGCCGAAC
CCGACCGTCA TCGACGCCGC GCACCGCAAC GGCGTCAAGG TGTACGGCAC GGTCTTCTTC
CCGCCGGTCG CCTACGGCGG CAAGATCGAC TGGGTGCACG ACTTCGTGCG TAAGTCGGGC
TCGACCTATC CGGTCGCCGA CAAGCTCGCG GAGGTGGCGC AGTACTACGG CTTCGAAGGG
TGGTTCATCA ACCAGGAGAC GACGGGCGGC AACACCGCCC TCGCCACCGA ACTCCGCAAC
CTGATGACCT ACGGCCGGAA CAAGGGCGTG GAGTTCATGT GGTACGACGC GATGACCGAA
TCCGGCGCCA TTACCTGGCA GAACGCGCTC ACCACCGCCA ACGACTCGTT CCTGGGCGGC
CCGACGCCGG TGTCGGACTC GATGTTCCTC AACTTCTGGT GGTCCACCGG CGGCCTGGCC
TCGTCCCGGG ATCGCGCCGA GTCACTCGGG CGGAGCGGGT ACGACCTGTA CTCCGGCATC
GACACCGAGG CCAACGGCTA CCAGACCAAC GTCAACTGGG ACGCCCTGTT CCCCGCCGGT
GGGTCGCACG TGACCTCCCT GGGCATCTAC CGCCCGGAGT GGACCTGGAC GTCGTCGAGC
GGGCCGGCCG ACTTCCGGGC ACGCGACTCC CGCTACTGGG TCGGCGCGAA CGGCGACCCG
TCGAACACCA CGACCTCCTC GCCTTGGAAA GGGCTCGCCA CCTACGTCGC CGAGTCCACG
CCGGTGACCC AGAAGCCGTT TGTGACCAGC TTCAACGCCG GGCAGGGTTC GACATACCAC
GTCGCCGGGA ACCAGGTGCG CACCGGCGGC TGGAACAACC TGTCGATGCA GGACGTGCCG
CCGACCTACC AGTGGGTGGT CTCCTCGACC GGCACGAAGC TGACGCCGTC GCTCGACTTC
ACCGATGCCT ACGAGGGCGG CTCGACGCTG CGGCTCAACG GCAGGCTGGA CGCGACGAAC
ACCGTGCGCC TCTACCAGAC CGACCTGCCG GTCGCCGCGG ACACCAAGCT GTCGACGGTC
GTCAAGACCC CGGCCGCCGG TGCGACCCAC CTGAGCGTGG CGGTGGCCTT CACCGACGCC
CCGAACACCT TCACCACTCT CGACCTCGGG TCGACCTCCG GCACCGGTTG GGAGCGTCGC
GTTCTCGACC TGTCCGCGTA CGCCGGTAAG ACCATCGCCC AGATCGGGCT GCGGGCGTCG
GCGTCGGCCG TCGTCCCGTC CTACGACATC AAGGTTGGCC AGCTCGCCGT GTACGACGGG
GCCGTGGACA CCGCTGCCGC GCCGACCGGT CTGACCGTCC TGGGCAGCAC CGACGTCTCG
GCGACCCGCA AGACACTGAG GTTGGACTGG ACCCCGTCGG CCAGCGGATC GGTGCACCAC
TACGACGTGT TCCGCCGCAA CCCGGACGGC AGCCGTACCC ACCTGGGCGC CACGCCGAAC
GACGTGTACT TCGTGCCGCA GCTCGACCGG GTCGGCGCCG AGACCAGCAC GGTCATCGAG
GTCGAGGCGG TGTCGACCGA GTACGGCCGC TCCACCGCGG CGACCACGAC CGTCACCTGG
TCCGGCACGC CGCCGACCAC GACCAACCTG GCGCTCGACC GGCCGGCGAC GGCCTCCGGG
CAGTGCACCG CTACCGAAGG ACCCGCCAAA GCCGTCAACG GCAGTGTCTC CGGCGGGAAC
AGCGACAAGT GGTGCACGAC GACCGCCAAC CAGTGGCTCG AGGTTGACCT GGGCTCGGTC
CGTGCCCTCG ACCGGTTCGT CGTCGCGCAC GCCGCCGCGG GCGGCGAGTC CGCCTCGTGG
AACACCCGCG ACTTCACCAT CGACGTACGC TCCGCGGCCT CGGACCCGTG GACCACGGCC
GTCACCGTCA CCGACAACAC CGCCGAGTTG ACAACACACC CAGTGAGCGT CAGCGCACGG
TACGTGCGGT TGGTTGTCGA CACCCCGACC CAGGACGGCG ACCCCGCCAC CCGCATATAC
GAGTTCGAGG CCTGGGGCGA GTAG
 
Protein sequence
MRRLLLSLLA GATVVAGSTL TAAPTVAAAT EAVDGSQPYA SYWFPNELLD WDPETDPDAR 
FNRSMVPLQP RATDPALKAN PNARAGEGRV ASLVSFAPTS DNPSQGSRDE DYYAFGHWQY
IDTLVFWGGS AVEGLILAPN PTVIDAAHRN GVKVYGTVFF PPVAYGGKID WVHDFVRKSG
STYPVADKLA EVAQYYGFEG WFINQETTGG NTALATELRN LMTYGRNKGV EFMWYDAMTE
SGAITWQNAL TTANDSFLGG PTPVSDSMFL NFWWSTGGLA SSRDRAESLG RSGYDLYSGI
DTEANGYQTN VNWDALFPAG GSHVTSLGIY RPEWTWTSSS GPADFRARDS RYWVGANGDP
SNTTTSSPWK GLATYVAEST PVTQKPFVTS FNAGQGSTYH VAGNQVRTGG WNNLSMQDVP
PTYQWVVSST GTKLTPSLDF TDAYEGGSTL RLNGRLDATN TVRLYQTDLP VAADTKLSTV
VKTPAAGATH LSVAVAFTDA PNTFTTLDLG STSGTGWERR VLDLSAYAGK TIAQIGLRAS
ASAVVPSYDI KVGQLAVYDG AVDTAAAPTG LTVLGSTDVS ATRKTLRLDW TPSASGSVHH
YDVFRRNPDG SRTHLGATPN DVYFVPQLDR VGAETSTVIE VEAVSTEYGR STAATTTVTW
SGTPPTTTNL ALDRPATASG QCTATEGPAK AVNGSVSGGN SDKWCTTTAN QWLEVDLGSV
RALDRFVVAH AAAGGESASW NTRDFTIDVR SAASDPWTTA VTVTDNTAEL TTHPVSVSAR
YVRLVVDTPT QDGDPATRIY EFEAWGE