Gene Information |
Locus tag | Sare_1083 |
Symbol | |
ID | 5704074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1215649 |
End bp | 1218072 |
Gene Length | 2424 bp |
Protein Length | 807 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641270598 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001535982 |
Protein GI | 159036729 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4724] Endo-beta-N-acetylglucosaminidase D |
TIGRFAM ID | |
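
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1215649, 1218072)
print(length, protein_length(length))  # prints "2424 807"
```

Both results agree with the Gene Length and Protein Length fields listed in the record.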
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00148049 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGACGCC TGCTCCTGTC CTTGCTGGCC GGCGCCACAG TCGTGGCCGG CAGCACCCTC ACCGCGGCTC CCACCGTGGC AGCCGCCACC GAAGCGGTCG ACGGCAGCCA GCCGTACGCC TCGTACTGGT TCCCGAACGA ACTCCTCGAC TGGGATCCGG AGACGGATCC CGACGCCCGC TTCAACCGGT CCATGGTTCC GCTCCAGCCC CGGGCCACCG ATCCCGCGCT GAAGGCCAAC CCGAACGCGC GAGCGGGCGA GGGCCGAGTG GCGTCGCTGG TGTCGTTCGC ACCGACGTCC GACAACCCGT CGCAGGGCTC GCGCGACGAG GACTACTACG CCTTTGGCCA CTGGCAGTAC ATCGACACCC TGGTGTTCTG GGGCGGATCG GCCGTCGAGG GCCTGATTCT CGCGCCGAAC CCGACCGTCA TCGACGCCGC GCACCGCAAC GGCGTCAAGG TGTACGGCAC GGTCTTCTTC CCGCCGGTCG CCTACGGCGG CAAGATCGAC TGGGTGCACG ACTTCGTGCG TAAGTCGGGC TCGACCTATC CGGTCGCCGA CAAGCTCGCG GAGGTGGCGC AGTACTACGG CTTCGAAGGG TGGTTCATCA ACCAGGAGAC GACGGGCGGC AACACCGCCC TCGCCACCGA ACTCCGCAAC CTGATGACCT ACGGCCGGAA CAAGGGCGTG GAGTTCATGT GGTACGACGC GATGACCGAA TCCGGCGCCA TTACCTGGCA GAACGCGCTC ACCACCGCCA ACGACTCGTT CCTGGGCGGC CCGACGCCGG TGTCGGACTC GATGTTCCTC AACTTCTGGT GGTCCACCGG CGGCCTGGCC TCGTCCCGGG ATCGCGCCGA GTCACTCGGG CGGAGCGGGT ACGACCTGTA CTCCGGCATC GACACCGAGG CCAACGGCTA CCAGACCAAC GTCAACTGGG ACGCCCTGTT CCCCGCCGGT GGGTCGCACG TGACCTCCCT GGGCATCTAC CGCCCGGAGT GGACCTGGAC GTCGTCGAGC GGGCCGGCCG ACTTCCGGGC ACGCGACTCC CGCTACTGGG TCGGCGCGAA CGGCGACCCG TCGAACACCA CGACCTCCTC GCCTTGGAAA GGGCTCGCCA CCTACGTCGC CGAGTCCACG CCGGTGACCC AGAAGCCGTT TGTGACCAGC TTCAACGCCG GGCAGGGTTC GACATACCAC GTCGCCGGGA ACCAGGTGCG CACCGGCGGC TGGAACAACC TGTCGATGCA GGACGTGCCG CCGACCTACC AGTGGGTGGT CTCCTCGACC GGCACGAAGC TGACGCCGTC GCTCGACTTC ACCGATGCCT ACGAGGGCGG CTCGACGCTG CGGCTCAACG GCAGGCTGGA CGCGACGAAC ACCGTGCGCC TCTACCAGAC CGACCTGCCG GTCGCCGCGG ACACCAAGCT GTCGACGGTC GTCAAGACCC CGGCCGCCGG TGCGACCCAC CTGAGCGTGG CGGTGGCCTT CACCGACGCC CCGAACACCT TCACCACTCT CGACCTCGGG TCGACCTCCG GCACCGGTTG GGAGCGTCGC GTTCTCGACC TGTCCGCGTA CGCCGGTAAG ACCATCGCCC AGATCGGGCT GCGGGCGTCG GCGTCGGCCG TCGTCCCGTC CTACGACATC AAGGTTGGCC AGCTCGCCGT GTACGACGGG GCCGTGGACA CCGCTGCCGC GCCGACCGGT CTGACCGTCC TGGGCAGCAC CGACGTCTCG GCGACCCGCA AGACACTGAG GTTGGACTGG ACCCCGTCGG CCAGCGGATC GGTGCACCAC 
TACGACGTGT TCCGCCGCAA CCCGGACGGC AGCCGTACCC ACCTGGGCGC CACGCCGAAC GACGTGTACT TCGTGCCGCA GCTCGACCGG GTCGGCGCCG AGACCAGCAC GGTCATCGAG GTCGAGGCGG TGTCGACCGA GTACGGCCGC TCCACCGCGG CGACCACGAC CGTCACCTGG TCCGGCACGC CGCCGACCAC GACCAACCTG GCGCTCGACC GGCCGGCGAC GGCCTCCGGG CAGTGCACCG CTACCGAAGG ACCCGCCAAA GCCGTCAACG GCAGTGTCTC CGGCGGGAAC AGCGACAAGT GGTGCACGAC GACCGCCAAC CAGTGGCTCG AGGTTGACCT GGGCTCGGTC CGTGCCCTCG ACCGGTTCGT CGTCGCGCAC GCCGCCGCGG GCGGCGAGTC CGCCTCGTGG AACACCCGCG ACTTCACCAT CGACGTACGC TCCGCGGCCT CGGACCCGTG GACCACGGCC GTCACCGTCA CCGACAACAC CGCCGAGTTG ACAACACACC CAGTGAGCGT CAGCGCACGG TACGTGCGGT TGGTTGTCGA CACCCCGACC CAGGACGGCG ACCCCGCCAC CCGCATATAC GAGTTCGAGG CCTGGGGCGA GTAG
|
Protein sequence | MRRLLLSLLA GATVVAGSTL TAAPTVAAAT EAVDGSQPYA SYWFPNELLD WDPETDPDAR FNRSMVPLQP RATDPALKAN PNARAGEGRV ASLVSFAPTS DNPSQGSRDE DYYAFGHWQY IDTLVFWGGS AVEGLILAPN PTVIDAAHRN GVKVYGTVFF PPVAYGGKID WVHDFVRKSG STYPVADKLA EVAQYYGFEG WFINQETTGG NTALATELRN LMTYGRNKGV EFMWYDAMTE SGAITWQNAL TTANDSFLGG PTPVSDSMFL NFWWSTGGLA SSRDRAESLG RSGYDLYSGI DTEANGYQTN VNWDALFPAG GSHVTSLGIY RPEWTWTSSS GPADFRARDS RYWVGANGDP SNTTTSSPWK GLATYVAEST PVTQKPFVTS FNAGQGSTYH VAGNQVRTGG WNNLSMQDVP PTYQWVVSST GTKLTPSLDF TDAYEGGSTL RLNGRLDATN TVRLYQTDLP VAADTKLSTV VKTPAAGATH LSVAVAFTDA PNTFTTLDLG STSGTGWERR VLDLSAYAGK TIAQIGLRAS ASAVVPSYDI KVGQLAVYDG AVDTAAAPTG LTVLGSTDVS ATRKTLRLDW TPSASGSVHH YDVFRRNPDG SRTHLGATPN DVYFVPQLDR VGAETSTVIE VEAVSTEYGR STAATTTVTW SGTPPTTTNL ALDRPATASG QCTATEGPAK AVNGSVSGGN SDKWCTTTAN QWLEVDLGSV RALDRFVVAH AAAGGESASW NTRDFTIDVR SAASDPWTTA VTVTDNTAEL TTHPVSVSAR YVRLVVDTPT QDGDPATRIY EFEAWGE
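
The protein sequence can be derived from the gene sequence by codon translation, and the GC content field is a simple base count. The following is a minimal sketch using only the Python standard library. It assumes the sequence is supplied as in this record (space-separated blocks of 10 bases) and uses the standard codon assignments, which translation table 11 shares for internal codons; `clean`, `translate`, and `gc_percent` are hypothetical helper names.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace between the 10-base blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence in this record.
print(translate("ATGCGACGCC TGCTCCTGTC CTTGCTGGCC"))  # prints "MRRLLLSLLA"
```

The printed residues match the first block of the protein sequence above. Passing the full gene sequence to `gc_percent` gives the value that the GC content field reports in rounded form.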