Gene Information |
Locus tag | Sare_1078 |
Symbol | |
ID | 5704346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 1209810 |
End bp | 1211087 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641270593 |
Product | hypothetical protein |
Protein accession | YP_001535977 |
Protein GI | 159036724 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3934] Endo-beta-mannanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000321073 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | TTGATCATCA ACGACCGTAC GCCGAACACC CGAAGGCTCC GCCTCGGCGC CAACTACGTA CCGTCCGACG GATGGTTCTA CAGTTGGCTC GACTTCTCGG CTGACGCGGC ACGCCGCGAC TTCGAGGACC TGGCGAGCCT CGGCCTGGAC CACGTACGGG TCTTCCCGAT CTGGCCGTGG ATCCAGCCCA ACCGCGCGCT ACTGCGGCAG CGACCGATCG ACGACCTGCT CTCGTTGATC GACGTAGCGG CGGAATTCGA CCTCGGCGTT GCGGTCGACC TGCTCCAGGG GCACCTGTCC AGCTTCGACT TTCTGCCCTC CTGGGTGCTC ACCTGGCATC AGAGCAGCGT GTTCACCGAC CGGACGGCCC GTGACGGCAT CCGGGAGTAC GTCCGGCGGC TCAGCACCGA GGTCGGGACG CGCCCCAACG TCTTCGCCAT CACCCTCGGC AACGAGGTCA ACAACCTCTG GCCCACCAAC GCAACGACGC CCGCCGCCTC CCGCGAGTGG GCGACGGAAC TACTCGACGT GGTCCGCGCC GCGGCGCCCT CCGCGCTCCC CCTGCACTCC GTCTTCGACG ATGCGTGGTA CGCCCCCGAC CACCCCTTCC ATCCCGCCGA CGCGGTCGAC CTCGGCGATC TGACCACCGT GCACTCCTGG GTGTTCAACG GCACCTCACG CATCGACGGG CCACTCGGGC CGGCCACCAC CTCACACGCC GACTACCTGC TGGAACTCGC CGCTGCGAGT GCCTCCGACG CGGCCCGTCC GGTGTGGCTG CAGGAGGTCG GGGTGCCTCG GCCGGACGTC CCGCCGGAGC ATGCCGGCGA GTTCGTCGCA CGCACGCTGG ACACGGTGGG CCGCAACCCG GCATTGTGGG GCGTCACCTG GTGGTCCTCA CACGACATCG ATCGCCGCCT GACCGACTTC CCGGACCGGG AGTACGACCT GGGCCTGTTC ACCGTCGACC ACCGACCGAA GCCGGCCGCG CTCGCCCTGG CCGAGTTCGC CCGCGGCACG GCGGTCGAGC CGGCGGCCCC GGCTCGGCCG GCGTTGGTGT CTCCGATCAA CCCGCTGCAG GAGCCCGAAC GGCGGGCGGA CGTCGCTCCG GGTAGCGAGT TCCACCTGGA GTGGGTGCGG GCCCGTCAGA CCGAACCCAC CGCGATCGTC ACCCCGGACC GGGCTACCGA CGCCGGCTAC CTCGCCGCCC GCGGCCTCGG CCCGGTCCGC GCACCGGGTG CCAACCGGCC GATGGCGGTA CAGCACCAGG GCTCCTGA
|
Protein sequence | MIINDRTPNT RRLRLGANYV PSDGWFYSWL DFSADAARRD FEDLASLGLD HVRVFPIWPW IQPNRALLRQ RPIDDLLSLI DVAAEFDLGV AVDLLQGHLS SFDFLPSWVL TWHQSSVFTD RTARDGIREY VRRLSTEVGT RPNVFAITLG NEVNNLWPTN ATTPAASREW ATELLDVVRA AAPSALPLHS VFDDAWYAPD HPFHPADAVD LGDLTTVHSW VFNGTSRIDG PLGPATTSHA DYLLELAAAS ASDAARPVWL QEVGVPRPDV PPEHAGEFVA RTLDTVGRNP ALWGVTWWSS HDIDRRLTDF PDREYDLGLF TVDHRPKPAA LALAEFARGT AVEPAAPARP ALVSPINPLQ EPERRADVAP GSEFHLEWVR ARQTEPTAIV TPDRATDAGY LAARGLGPVR APGANRPMAV QHQGS
|
| |
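The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in Protein Length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
# Minimal consistency check for the record's coordinate and length fields.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the last codon is a stop codon excluded from the protein.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Sare_1078.
start_bp, end_bp = 1209810, 1211087

length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)

print(f"Gene length: {length_bp} bp")
print(f"Protein length: {length_aa} aa")
```

Under these assumptions the computed values agree with the Gene Length (1278 bp) and Protein Length (425 aa) fields listed above.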