Gene Information |
Locus tag | Sare_3949 |
Symbol | |
ID | 5708220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4492354 |
End bp | 4493505 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641273374 |
Product | hypothetical protein |
Protein accession | YP_001538730 |
Protein GI | 159039477 |
COG category | [R] General function prediction only |
COG ID | [COG3211] Predicted phosphatase |
TIGRFAM ID | |
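The length fields in this table can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this, but it is the only reading that reproduces the listed Gene Length). The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4492354
end_bp = 4493505

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length_bp = end_bp - start_bp + 1

# The CDS ends with a stop codon, which is not translated into an amino acid.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```

Under that assumption the result matches the Gene Length (1152 bp) and Protein Length (383 aa) rows. The strand does not affect the length, only the direction in which the sequence is read.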
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.381536 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCGTC GTACTGTGTT GCGGGCCGGC GTCGCCGGTG GTGCCGCGGC CTTTTCCGGC AGCCTGTGGG CTGGTGCCGC GCTGGCCGAC CCTGCCCAGC CCGGCGACAG CCCCTACGGA TCCCTGCGGC CGGCCGACGC GCTGGGCATT CAGCTACCCG CAGGATTCAC CAGCCGGATC GTCGCCCGTT CCCGGCAGGC CGTCGCCGGC ACCGCGTACA ACTGGCACGA TGCCCCCGAC GGTGGGGCCT GCTTCGCCGC TGGTGCCGGA TGGGTCTACG TATCCAACTC GGAGATCAGT CCGGGTGGCG GAGCGTCGGC GATACGCTTC GCCGCCGACG GCAGCATCGC CTCGGCATAC CCGATCCTGT CGGGTACCCG GCGGAACTGC GCGGGTGGGG CGACGCCCTG GGGGACGTGG CTCTCCTGCG AGGAGGTCAG CCGCGGCTAC GTGTACGAGA CCTACCCACT GGGCGGTGCG AACGCGGTGC GGCGGGCCGC GATGGGTCGG TTCAACCACG AGGCCGCCGC CTGCGACCCC GTCCGTCGGG TGATCTACCT CACCGAGGAC GAGAGCACGG GCTGCTTCTA CCGATTTCGC CCGACCACCT GGGGTGACCT CTCGGACGGG ATCCTGGAGG TGCTGGTGGC TGGCACCGCC ACCAACGGCC CGGTGACCTG GGCGCAGGTG CCCGACCCGG ACGGCTGGCC GACGCACACC CGTCACCAGG TCTCCGGGGC GAAGCGGTTC GACGGTGGCG AGGGCTGCTG GTACGCCGAC GGGACCTGCT GGTTCACCAC GAAGGGGGAC AACCGGGTCT GGGCGTACGA CGCGGTCCAC CAGCGGGTCG ATCTGGCGTA CGACGACTCA CTGGTCTCCG GCACCGCTCC GTTGACCGGG GTGGACAACA TCACCGGGAC CCCGGGCGGT GACCTCTACA TCGCCGAGGA CGGCGGTAAC ATGGAGATCA ACATCATTAC GCCGGACGAG GTCGTCGCCC CGTTCCTGCG GATCATCGGC CAGTCCTCGT CGGAGATCTG CGGCCCGGCG TTCTCGCCCG ACGGCACACG GTTCTACTTC TCGTCGCAGC GCGGCACCTC CGGATCGTCC TCCGGCGGAA TCACCTACGA GGTGCGCGGC CCGTTCCGCT GA
Protein sequence | MDRRTVLRAG VAGGAAAFSG SLWAGAALAD PAQPGDSPYG SLRPADALGI QLPAGFTSRI VARSRQAVAG TAYNWHDAPD GGACFAAGAG WVYVSNSEIS PGGGASAIRF AADGSIASAY PILSGTRRNC AGGATPWGTW LSCEEVSRGY VYETYPLGGA NAVRRAAMGR FNHEAAACDP VRRVIYLTED ESTGCFYRFR PTTWGDLSDG ILEVLVAGTA TNGPVTWAQV PDPDGWPTHT RHQVSGAKRF DGGEGCWYAD GTCWFTTKGD NRVWAYDAVH QRVDLAYDDS LVSGTAPLTG VDNITGTPGG DLYIAEDGGN MEINIITPDE VVAPFLRIIG QSSSEICGPA FSPDGTRFYF SSQRGTSGSS SGGITYEVRG PFR
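The sequences above are printed in space-separated blocks of 10, so they need to be joined before any analysis. The sketch below shows one way to do that and to run two simple checks: the GC fraction, and whether a string looks like a complete coding sequence. The helper names (`clean_sequence`, `gc_fraction`, `looks_like_cds`) are hypothetical, and the check only accepts ATG as a start codon, although translation table 11 also permits alternative starts.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used to group the sequence into 10-base blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_cds(seq: str) -> bool:
    """Illustrative sanity check: whole codons, ATG start, stop codon at the end."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# First three 10-base blocks of the gene sequence, for illustration only.
prefix = clean_sequence("ATGGATCGTC GTACTGTGTT GCGGGCCGGC")
print(len(prefix), round(gc_fraction(prefix), 3))
```

Passing the full Gene sequence field through `clean_sequence` and `gc_fraction` gives a value that can be compared with the GC content row. The prefix used here covers only 30 bases, so its GC fraction is not expected to match the whole-gene figure.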
| |