Gene Information |
Locus tag | Sare_4749 |
Symbol | |
ID | 5705340 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5373848 |
End bp | 5374918 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641274147 |
Product | hypothetical protein |
Protein accession | YP_001539493 |
Protein GI | 159040240 |
COG category | [S] Function unknown |
COG ID | [COG5282] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR03624] putative hydrolase |
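
The coordinate and length fields in this table can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon. The variable names are illustrative.

```python
# Values copied from the Gene Information table above.
start_bp = 5373848
end_bp = 5374918
gene_length_bp = 1071
protein_length_aa = 356

# Assumption: coordinates are 1-based and inclusive,
# so the span covered by the gene is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with a stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print("span matches Gene Length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions, the three checks agree with the values listed in the record.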
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.794948 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000155251 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCACAGT TCGTGGACTG GGATCTGGCC GCAGCGACCG CGGGGACGCT CGGCAAGACG GGCCCGCGCG TGTCGTACGC CGAAGCCGCG GCGGTGGTCA GCGACCTGAG ACGGTTGACC GACGAAGCGG CCGGGCACGT CGGCGACTTC ACCGGGCTCC GGTCGCAGGT GTCACACCCG CCGGTGCGGG TGGTGGATCG CCGGGACTGG GCGGCGACCA ACGTCGCCGG TCTGCGCGAG GTCATCGGTC CCCTGATCGG TCGCCTCACC GGCGACAAGC AACCCGGCGC GGTGACCGAG GCGGTCGGCT CGCGGATCAC CGGGGTGCAG GCCGGCACGG TGCTGGCGTA CCTGTCCGGC CGGGTCCTCG GCCAGTTCGA GGTGTTCTCC GGCGAACCAG GCCAGCTGCT GCTCGTCGCG CCGAACATCG TCGAGGTGGA GCGGAAGCTG GCGGCGGACC CCCGCGACTT CCGGCTCTGG GTCTGCTTGC ACGAGGTCAC CCACCGCACC CAGTTCACCG CGGTGCCGTG GCTGCGGGCG TACTTCCTCG GTGAGGTGCA GGCGTTCGTC GACGCGTCCA ACAGCGGCGC CGACCCCTTG GTGGAGCGGC TGCGTCGCGG CGTCGCCCTC CTTGCCGACG CGGTGCGGGA ACCGGAGAGT CGCACCAGCG TCCTGGACAT CGTCCAGACC CCGGCCCAGA AGGCGGTGCT GAACCGGCTC ACCGCGCTGA TGACCCTGCT CGAGGGGCAC GCCGAGTTCG TGATGGATGG CGTGGGGCCG CAGGTGATCC CGAGTGTGGA GCGGATCCGG GCGTCGTTCA ACCGCCGTCG GGAGTCGGGT AACCCCCTGG AGAAGACGGT CCGTCGGCTG CTCGGGGTGG AGGTCAAGCT GCGCCAGTAC GCCGAGGGGC GGACGTTCGT GCACGGTGTG GTCGACCGGG TCGGCATGGA GGGCTTCAAC CGGGTCTTTG CCTCCCCGCT GACCCTGCCC CGGCTCGAGG AACTCGGCGA TCCGGACGCC TGGGTGGCCC GGGTGCACGG GCCGGCCGGT CCGCTTCCGG CCGTCGGCTG A
|
Protein sequence | MAQFVDWDLA AATAGTLGKT GPRVSYAEAA AVVSDLRRLT DEAAGHVGDF TGLRSQVSHP PVRVVDRRDW AATNVAGLRE VIGPLIGRLT GDKQPGAVTE AVGSRITGVQ AGTVLAYLSG RVLGQFEVFS GEPGQLLLVA PNIVEVERKL AADPRDFRLW VCLHEVTHRT QFTAVPWLRA YFLGEVQAFV DASNSGADPL VERLRRGVAL LADAVREPES RTSVLDIVQT PAQKAVLNRL TALMTLLEGH AEFVMDGVGP QVIPSVERIR ASFNRRRESG NPLEKTVRRL LGVEVKLRQY AEGRTFVHGV VDRVGMEGFN RVFASPLTLP RLEELGDPDA WVARVHGPAG PLPAVG
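
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch, not part of the original record, of how the blocks could be joined, the GC fraction computed, and the gene sequence translated for comparison with the protein sequence. The function names are hypothetical. The codon table uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. Alternative start codons are not handled, which is adequate here because this gene starts with ATG.

```python
# Amino acid assignments for NCBI translation table 11, in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a DNA sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGGCACAGT TCGTGGACTG GGATCTGGCC"
print(translate(first_blocks))
print(round(gc_fraction(first_blocks), 2))
```

Passing the full Gene sequence field to translate() and comparing the result with clean() of the Protein sequence field would check the two fields against each other.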