Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4760 |
Symbol | |
ID | 5707477 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5388673 |
End bp | 5389821 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641274158 |
Product | hypothetical protein |
Protein accession | YP_001539504 |
Protein GI | 159040251 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.124071 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00173955 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGTGCCT ATCCATTCAC GTTTGTGACC CGTGTGCGTC GTCGGCTCCT GCCGGCGGTC GGCGCACTGC TCGCGGTTCT GGTCGTCGTC GGCTGCGGCG AGGCGGGGTC CGACGCGGTC TGGCAGCCCG GCGTCGGCGG TGACCCAGCC TCCTCCGCCT CGCCCTCCCC CTCCACGGCT GCGGCGGTAT CCCTCACCGC GACCGGCGAC ATCATCCTGG GTAACGCCCC GGACAGGTTG CCACCCGACG ACGGCAAGGG CTTCTTCGAC GACGTGCGCC AGGCTCTCGC CGCCGACCTG GCAATGGGCA ACTTGGAGGA GCCGCTCACC GTCGACACCG GTACCGGCAA GTGCGGGGCC GGCTCGACCA ACTGCTTCCA GTTCCGGGCG CCACCGGAGT ACGCGGCACA CCTCCGCGAC GGCGGCTTCG ACCTGCTCAA CCTGGCAAAC AATCATGGGA ACGACTTTGG GGCCAAAGGC TTCCAGAACA CCCAGGCCGC GCTTGAGCAG CACGAACTGG CACACACCGG CGCGCCGGAC CAGATCACCG TCGTGGAGGT GCAGGGCGTC CAAGTGGCGG TGGTGGGCTT CTCGTCCTAC GCGTGGTCGA ACCCGCTGAC CGACATCCCA GCGGCGACGA AGGTCGTCAC CAAGGCGGCC GAGACGGCGG ACCTGGTGGT CGTGCAGGTG CACATGGGCG CGGAGGGTGC CGACAAGACC CGGGTCAAAC CCGGCACCGA GTTGTACCTG GGTGAGAACC GGGGTGATCC GATCCGGTTC GCTAAGGCCA TGGTCGACGC CGGCGCGGAC CTGATCGTCG GGCACGGGCC ACACGTCCTC CGCGGCATGG AGTTCTACCA GGGCCGGCTG ATCGCGTACA GCCTGGGCAA CTTCGCCGGT GGCGGCAACA TGCTCAACCG CAGCGGCCGG CTCGGCTGGG GCGGCGTACT CAAGGTCTCG CTGAAGCCGG ACGGCACCTG GGTCGACGGG TCGTTCGCCT CGACGTACAT GAACGAGTTG GGTCTGCCGA CGATGGACCC GGACGACCGG GGCCTGGGGC TGGTGCGTGA GCTCAGCGGT GCGGATTTCC CCAAGACCGG TGCGACCTTC GACGACTCCG GGACGATCAG CCCACCCCGC GCGGGCTGA
|
Protein sequence | MRAYPFTFVT RVRRRLLPAV GALLAVLVVV GCGEAGSDAV WQPGVGGDPA SSASPSPSTA AAVSLTATGD IILGNAPDRL PPDDGKGFFD DVRQALAADL AMGNLEEPLT VDTGTGKCGA GSTNCFQFRA PPEYAAHLRD GGFDLLNLAN NHGNDFGAKG FQNTQAALEQ HELAHTGAPD QITVVEVQGV QVAVVGFSSY AWSNPLTDIP AATKVVTKAA ETADLVVVQV HMGAEGADKT RVKPGTELYL GENRGDPIRF AKAMVDAGAD LIVGHGPHVL RGMEFYQGRL IAYSLGNFAG GGNMLNRSGR LGWGGVLKVS LKPDGTWVDG SFASTYMNEL GLPTMDPDDR GLGLVRELSG ADFPKTGATF DDSGTISPPR AG
|
| |