Gene Information |
Locus tag | Sare_1032 |
Symbol | |
ID | 5708263 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1155498 |
End bp | 1156694 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641270548 |
Product | cell wall anchor domain-containing protein |
Protein accession | YP_001535932 |
Protein GI | 159036679 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
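The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that does not contribute a residue; both assumptions agree with the values in this record. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1155498
end_bp = 1156694


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                           # 1197, matches "Gene Length"
print(expected_protein_length(length_bp))  # 398, matches "Protein Length"
```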
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0114324 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00525581 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
Sequence |
Gene sequence | ATGGGTGCTG GCGCCGTCGG TTTTCTCGTT GCGGGTTCGT TCGCCGCGCC CGCTTACGCG GTTGATGGCA CGGACCTCTC CGTCGAGCTG TCGGGCACGA CTATCGCTGC CGGCTCGACC GAAAAGGTCG TCGAAATCGA CCTGGAAAAC GTCGGAAAGA CCACACCGGA GTCGGTCGAC ATTATCTTCA CCGTCGGCGA TCTCGTCGGT GCGTTCCAGT TCGCTGACAT CTGCGACGAT GATCCTGCAG ATGGCACCGT CCACTGCTCC GTCGATAAGG CCTTCATTCC GGAGCCGGGT GGCACAAGCG ATTTGTCCTA CTTAGTGACC CGGGCCGACG ACTCCGCCGA CGCCTCGGGC GAGCTGACTG TGACGGTCCT GGTCGACGGC GACGACAACA CCGCCAACAA CACTGACACC GCCGAGGTGG TGATCGGGGA GCACGGTGTC GACCTGGGTG TCGTAGCCGA GGACGTGCGC ACGCCCATCG ACGCCGACGC GAGCCTCGAG TCTGGGGCAC CTGTCTACGT CGACGGGTCG ATCACCCCGG GCGGTTCGAC CGCCGTCGTG GCCACCGTGT CCAATCAGGG CGATATGACG GCGGACGGGG TCAGGGTCTC TGTCACCCTA CCGGAGCAGG CCTCCTTCAC CGAACCTGAG CAGGGTTGCG AGTACAGTGC CGACAACCGC ACGGTCACCT GTGACTACTC CGAAATTATG CTGATCCCGG CCGACCATGA CACACAGGAC GGCGACGCAA CCTCCGGTGG TAGCTTCTGG TTCCCCGCCA AGATCGATGC TGGTGTTGCG AAGAGCGGTG CCCTGACCGG CGGTAGCGTC ACTGTCGCCG CCCTCGCCGC AGTTCCCTAC GGCCCCAAGC AGAACATCGC CGCACCTACC ACACTGCCGG AGAACGTGGA ACTGCTCGAC GCCGCGGACA TCGACAAGGT TGTTGATGTT GACGAGTCCG ACAACACCGA CACGTACTCG ATCTTCGTGT CGGTCAACGG CGGTTCCGGC GGTGGGTCCA CCGGCGGTGA CGGCGACGGT GGCTCACTTC CGGTGACTGG CGCGCAGGCT GGCCTGTTCG GTGGCATCGG CGCCTTCGTG ATCGTCCTCG GTGCCGTGCT CTTCCTGGTC GCTCGCCGTC GCCGCGTCGT CCTGATGACC CCGGGTGACG AAAAGCCGAC CGCGTAA
Protein sequence | MGAGAVGFLV AGSFAAPAYA VDGTDLSVEL SGTTIAAGST EKVVEIDLEN VGKTTPESVD IIFTVGDLVG AFQFADICDD DPADGTVHCS VDKAFIPEPG GTSDLSYLVT RADDSADASG ELTVTVLVDG DDNTANNTDT AEVVIGEHGV DLGVVAEDVR TPIDADASLE SGAPVYVDGS ITPGGSTAVV ATVSNQGDMT ADGVRVSVTL PEQASFTEPE QGCEYSADNR TVTCDYSEIM LIPADHDTQD GDATSGGSFW FPAKIDAGVA KSGALTGGSV TVAALAAVPY GPKQNIAAPT TLPENVELLD AADIDKVVDV DESDNTDTYS IFVSVNGGSG GGSTGGDGDG GSLPVTGAQA GLFGGIGAFV IVLGAVLFLV ARRRRVVLMT PGDEKPTA
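The sequence fields are printed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. A minimal sketch, using hypothetical helper names, that strips the spacing and computes a GC fraction; it is demonstrated only on the first three blocks of the gene sequence, not on the full record.

```python
def clean_sequence(field: str) -> str:
    """Remove all whitespace from a blocked sequence field and upper-case it."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence field above.
prefix = clean_sequence("ATGGGTGCTG GCGCCGTCGG TTTTCTCGTT")
print(len(prefix))          # 30
print(gc_fraction(prefix))  # GC fraction of this 30-base prefix only
```

Pasting the full "Gene sequence" value into `clean_sequence` should give a string whose length can be compared against the "Gene Length" field.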