Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4145 |
Symbol | |
ID | 5708302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4709422 |
End bp | 4710348 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641273573 |
Product | LmbE family protein |
Protein accession | YP_001538926 |
Protein GI | 159039673 |
COG category | [S] Function unknown |
COG ID | [COG2120] Uncharacterized proteins, LmbE homologs |
TIGRFAM ID | [TIGR03445] 1D-myo-inosityl-2-acetamido-2-deoxy-alpha-D-glucopyranoside deacetylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0240973 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGACG TGACGACACT GCCCGATCGC CGGCTCCTCC TGGTCCACGC GCACCCCGAT GACGAAGCTA TCGGCACCGG CGCGACGATG GCCCACTACG CGGCCACCGG CGGACACGTC ACGCTGGTCA CGTGCACGTT GGGTGAGGAG GGCGAGGTGC ACGTGCCGGA GCTGGCGCAG CTCGCCGCGG CCGGGGCCGA CCAACTCGGC GGCTACCGGA TCGGGGAGCT GGCGGCGGCC TGCCGCTCAC TCGGGGTCAC CGATCACCGG TTTCTCGGCG GCGCCGGCCG CTACCGTGAC TCCGGCATGA TGGGTCTTGC CACCAACGAG CACCCACGCG CCTTCTGGCG GGCAGATCTC GACGAGGCCG CCGCGCATCT GGTGGAGCTC ATGCGTGAGG TCCGGCCCCA GGTCATGGTC ACCTATGACG ACAACGGGTT CTACGGCCAC CCGGACCACA TCCAGGCCCA CCGGGTGGCG ATGCGGGCGT ACGAGTTGGC TGCCGTCGAG GGGTTCGCTC CGGCGAAGGT CTACTGGACG GCGATGCCGC AGAGTGTGCT GGAGGCCGGG ATGGTTCACT TCGCCGGCTC GTCGGACAAC CCTTTCGCGG GCATCGAGGA GGCGGTGGAG TTGCCGTTCT GCACTCCGGA CGACCGTATC GCCGCCCGGA TCGATGCCAC CGGGCAGCAC GCCGCGAAGG AGGCGGCGAT GCGCGCGCAC GCCACCCAGA TCCCGGACAA CTCCTGGCTG TACTCGATCG CCGCGAACTT CGGCAGCGAG TTCATGGGCG TGGAGTACTA CACGCTCGCC GTTGGCGCCA AGGGCCCCGG CGCTGGCCCG TACGGGTGGG AGGACGATCT GTTCGCCGGG CTTCCGGTCG CCGCCGGCCC GGATCGGACT CGGGCCGGCG TGACCGGTCC ATGGTGA
|
Protein sequence | MTDVTTLPDR RLLLVHAHPD DEAIGTGATM AHYAATGGHV TLVTCTLGEE GEVHVPELAQ LAAAGADQLG GYRIGELAAA CRSLGVTDHR FLGGAGRYRD SGMMGLATNE HPRAFWRADL DEAAAHLVEL MREVRPQVMV TYDDNGFYGH PDHIQAHRVA MRAYELAAVE GFAPAKVYWT AMPQSVLEAG MVHFAGSSDN PFAGIEEAVE LPFCTPDDRI AARIDATGQH AAKEAAMRAH ATQIPDNSWL YSIAANFGSE FMGVEYYTLA VGAKGPGAGP YGWEDDLFAG LPVAAGPDRT RAGVTGPW
|
| |