Gene Information |
Locus tag | Sare_1821 |
Symbol | |
ID | 5706467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2098714 |
End bp | 2099919 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641271323 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_001536698 |
Protein GI | 159037445 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | |
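The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the unspliced CDS is a stop codon (the gene sequence in this record ends in TGA).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2098714, 2099919)
residues = protein_length_aa(length)
print(length, residues)
```

Under these assumptions the script prints `1206 401`, which agrees with the Gene Length and Protein Length fields.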
| |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0000914518 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCGACG ATCTGGACCC CGAGTTCGAT GCGGACCGGG GAGAGAAGGG CCGGCATCGG CGCCGCTACG TGCGCAGGCG CCAGCGTCAG CGCCGGAGCG GTTCGGGTGG TGGGCGTGGC AAGACCGCCC TGGCTCTGTT GCTCACCCTG GTTTTGCTCG GCGGCCTCGG CGGTGGTGCC TTCTACGGCT TCGAACGGAT CCAGAACTTC CTCGGCACGC CGGATTACGA CGGTTCTGGC ACCGAGGCGG TGACGGTCGA GATCATGGAA GGGGCATTGA TCGCCGACAT GGCGGTCACG CTCTACGAGG CCGGGGTCGT CAAGAGTACC AAGGCTTTCA TCGAGGCCGC GGAGGATGAC GGCCGCAGCA AGACCATCCA GCCAGGCCAG TACCAGTTGC GCAGGCAGAT GAGTGGCGCC AGCGCCGTGG CCGCGCTGCT GGACCTGACG AACCGGGTCG TCAACGGGAT CACCATTCCC GAGGGGCGCA CCGCGAAGAG CGTCTACAAG CTCCTCTCCG AGAAGACCAA CGTCCCGGTC ACGGAGTTCG AGGCGGCGGC GAAGGACCCG ATCGCGCTCG GTGTCCCGGA ATGGTGGTTC ACGCGCACGG ACGACCGGAA GGTCGAGCCG TCGATCGAGG GATTCCTCTT CCCCGACACC TACGAGTTTC CCCCGAAGTC AACGGCTGAG TCGATCCTTG GGCTGATGGT GGAGCGGTTC CTCACCGTCG CCGAGGAGCT GCGGTTCGTC GACCGGGTGC AGAACGAACG GCAGATCGCG CCGTACGAGG CGCTGATCGT CGCGTCGCTC GCCCAAGCTG AGGCGGGTGT TCCGGGGGAT CTCGGCAAGG TCGCCCGGGT CGCCTACAAC CGGGTCTACG GCGACTTCCC GTGCAACTGC CTGGAGATGG ACGTCACGAT CAACTACCAC CTGGAGTTGA CCGGCCAGAA GACCAAGACC TCGGCCGAGA TGACGGAGGA CGAGCTGCTC GACACAAAGA GCCCGTACAG CCGCAAGCTT CGGGGTCTGA TTCCCACACC GATCAACAAT CCGGGTCAGT TGGCCCTGGA GGGCGCCATG GACCCGCCGC CGGGTAAGTG GCTGTACTTC GTTGCGATCA ACAAGGAGGG ACAGTCCGCC TTCGCGGAGA CCTACGAGGA GCAGCTGCGC AACGAGGCAA AGGCGAGGGA GGCGGGTGTC ATCTGA
Protein sequence | MIDDLDPEFD ADRGEKGRHR RRYVRRRQRQ RRSGSGGGRG KTALALLLTL VLLGGLGGGA FYGFERIQNF LGTPDYDGSG TEAVTVEIME GALIADMAVT LYEAGVVKST KAFIEAAEDD GRSKTIQPGQ YQLRRQMSGA SAVAALLDLT NRVVNGITIP EGRTAKSVYK LLSEKTNVPV TEFEAAAKDP IALGVPEWWF TRTDDRKVEP SIEGFLFPDT YEFPPKSTAE SILGLMVERF LTVAEELRFV DRVQNERQIA PYEALIVASL AQAEAGVPGD LGKVARVAYN RVYGDFPCNC LEMDVTINYH LELTGQKTKT SAEMTEDELL DTKSPYSRKL RGLIPTPINN PGQLALEGAM DPPPGKWLYF VAINKEGQSA FAETYEEQLR NEAKAREAGV I