Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2257 |
Symbol | |
ID | 5706743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2595463 |
End bp | 2596797 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641271736 |
Product | hypothetical protein |
Protein accession | YP_001537107 |
Protein GI | 159037854 |
COG category | [S] Function unknown |
COG ID | [COG1262] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR03440] conserved hypothetical protein TIGR03440 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.689177 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0943468 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGAGA CGACCAACCG GCCCGACGGT GTACGGCTGC GGGAGCACAT AGCGACGGAG CTGGCCCGCG CCCGCGCCCG TACCGCCGTG TTGACCGACG CGGTTGACGA CGCCGACCTG GTACGACAGC ACTCGCCCCT GATGTCGCCC CTGGTGTGGG ACCTCGCCCA CGTCGGCAAC CAGGAGGAGC TCTGGCTGGT GCGGGACGTC GGCGGCCGGG AGCCGGTTCG CCAGGACATC GACGACCTTT ACGATGCGTT CAAGCAGCCC CGCCGGGATC GCCCGGCATT GCCGCTGCTG CCGCCGCCGG AGGCGCGAGC ATACGTGTCG ACGGTCCGGG ACAAGGTGCT CGACCTGCTC GACCGGGTGG CCTTCACCGA CCGGCGGCTG GTTGCGGACG GCTTCGCCTT CGGCATGATC GTGCAACACG AACAACAGCA CGACGAGACG ATGCTCGCGA CCCACCAGCT GCGATCCGGC CCGGCCGTGC TCGACGCGCC ACCCCCGCCG GAGCCCCGGG TTCGGGTCGC CGGCGAGGTA CTGGTTCCGG CCGGCGAGTT CACCATGGGC GCCGACACCG ATCCGTGGGC GTTGGACAAC GAGCGTCCCG CCCACCAGGT GTACCTGCCG GCGTACGCCA TTGACGCGGC TCCGGTCACC AACGGTGCGT ACGCGGCGTT CATCGCCGCG GGCGGCTACC ACGACCCGCG GTGGTGGAGC GCCGCGGGCT GGGCGTATCG GCAGCAGGCG GGCCTGACCG GGCCGTTGCA CTGGCGCCCG GACGGCGACG GCTGGGCCTA CCACCGCTTC GGCCGGTGGG CGCCGGTACG CGAGGACGAG CCGGTGGTGC ACGTCAGTTG GTATGAGGCG CAGGCGTACG CCGCCTGGTC AGGTAAGCGG TTGCCAACTG AGGCGGAGTG GGAGAAGGCA GCCCGCTGGG AACCGGCGAC AGGTCGGTCC CGCCGCTACC CGTGGGGCGA CGAGGATCCG ACGGTCGACC ATGCCAATCT GGGTCAGCGG CACCTGTGGC CGGCACCGGT CGGGGCGTAC CCGGCCGGTG CGTCACCGCT CGGCGTCCAC CACCTGATGG GCGACGTGTG GGAGTGGACC TCGACCACCT TCCGCGGCCA CCCCGGCTTC GTGGCCTTCC CCTACCGGGA GTATTCCGAG GTCTTCTTCG GCGACGACTA CCGGGTGCTG CGGGGCGGGT CGTTCGGCAC CGATCGGGCC GCCTGTCGGG GCACCTTCCG CAACTGGGAC TATCCGATCC GGCGGCAGAT CTTCAGCGGT TTCCGCTGTG CCCGGGACGC CGCACCTGGG GAGGCACCCG CGTGA
|
Protein sequence | MTETTNRPDG VRLREHIATE LARARARTAV LTDAVDDADL VRQHSPLMSP LVWDLAHVGN QEELWLVRDV GGREPVRQDI DDLYDAFKQP RRDRPALPLL PPPEARAYVS TVRDKVLDLL DRVAFTDRRL VADGFAFGMI VQHEQQHDET MLATHQLRSG PAVLDAPPPP EPRVRVAGEV LVPAGEFTMG ADTDPWALDN ERPAHQVYLP AYAIDAAPVT NGAYAAFIAA GGYHDPRWWS AAGWAYRQQA GLTGPLHWRP DGDGWAYHRF GRWAPVREDE PVVHVSWYEA QAYAAWSGKR LPTEAEWEKA ARWEPATGRS RRYPWGDEDP TVDHANLGQR HLWPAPVGAY PAGASPLGVH HLMGDVWEWT STTFRGHPGF VAFPYREYSE VFFGDDYRVL RGGSFGTDRA ACRGTFRNWD YPIRRQIFSG FRCARDAAPG EAPA
|
| |