Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2591 |
Symbol | |
ID | 5707176 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2953618 |
End bp | 2955798 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641272053 |
Product | hypothetical protein |
Protein accession | YP_001537423 |
Protein GI | 159038170 |
COG category | [S] Function unknown |
COG ID | [COG1944] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.245728 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.176917 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACTG CATACGACAC GGTGGCGCAG ACCCGGCCGC GAGTGCGCCA CGACGTGCTG TTCACCCGAA CCGAGGACGG AGTGCTGTTC CACAACGCCA CCAGCGGCTT CCGGTTCTCC TCCACCACCG CGTACCGGCT GGCGTCGGTA CTCGTCCCGC ACCTGAACGG GCGCAATCAG GTCGCGGACA TCTGTGCCCG GCTACCCGCC GGGCAACGGG CCATGATCGG TGAGCTGGTG AGCACCCTCT ACGCACGCGG CTTCGCCCGC GACGTCCCCG AGACGGAGGG AGACCCGACG GCGATCCTCG GCCCGGCGGT GGCCGCGCAC TTCGCCACCC AGGTTGCCTA CCTCGACCAC TACACCGACC GGGCGCCGCA GCGCTTCGCC ACCTTCCGGC ACACCTCGGT CGCGGTGCTG GGCGCCGGCC CGGTCGCCAC CGCCTGCGCC ACCGGACTGC TTCGCAACGG CGCCGCCACG GTGACCGTCT CGCCGGCGAT CGCGCCGAGG CTCGCGCCCG AGCTGGCCGA GCTCGACGCC GCAGGCTGCC CCGCGACGAC CGTGCCGCTG CCCACGACCG GCAACGAGGT CGGCTGGTCC GACCTGGCCA CAGCGCAGAT CGTCGTGGTG GCCGGCGGGG ACGACGCGCC CCGCGACACC CTGCGGCTCC TCGCGGCCGG CGTTCCCGCC GACCGGCTGC TCCTGCCGGC CTGGGTCGCC GGCGGACGGA TGCTCGTCGG GCCGGTGCAG GGCGAAGGCC GTACCGGATG CTGGTGCTGC GTCATGCGGC GACTCGCCGA CAACGACGAG ACCGGTGGCG CCGGGCAGGT GTGGCAGGCG GCGGCGCTCC CGTCCGGGGC CGCGCCAGCC GCCACCGAGC CCGACGGACC GCTTGCGGCG ATGATCGGCA ACCTGTTGGC GTACGAGGTG TTCCGGCTGA CCACCGGCGC GCTGCCCGCC GAGACCGACG GGAGCGTGAT CGTTCAGCAC CTCGCCTCAC TCGACGTGCT GACCGAGCAA CTTCTTGCGC ACCCCCGGTG CACCTTCTGC CGGCCGGCAC CGCCCGAACC GGCCTGGACC ACCGAAGGGC TGGACGAAGC GCCTGCCGAG GCCGCCTCCG CGGCCGACCC GGCCGCGGGG GCGCAGGAAG CGCTCGCACA GTTGGAGTCC CACCAACCGC TACTCCAGCC GCATCTGGGC GTGTTCCGTC GCTACGACGA CGAGCGTTGG GACCAGACCC CGATCAAGGT GGGCGCGGTC GAGCTCACCG ACGGCAGCGG CCGGCGCCGA ACGGTGACCG CGTTCGACGT CCACCACGTC GCGGCGGCCC GGCTGCGGGC ACTACGGATC GCGGCCGTCG TCAACACCAG CAGCATCGCG GTCGGGACCC CCGCCCCCCA GGGCGCCGAG CGGGTCGACG CCGCCCGACT CGGCCTCGCC TCAGGCTGGG GCGACGCCCC GGTGCAACGG TGGGCCACCG CCCGGTCACT GCTCAGCCGC GAGGTGGTGG CGGTGCCGAT GCCCGTGCTG GAGCCGTTCG GCGCGGCCAA CCGCCGGCAC GAGGCCGAGC CGACCAGCGC CGGTGGGGGA GCGGGCGGCG ACCTCACCGA GGCCGTCCGG GCCGGCCTCG CCTCGGCGCT CGCCGGGCAC ACGCTACGGC AGGTCATCGC CGGTCGGGAC ACCGTCCGAC GGATCCGCCT CGACACCATC GGCACCACCC CCGAGCTGGT GTTTCTCACC AGGTCAACAG CGAACCTGGG CGTCACCATG GAACTGCTCG ACCTCGGTGG ACAGCGCGAC ACCGGGGCGG CGGTGCTGCT GGCCCGCTCC TTCGACCCGG ACCGCGGTCA ATGGACGTTC GCGCTGACCG CCGATCCGGA CTGGACGACG GCGGCCGCGG CGGCGCTGCG CGACGTGCTC GGTCAGGCCC AGCTTCGCGC GCAGGACCCC GAGCTCGTAC CAGATATCGG CGACCCGGTG CTGGTCGACT TCGACCCCGG CACGGTGCCG GTGCACGACG AGGTGGACGC GGCCGGGGTG CGGCACCGCT GGGCCGACGT GCTGCAGCGG CTACCGGGAC TCGGGTACGA CGTGCTGGTG GCGCCCGTCG GTGGTGCGGA CCTGGCCGCG GGCGGGCTGG TGGCGGTCAA AGTGCTGCTC GCCGCCGGAG ACCACCGATG A
|
Protein sequence | MSTAYDTVAQ TRPRVRHDVL FTRTEDGVLF HNATSGFRFS STTAYRLASV LVPHLNGRNQ VADICARLPA GQRAMIGELV STLYARGFAR DVPETEGDPT AILGPAVAAH FATQVAYLDH YTDRAPQRFA TFRHTSVAVL GAGPVATACA TGLLRNGAAT VTVSPAIAPR LAPELAELDA AGCPATTVPL PTTGNEVGWS DLATAQIVVV AGGDDAPRDT LRLLAAGVPA DRLLLPAWVA GGRMLVGPVQ GEGRTGCWCC VMRRLADNDE TGGAGQVWQA AALPSGAAPA ATEPDGPLAA MIGNLLAYEV FRLTTGALPA ETDGSVIVQH LASLDVLTEQ LLAHPRCTFC RPAPPEPAWT TEGLDEAPAE AASAADPAAG AQEALAQLES HQPLLQPHLG VFRRYDDERW DQTPIKVGAV ELTDGSGRRR TVTAFDVHHV AAARLRALRI AAVVNTSSIA VGTPAPQGAE RVDAARLGLA SGWGDAPVQR WATARSLLSR EVVAVPMPVL EPFGAANRRH EAEPTSAGGG AGGDLTEAVR AGLASALAGH TLRQVIAGRD TVRRIRLDTI GTTPELVFLT RSTANLGVTM ELLDLGGQRD TGAAVLLARS FDPDRGQWTF ALTADPDWTT AAAAALRDVL GQAQLRAQDP ELVPDIGDPV LVDFDPGTVP VHDEVDAAGV RHRWADVLQR LPGLGYDVLV APVGGADLAA GGLVAVKVLL AAGDHR
|
| |