Gene Information |
Locus tag | Sare_3997 |
Symbol | |
ID | 5704883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4547872 |
End bp | 4549113 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641273422 |
Product | hypothetical protein |
Protein accession | YP_001538778 |
Protein GI | 159039525 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
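
The length fields above follow from the coordinates. A minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(4547872, 4549113)
print(length)                     # 1242
print(protein_length_aa(length))  # 413
```

Both values agree with the Gene Length and Protein Length fields of this record.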
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00194517 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0036434 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACACAGG GCGAGCCGGA GCTGCCGGAG CCGACCGGCC GCATGAACCA GCCGCCGGCC GGTCAGCCCA CCGGACCAGG CCCGACCAAC GCACCGCGCC CCTCCTTCCC CGGCGACACC GCGGGGGAAG GGGCCGAATC CGGTGCGCCC CCGGTTCCGC CCGAAGCGCC GCAGCGGCAG CCCGACAACG GGTCCGGGCG GTTCGGCATG CCGGGACGGC CACTGCGGCG CAACAGCTTC CTCGTCGGCT TCACCGGTGG CCTCGGCGCA CTGCTGGCGT ACGCACTCTT CCTCGGCCTG CGCAACGCCG CCGGCCTGCT CGTCCTCGTG GTGATCGCGC TCTTTCTCGC CGTCGGGCTC TACCCGGCGG TGGCGCGGCT ACGCCGGCTC GGGCTGCCCC ACGGACTGGC GGTCGCGGTC GTCACGCTGA CACTTCTCCT GCTGTTCTGC AGCGGCGTGG TCGCGCTGGT ACCTCCGGTC GTCACTCAGT CCAACCAGTT CATCGAGCAG TTTCCCAACT ACGTTGAGTC ACTGCGGCGC AACGAGACGA TCAACGAGTT GGTCGAACGG TACGACCTGA TGGAACGGAT AGAGCGGGCC GCCGACACCG ACACGCTCGG CCACGCGCTC GGCGGGGTAC TCGGCGGCGC TCAGCTCATC TTCGGCACCG CATTCCGGAC CCTGACCGTG CTTGTGCTCA CCGTCTATTT CCTGGCGTAC TTCAACCGGT TGCGGTCGCT CGGGTACGCG CTCGTTCCCC GGTCCCGGCG GGACCGGGTA CGGCTGATCG GCGATGAGAT CATCATGAAG GTCGGCGCGT ACATCGTCGG GGCGCTCATC ATCGCCGTCC TCGCCGGCAC GACCACCTTC GTGTTCGCGG TGATCGCCGA GCTACCGTAC CCGTTCGCCC TGGCCGTCGT GGTGGCGGTG GCCGACCTGA TCCCGCAGAT CGGCGCGACG CTCGGAGCGG TGATCGTGAG CCTGGTCGGC TTCGCCACCG ACCTGCCGGT GGGGATCGCC TGCGTGGTGT TCTTCCTCAT CTACCAGCAG TTGGAGAACT ACCTGATCTA CCCCAAGGTG ATGCGTCGAT CGGTGCAGGT CAACGAGGTG GCTGCGCTGG TCGCGGCGCT GCTCGGCGTC GCCCTGATCG GCGTGGTGGG CGCCTTCATC GCGATCCCCA CGGTCGCGGC GTTCCAACTG ATCCTGCGCG AGGTGATCGT CCCGCGTCAG GATTCCCGCT GA
|
Protein sequence | MTQGEPELPE PTGRMNQPPA GQPTGPGPTN APRPSFPGDT AGEGAESGAP PVPPEAPQRQ PDNGSGRFGM PGRPLRRNSF LVGFTGGLGA LLAYALFLGL RNAAGLLVLV VIALFLAVGL YPAVARLRRL GLPHGLAVAV VTLTLLLLFC SGVVALVPPV VTQSNQFIEQ FPNYVESLRR NETINELVER YDLMERIERA ADTDTLGHAL GGVLGGAQLI FGTAFRTLTV LVLTVYFLAY FNRLRSLGYA LVPRSRRDRV RLIGDEIIMK VGAYIVGALI IAVLAGTTTF VFAVIAELPY PFALAVVVAV ADLIPQIGAT LGAVIVSLVG FATDLPVGIA CVVFFLIYQQ LENYLIYPKV MRRSVQVNEV AALVAALLGV ALIGVVGAFI AIPTVAAFQL ILREVIVPRQ DSR
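
The sequence fields can be cross-checked against the annotation. The sketch below is illustrative only: it assumes the space-separated 10-character blocks are purely cosmetic, that translation table 11 uses the standard codon assignments with the initiator codons TTG, CTG, ATT, ATC, ATA, ATG and GTG, and that an initiator codon in the first position is read as M. The names `clean`, `gc_percent` and `translate` are hypothetical helpers, not functions of any existing library.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; table 11 shares them.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiator codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Drop the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("GTGACACAGG GCGAGCCGGA GCTGCCGGAG"))  # MTQGEPELPE
```

The gene starts with GTG, so the leading M of the protein sequence comes from the initiator rule rather than from the codon table itself. Passing the full Gene sequence field to `gc_percent` and `translate` lets the result be compared with the GC content and Protein sequence fields.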