Gene Information |
Locus tag | Sare_4159 |
Symbol | |
ID | 5707708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4723511 |
End bp | 4724716 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641273586 |
Product | hypothetical protein |
Protein accession | YP_001538939 |
Protein GI | 159039686 |
COG category | [S] Function unknown |
COG ID | [COG2311] Predicted membrane protein |
TIGRFAM ID | |
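The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed numbers) and that the protein length excludes the terminal stop codon. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information table.
start_bp = 4723511
end_bp = 4724716
gene_length_bp = 1206
protein_length_aa = 401


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# 4724716 - 4723511 + 1 = 1206 bp; 1206 / 3 - 1 = 401 aa
computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(gene_length_bp)
print(computed_gene_length, computed_protein_length)
```

Under these assumptions both computed values agree with the Gene Length and Protein Length rows.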
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.776458 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000152397 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
Sequence |
Gene sequence | ATGGTTGCCG TAGTTCCAGA TGTAGGTACT CGGGCGACCC CTGTCCGACT GCGCCTGCTG GGACCCGACC TGGCGCGCGG GGCCATGCTG CTGTTGATTG CACTGGCCAA TGTTGATGTC TTCGCTTTCG GCTTCTTGCC AGGATTCCGG GGGTACCCCG CCGAGCAATC TATACTCGAC AGTATTTTCA CCATGGCGCG GATGGTGTTG GTTGACGGCC GGGCCTTTCC GCTTTTCGCC GCGTTGTTCG GCTATGGCCT CGTACAACTG CTGCAGAACC GGGGTTCTGC AGAACCCGCG CGGTCTTCGC GGCTGTTGCG TCGTCGTGGC ATGTGGTTGG TGGTTATCGG ATTCGTTCAC GGCATGCTTT TGTTCACGGC GGACATCGTT GCGCTCTACG GGTTGTGCGC CCTCGTCTTC GCTGGGCTCG TGGTACGTCT CAGTGACCGT GGCCTGCTGA CCGTAGCCCT GTCGCTGGTG GCACTCGCCC TGCTAACCGG TGCCGTCCGC GGACTGCCTG CGGAGGCTCT CGGCCAGGCA GGCGTAGTCA CAGCGACACC GACCATATTC GGTGGTGACG TCGTCGGCGC GCTGCAGGCG AGGATGAGTG AGTGGGCCTT GGGAGCGATA CGCCTATTCG GACTGATGCC GGCGGTGCTC TTCGGTGTCT ACGCGGGACG TAAGTCAGTC TTGACCTGGG GGCCAGAGCG GAAGCGGGTA CTTGGTCTGG TCGCGTTTAC CGGGCTGGCC GCCGGTATCG TTGCTGGGGT TCCTTCGGCG CTGATGGCGG CATCGGTGTG GACTGACCCG CCAATCGGTA TCGCCGCGAT TGCGGGAACG CTTCACCTGG CGGGCGGGTA TGCGGCTGCA GCCGGCTATC TGGCCCTATT TGCCCTACTC GCGGCCACCG TGCGGCGGCC TCCAGGCCTG ATAGTGAAGG CGCTATCGGT GAGTGGGCAA CGCTCCTTGA CGCTGTATCT TAGCCAGTCC TTGCTGTTCC TCGTTCTCTT CGATCCGGAC TTTTTTGGGC TGGGTGACAG ATTCGGTATT GCGTTGAACT CTGCTGTGGC TGCCGGTGTC TGGATCGTCG GCGTTCTCAG TGCGCTGTTG ATGGACAGGC TGTCCATCCG TGGCCCGGCC GAGGTGCTGC TACGCAGTCT CACCTACTGG CCGGTGGCCA GATCAGCCGG TCCGCGTTCT CGTTGA
Protein sequence | MVAVVPDVGT RATPVRLRLL GPDLARGAML LLIALANVDV FAFGFLPGFR GYPAEQSILD SIFTMARMVL VDGRAFPLFA ALFGYGLVQL LQNRGSAEPA RSSRLLRRRG MWLVVIGFVH GMLLFTADIV ALYGLCALVF AGLVVRLSDR GLLTVALSLV ALALLTGAVR GLPAEALGQA GVVTATPTIF GGDVVGALQA RMSEWALGAI RLFGLMPAVL FGVYAGRKSV LTWGPERKRV LGLVAFTGLA AGIVAGVPSA LMAASVWTDP PIGIAAIAGT LHLAGGYAAA AGYLALFALL AATVRRPPGL IVKALSVSGQ RSLTLYLSQS LLFLVLFDPD FFGLGDRFGI ALNSAVAAGV WIVGVLSALL MDRLSIRGPA EVLLRSLTYW PVARSAGPRS R
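The sequences above are printed in space-separated groups of ten. As a minimal sketch, the code below strips that grouping, computes a GC fraction, and translates the first three groups of the gene sequence to confirm they yield the first ten residues of the protein sequence. The helper names (`normalize`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, which shares its amino acid assignments with translation table 11; the two differ only in which codons may act as start codons, so this sketch would not render an alternative start codon such as GTG as M.

```python
# Compact codon table in TCAG order (standard amino acid assignments,
# which translation table 11 shares).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def normalize(grouped: str) -> str:
    """Remove the whitespace between ten-letter groups and uppercase."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three groups of the gene sequence, copied from the record.
sample_dna = normalize("ATGGTTGCCG TAGTTCCAGA TGTAGGTACT")
sample_protein = translate(sample_dna)
print(sample_protein)  # MVAVVPDVGT
```

Passing the full gene sequence through `normalize` and `gc_fraction` would give a value to compare against the listed GC content, and `translate` on the full sequence could be compared against the listed protein sequence in the same way.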