Gene Sare_1369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1369 
Symbol 
ID5707288 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1584181 
End bp1585245 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content71% 
IMG OID641270880 
Producthypothetical protein 
Protein accessionYP_001536261 
Protein GI159037008 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.791695 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00309173 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCGTCG GGGTCGCACT CGTCACCTCG GTGGTTCTGC TGGCGTTGAA CGGCTTCTTC 
GTCGCCGCCG AGTTCGCCCT CGTGGCGAGC AAGCGCTACC GGCTGGAGCA GGCCGCTGCC
GGTGGTGGCC GGGCAGCCCG AGCCGCACTG GACGGCGTAC GGGAGTTGTC GCTCATGCTG
GCCGGCGCGC AACTGGGCAT CACCCTGTGC ACGCTGGGCC TGGGCGCGCT GGCCGAACCG
GCGATCGAGC GTCTGCTCAG CCCGTTGCTG CACGCCGTCG GGCTGCCCAC CGCGGCGAGC
CACGTCATCG CGTTGATCTT CGCGCTGAGC TTGGTGACCT TCCTGCATCT GGTGGTGGGG
GAGATGGCGC CGAAGTCGTG GGCGATCAGC GACGCCGAAC GGTCCGCGGT CTTGTTGGCG
CTGCCGTTCC GCGCTTTCGC CCGGGTGTCC CGGCCGGTGT TGTCGGCACT GAACTCGATG
GCGAACGGCA TCCTGCGCCT GTTCAAGGTC AAGCCGCAGG ATCAACTGGC CCAGGTGCAC
GGCCCGGAGG AACTGCGCAT CCTGCTGGAG CAGTCCCGTG AACACGGGCT GCTCGGTGCC
GAGCAGCACG AGTTGCTGAC CAGCATGCTG GAGCTGCAGG GCACGACGGT GGCCCAGGTG
ATGGAGCCGT TCGATCGGAT CGTCACCGTG CGACGGCACG CGGACGTAGG CCGGATCGAG
CAGGTCAGCC GCGACAGCGG GCGGTCCCGC CTGGCGGTGC TCGACGAGGC CGGTGACGTG
TGTGGGCTGG TGCACGTGCG GGAGGCGGTC CGGGCCGCGG TCAGCCGTCC GACGGCGACC
GCCGGGGAGC TGATGACGGC CGCGTTCACC CTGCCCGCGT CGGCGACGGT CACCGAGGCG
GTGGCGGCGA TGCGGGCCCG ACGTTCGCAG CTGGCCTTGG TCCGTAACGG CGGGGGGCCG
GCCCGTCCGG TCGGTTTCGT CGCGCTGGAG GACCTGCTGG AGGAGGTCAT CGGCGAGTTC
GACGATGAGA CGGATCCGGT TCCTCGGGGG CGGCGGTTGC GCTGA
 
Protein sequence
MSVGVALVTS VVLLALNGFF VAAEFALVAS KRYRLEQAAA GGGRAARAAL DGVRELSLML 
AGAQLGITLC TLGLGALAEP AIERLLSPLL HAVGLPTAAS HVIALIFALS LVTFLHLVVG
EMAPKSWAIS DAERSAVLLA LPFRAFARVS RPVLSALNSM ANGILRLFKV KPQDQLAQVH
GPEELRILLE QSREHGLLGA EQHELLTSML ELQGTTVAQV MEPFDRIVTV RRHADVGRIE
QVSRDSGRSR LAVLDEAGDV CGLVHVREAV RAAVSRPTAT AGELMTAAFT LPASATVTEA
VAAMRARRSQ LALVRNGGGP ARPVGFVALE DLLEEVIGEF DDETDPVPRG RRLR