Gene Sare_0439 details


Gene Information

Locus tag: Sare_0439
Symbol:
ID: 5707860
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 503289
End bp: 504542
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 68%
IMG OID: 641269964
Product: low temperature requirement A
Protein accession: YP_001535359
Protein GI: 159036106
COG category: [S] Function unknown
COG ID: [COG4292] Predicted membrane protein
TIGRFAM ID:
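
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 503289
END_BP = 504542
GENE_LENGTH_BP = 1254
PROTEIN_LENGTH_AA = 417


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the listed values: 504542 - 503289 + 1 = 1254 bp, and 1254 / 3 - 1 = 417 aa.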


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000555172
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGGGAGCA GGTCCGAGGG AGCGAACCGG CCGCGTCGGA CGGCGGGGCT GAGGCGGCTG 
CGGAAGCGGC TGTGGCAACC GCCCCGCCCG CACGGACAGG TTCGTGCGGG CCGCCGGGTC
AGTTTCCTGG AGCTCTTCTA CGATCTGATC TTCGTCGTGG TGATCGGCCG CGCGGCCCAT
GCCCTCGCTG CGGACGTGTC GTGGCGCACG GTCGGCGAGT TCGTGGTCGT GTTCGGGCTG
ATCTGGATCG CCTGGATGAA CGGCACGATC TACCAGGATC TGCATGGCCG AGAGGACGTA
CGCAGCCGGT CGTACGTGTT CTTTCAGATG CTCATCCTCA CCGTGTTGGC GCTCTACACC
GAGCACGCCG GTGGCGAGGA CGGCCGCGCC TTCGCGGTCG TCTACAGCGT GCTGCTGCTG
GTGCTGACGT GGCTCTGGTA CGTGGTGCGG AGGCACGATC CCCCCGAGCT GCGCCCAGTG
CCCGCCCTGT ACCTGATCGG CATGGTCCTC TCGGTGGCAA CTGTTTCGGG GAGCGCGGTG
TTGCGGCCAC AGCACCGTCT GCTGGTGTGG GCGGTGTTGG TGGTGGCCTG GATGATCGGA
CTGATGCTGC TGCATCGGCC CGACCGGTCG ATCGAGGACA TCGGCGTGCG ACCGTCGCCG
GCCCTGGCTG AGCGGTTCGG CCTGTTCACC ATCATCGTGC TCGGCGAAGC GGTCCTCGGT
GTCGTGGTCG GCATACTGGA CGCGGGACGG ACTCCTCGCG CCATCGCGAC CGGCCTGACC
GGTTTGTTGA TCGCATATGC CTTCTGGTGG ACCTACTTCG ACCTCGTGGG TCGTCGGCTA
CCCGGTGTCA CCGGCGGTTG GTTCGGCCGC TGGGTCTCCC TTCACCTTCC GGTGACGGGT
GCGATTGCCG CGGCGGGGGC AGGCATGGAG AGCATGGTCG AGCACGCCAC CGACAGCCAC
GTTCCGCTGG CGACTGGCTG GCTGCTCGCC GGGGCTGTCG CGCTGCTGAA CCTCGGGCTC
ATCATGTTGA TCCGCACCCT CAACGACTAT CGCCGACTCC TACCGGTCTA TCGGCCGGTG
AACGTCGCGC TGGCCGTCGG GGCCGGGGTG GCGCTGCTGA TCGGTTGGAT CCGTCCGCCC
CCATGGATGC TGGCACTGTC GCTGGTGGCG GTTCTCGCCG CCGTCTGGTG GGTCGGGGTC
AACCGGATGC TCCACCTGCC CGATCCGGAC GAAGCACTGC CCAAGCCGGA ATGA
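
The GC content field reported for this gene is the percentage of G and C bases in a sequence such as the one above. A minimal sketch of that calculation follows, assuming whitespace is only layout and that any character other than A, C, G or T is skipped; `gc_content` is a hypothetical helper, not the method the source database necessarily uses.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters are ignored (an assumption made
    so that the 10-base blocks of the listing can be pasted in directly).
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First 10-base block of the gene sequence: 5 G + 1 C out of 10 bases.
    print(gc_content("ATGGGGAGCA"))  # 60.0
```

Passing the full gene sequence to the same function gives the gene-wide figure, which can then be compared with the value in the Gene Information table.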
 
Protein sequence
MGSRSEGANR PRRTAGLRRL RKRLWQPPRP HGQVRAGRRV SFLELFYDLI FVVVIGRAAH 
ALAADVSWRT VGEFVVVFGL IWIAWMNGTI YQDLHGREDV RSRSYVFFQM LILTVLALYT
EHAGGEDGRA FAVVYSVLLL VLTWLWYVVR RHDPPELRPV PALYLIGMVL SVATVSGSAV
LRPQHRLLVW AVLVVAWMIG LMLLHRPDRS IEDIGVRPSP ALAERFGLFT IIVLGEAVLG
VVVGILDAGR TPRAIATGLT GLLIAYAFWW TYFDLVGRRL PGVTGGWFGR WVSLHLPVTG
AIAAAGAGME SMVEHATDSH VPLATGWLLA GAVALLNLGL IMLIRTLNDY RRLLPVYRPV
NVALAVGAGV ALLIGWIRPP PWMLALSLVA VLAAVWWVGV NRMLHLPDPD EALPKPE
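
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below assumes the amino acid assignments of table 11 match the standard genetic code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical, and alternative start codons are not handled.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout of genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA in frame 1 and stop at the first stop codon.

    Whitespace is removed first; a trailing partial codon is ignored.
    """
    dna = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    first_line = ("ATGGGGAGCA GGTCCGAGGG AGCGAACCGG "
                  "CCGCGTCGGA CGGCGGGGCT GAGGCGGCTG")
    print(translate(first_line))  # MGSRSEGANRPRRTAGLRRL
```

The first 60 bases of the gene give the first 20 residues of the protein, and the final line of the gene sequence ends in the stop codon TGA, which is why the 1254 bp gene yields 417 residues rather than 418.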