Gene Sare_0846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0846 
Symbol 
ID5705948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp946740 
End bp948143 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content73% 
IMG OID641270364 
Producthypothetical protein 
Protein accessionYP_001535755 
Protein GI159036502 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.421105 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCGT CCACCGCGCC GCCGTCCCAG GCAGAGCAGC CGGAGCAGAC CGGCGGCTGG 
GTGCCGACTC GGGCCCTGGG CCGGGCCGTG CTCCTGACCG GACTGCTCCT GATCGCCGGC
GTGCTGCTCG GGCGCGTCGA CCTGGTCGTG CTGGCTGCCC CGTTCGCGAT CGGCACCGCG
TACGCCCTGC GACGTCGGCC CGCCGCGGTG CCACAGGTCC GGATCGCCAC CGCGGAGGGG
AACCTGGTGG AGGGCGGTGA ACTGGCTGGA CGGATCACCG TCGGAAACCC GGATTCGGTC
GACTACGACC TCGCCGTGGT CCGCAGCCGA ACGTCGCCGT GGCTGCGGGT CGACCGCGTG
ACGATCGCCG GTGTGGCGGT GGCCGCTGCC GCGCCAGAGG CCGGCCGGTC GATCGCCGAC
CGTCCGTTGG TGACCGCCGT ACCGGCCGGC GAGGTGGTCG ACGTGGACCT GTCCGGCACT
GCGCGACGGT GGGGCCGGCA TTCGCTGGGT CCGGCCGGAA CCCAGGCTTC CGCCGCGCTC
GGGCTCCTGG TTTCACCCCC GGTGGTCACC GAAGCGATCC AGGTGAGTAC CTATCCGGTG
ACCGACCCGT TCGACGCGGT GGAGGCGATG CCCCGCGCGG CGGGCCTGGT CGGCGCACAC
CATTCGCGAC GCCCGGGCGA AGGCGGCGAG CTGGCCGGTG TGCGGGTCTT CGCCCCCGGC
GACCGGCTGC GCCGGATCGA CTGGCGGGTC TCACTGCGGG CGCGGCAACT GCACGTCGCG
GCAACCCTCT CCGACCGGGA CGCCGAGGTG GTGGTGCTGC TCGACGTGCT CGCGGAGGCA
GGTCGCTCCG GTGGGGTCGG CGGTACCGCG TCGGTGCTGG ATACGACGGT TCGGGCTGCC
GCGGCGATCG CGGAGCACTA CCTGCACCGC GGCGACCGGG TGTCGATGGT GGAGTACGGT
CCGGCCGGTC GCCGGTTGCG TCCCGCCACC GGCCGCCGCC AGTTCCTGAC GGTTCTGGAG
TGGTTGCTCG ACGTGCATCC GCAATCCTCC CCACACGAAC TCTACGACTC GGTGCTCGGA
TCACAGATGC TGTCGTCGGA CGCATTGGTG GTGGTGCTCA CGCCCCTGCT GGACGAGCGG
TCCGCGCAGA TGCTGGCCCG GTTGGCCTGG TCCGGGCGCT TCGTCGTCGC GGTCGACACC
CTGCCCATCG ACCTGACCCC GCCCCGGGAC CGGGGCTGGG CGGAGGCGGC GCACCGGCTG
TGGCGGCTGG ACCGGGACAC GATGGTGCGT CAGCTGCGGG AACACGGCGT ACCGGTGGTG
CGGTGGGCCG GCGCCGGCAG CCTGGACGAG GTGCTGCGTG ATGTGGCCCG GCTCGCCACA
GCTCCGAGAG CGGGGGCCCG GTGA
 
Protein sequence
MTPSTAPPSQ AEQPEQTGGW VPTRALGRAV LLTGLLLIAG VLLGRVDLVV LAAPFAIGTA 
YALRRRPAAV PQVRIATAEG NLVEGGELAG RITVGNPDSV DYDLAVVRSR TSPWLRVDRV
TIAGVAVAAA APEAGRSIAD RPLVTAVPAG EVVDVDLSGT ARRWGRHSLG PAGTQASAAL
GLLVSPPVVT EAIQVSTYPV TDPFDAVEAM PRAAGLVGAH HSRRPGEGGE LAGVRVFAPG
DRLRRIDWRV SLRARQLHVA ATLSDRDAEV VVLLDVLAEA GRSGGVGGTA SVLDTTVRAA
AAIAEHYLHR GDRVSMVEYG PAGRRLRPAT GRRQFLTVLE WLLDVHPQSS PHELYDSVLG
SQMLSSDALV VVLTPLLDER SAQMLARLAW SGRFVVAVDT LPIDLTPPRD RGWAEAAHRL
WRLDRDTMVR QLREHGVPVV RWAGAGSLDE VLRDVARLAT APRAGAR