Gene Sare_1050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1050 
Symbol 
ID5708329 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1176886 
End bp1178742 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content71% 
IMG OID641270566 
Producthypothetical protein 
Protein accessionYP_001535950 
Protein GI159036697 
COG category[S] Function unknown 
COG ID[COG4529] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.526264 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.107352 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCTTGC CGCAGCCGAT CCGTGCGATG TGCGTCGTGG GAGCCGGACC ACGCGGACTC 
GCTGTCCTCG AGCGGTTGAG TGCCAATCAC CCCGACGGGC GCCGGCTGCT GGTCCACGTC
GTGGACCCCT ACCCACCGGG GCCCGGCCGA ACCTGGCGCA CCACACAGTC CTCCCACCTG
CTCATGAACA CGGTGACCTC CCAGGTCAGC CAGTTCACCG ACGACAGTGT CCACTGTTCC
GGCCCGATCC GGCCGGGTCC GAGCCTGCAC GAGTGGCTAC ACCGTTCCCC CGTGCCCGCG
GGCCGGGCCT GGCCCGGCCC GGACCGGCTC GGCCCCGACG AGTACCCGTC CCGCGTCCAG
TACGGCCACT ACCTGACGTG GGTGTTCGAG GACCTTGTCG CGAACGCCCC CGAGGGCCTG
CGCTTCGTCG TCCACCAGGC GACGGCGGTG GCACTGGACG ACGAGGCGGA CGGCCAGTGC
GTCACCCTGG ACGAGGACAC CCGGATCACC GGCCTGGACG CCGTGGTGCT GGCGCTCGGG
CACGGAGCCA ACGAGCCGAC CGAGGAGGAG CGGCGGTTCA CCACGTACGC GTGCCGCAAC
GGGTTGCGCT ACCTACCGCC GGGCAATCCC GCCGACGCCG ACTTCGACGG CGTCGGCCCC
GGGGAGACCG TCGCCGTCCG GGGGCTCGGG CTGGCCTTCT TCGACGTGCT GTCCCTGCTC
ACCGAGGGCC GGGGAGGCAA GTTCGTCAAC TCCCCGGAGG GTCTGCAGTA CCTCCCCTCC
GGGCAGGAGC CGATCCTGTA CGCCGGATCC CGGCGCGGAG TGCCGCACCA CGCCCGCGGG
GAGAACCAGA AAGGACCGGC GGGCCGGCAC GAACCGCTGT TCCTGACCCT GGAGGCAGTT
CAGCGCATCC GGAACACACC CGACGCGACA TTCAAGCGGG ACGTGTGGCC ACTGCTGGAC
GCGGAGGTAC GCGCGGTCCA CCACCACGCC CTGGTGGCCC AGCGCACGGG ACGGGCGACG
GCCGACCGTT TCCTCACCGA CCTGCTGGCG GCCCCCGCGG ACGGTCGGGC GGACGTCCTG
CGCGGATACG GTCTCACCGA GCGGGACGAC TGGGACTGGC GAAGAGTCGA ATGCCCGTGG
GGCGAGCGGG CGTTCACCGA CCGCGCGGAC TTCAACCGGT GGCTGCTCGG CCACCTACGT
GACGACGTGC GGCAGGCCAG GAACGGCAAT GTCGACGACC CGTTGAAGGC CGCCCTGGAC
GTCCTGCGCG ACCTGCGCAA CGAGGTACGA CTCGCGATCG ACCACTCGGG GATCACCGGC
AGGTCGTACC GCGACGAGGT GGTGGCCTGG TTCAACCCGC TGAACGCCTA CCTGTCCATC
GGCCCGCCCC GGTCGCGGAT CGAGGAGATG ATCGCCCTGA TCGAGGCTGG CGTGCTCCAG
GTCGTCGGAC CGCGTACCCA GGTCCGTGCG GCCCCCGGCG GTGAGGGCTT CCTGATCGGC
TCCACCCACC TGGGGGGACC GGAGGTGCTG GCGACAACGC TGATCGAGGC ACGCATCGCC
GAGCCGGACC TACGCCGTAG CACGAATCCG CTGCTACGGC ATCTGCTCGC GACCGGGCAG
TGCCGGCCCT ACCGGATCGC CGACGGGGAC GACGACTACG AGAGCGGCGG CCTGGAGGTC
ACCGCCCGGC CCTATCGCGT GATGGACGCG TCGGGCGTTC CCCACCCGAG GCGATTCGCC
TACGGGGTGC CGACCGAGTA CGTGCACTGG GCCACCGCGG CGGGCATCCG CCCCAGCGTC
GGCTCCGTGA TCCTCGAGGA CGCGGACGCC ATCGCCCGCG CAACATCACC CTCATGA
 
Protein sequence
MTLPQPIRAM CVVGAGPRGL AVLERLSANH PDGRRLLVHV VDPYPPGPGR TWRTTQSSHL 
LMNTVTSQVS QFTDDSVHCS GPIRPGPSLH EWLHRSPVPA GRAWPGPDRL GPDEYPSRVQ
YGHYLTWVFE DLVANAPEGL RFVVHQATAV ALDDEADGQC VTLDEDTRIT GLDAVVLALG
HGANEPTEEE RRFTTYACRN GLRYLPPGNP ADADFDGVGP GETVAVRGLG LAFFDVLSLL
TEGRGGKFVN SPEGLQYLPS GQEPILYAGS RRGVPHHARG ENQKGPAGRH EPLFLTLEAV
QRIRNTPDAT FKRDVWPLLD AEVRAVHHHA LVAQRTGRAT ADRFLTDLLA APADGRADVL
RGYGLTERDD WDWRRVECPW GERAFTDRAD FNRWLLGHLR DDVRQARNGN VDDPLKAALD
VLRDLRNEVR LAIDHSGITG RSYRDEVVAW FNPLNAYLSI GPPRSRIEEM IALIEAGVLQ
VVGPRTQVRA APGGEGFLIG STHLGGPEVL ATTLIEARIA EPDLRRSTNP LLRHLLATGQ
CRPYRIADGD DDYESGGLEV TARPYRVMDA SGVPHPRRFA YGVPTEYVHW ATAAGIRPSV
GSVILEDADA IARATSPS