Gene Sare_3765 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3765 
Symbol 
ID5705668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4301368 
End bp4302864 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content69% 
IMG OID641273185 
Productphage-related major capsid protein 
Protein accessionYP_001538549 
Protein GI159039296 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones134 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones1596 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGGAGT ACCTTCGCGC GCAGCTTGTC CAGCTGCGCG AGCGGCGGGC CGCGCTGCGT 
GCCGAACTCG ACGCCGTCAT CAGCGGCGCC CGGTCGGCAA GCCGCGACCT GACCGACGAC
GAGCAGGCTC GCCTCGTCGC GGGAACCGAG CAGCTGCGCA AGCTCAACGG TGACGAGGAC
GGCCTGAATA GCCAGATCCG CGAGGCGGAG GAGACCGAAC ACCGCGAGCA GGTCGCGGCG
GGCGCCCGCG CCGAGTCCGG CCAGGTGGGC GAGCACCGCA CCGGCGGCGC CGTGGTCACC
AGTGAGCCGC AGGTGTACGG CAACGGCTCC GGAAACTCCT ACTTCCACGA CCTGGCCAAG
GCGCAACTGC GCGCCGACTC CAGCGCCGCG GAACGCCTGC AGCGGCACGC CGCCGAACTA
CGGGTCGAAC TACCCGCCCG GGAGCGGCGC CGCGAGGAGC GCGCCCAGCG GGAGATGGAC
GGCATGGGCA CGGCCGAGCG CTGGCACGAG GAGCAGCGCA GCCGCGTGTT CGAGAAGCGG
GTCAACCCGA ACCGGACCGA CGGCCAGGGC GGCTACTTCG TGCCGCCGCT GTGGCTGATC
GACCAGTACA TCGACCTGCC GCGCTTCGGT CGGCCGATCG CCAACGCCGT GCGCAACATG
GCACTGCCGG GCGGCACCGA CTCCGTGAAC CTGCCGAAGG TCGCCACCGG CACGTCAACC
GCCGCGCAGA CCGCCGATGG TGCCCCGGTG ACCAGCACCG ACATGACCGA TACCAGCGTG
TCCGCATCGG TCTACACGGT CGCCGGCCAG CAGGACGCGT CCATGCAGCT ACTCGACCAG
TCCCCGGCGC CGGGCTTCGA CGAAATCATC TTCGCCGACC TACTCGCCGA CCTCGCGGTT
CGGCAGGACG TGTACGTGAT CAACGGCTCC GGGACTGCCG GGCAGCCGAC CGGCATCCTC
AACGTCAGCT CGCCGAACGC GATCACCTAC ACGGACGCGT CGCCGACGCT GCCGGAGATG
TACGTGCCGT GGGTCCAGTC GGTCTCCCAG ATCTTCACCA ACCGCAAGCG GCCCGCCACA
GCCACCTTTG CGTTGCCGAA GATCTGGTTC TGGGCGACTG CCGGTCTCGA TACCACAAAC
CGGCCGCTGA TCCAGCCGTC ACAGGAGGCG CCGTTCAACC CCATGGCCTT ACAGACCGGC
GAAATCGCTG AGGGCCCGGT CGGCAAACTG ACCGTCGGCA CACCGGTGAT CCTCGACGGC
AACATCCCGG AGAACCTCGG CGCCGGCACC GACGAAACGC GGATCATCAC GCTACGCACC
TCCGACCTGT ACCTGTGGGA GGGCGCGATC CAGACCCGTG TCCTCACCGA GGTGCTGTCG
GGGACGCTGC AGGTCCGCTT CCAGGTGTAC CGGTACGCGG CGTTCATGGC CACCCGGCTA
CCGAAGGCGA TTTCGATCGT CTCGGGCACC GGCATGATCC CGACCTCCGG CTACTGA
 
Protein sequence
MLEYLRAQLV QLRERRAALR AELDAVISGA RSASRDLTDD EQARLVAGTE QLRKLNGDED 
GLNSQIREAE ETEHREQVAA GARAESGQVG EHRTGGAVVT SEPQVYGNGS GNSYFHDLAK
AQLRADSSAA ERLQRHAAEL RVELPARERR REERAQREMD GMGTAERWHE EQRSRVFEKR
VNPNRTDGQG GYFVPPLWLI DQYIDLPRFG RPIANAVRNM ALPGGTDSVN LPKVATGTST
AAQTADGAPV TSTDMTDTSV SASVYTVAGQ QDASMQLLDQ SPAPGFDEII FADLLADLAV
RQDVYVINGS GTAGQPTGIL NVSSPNAITY TDASPTLPEM YVPWVQSVSQ IFTNRKRPAT
ATFALPKIWF WATAGLDTTN RPLIQPSQEA PFNPMALQTG EIAEGPVGKL TVGTPVILDG
NIPENLGAGT DETRIITLRT SDLYLWEGAI QTRVLTEVLS GTLQVRFQVY RYAAFMATRL
PKAISIVSGT GMIPTSGY