Gene Sare_3714 details

Gene Information

Locus tag: Sare_3714
Symbol:
ID: 5705507
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4266242
End bp: 4267738
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 69%
IMG OID: 641273134
Product: phage-related major capsid protein
Protein accession: YP_001538498
Protein GI: 159039245
COG category:
COG ID:
TIGRFAM ID:
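
The coordinate and length fields above are mutually consistent, and a minimal Python sketch makes the arithmetic explicit. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration; they are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4266242, 4267738)
print(length)                  # 1497
print(protein_length(length))  # 498
```

Both results match the Gene Length and Protein Length values listed in the record.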


Plasmid Coverage information

Num covering plasmid clones: 143
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGGAGT ACCTTCGCGC GCAGCTTGTC CAGCTGCGCG AGCGGCGGGC CGCGCTGCGT 
GCCGAACTCG ACGCCGTCAT CAGCGGCGCC CGGTCGGCAA GCCGCGACCT GACCGACGAC
GAGCAGGCTC GCCTCGTCGC GGGAACCGAG CAGCTGCGCA AGCTCAACGG TGACGAGGAC
GGCCTGAATA GCCAGATCCG CGAGGCGGAG GAGACCGAAC ACCGCGAGCA GGTCGCGGCG
GGCGCCCGCG CCGAGTCCGG CCAGGTGGGC GAGCACCGCA CCGGCGGCGC CGTGGTCACC
AGTGAGCCGC AGGTGTACGG CAACGGCTCC GGAAACTCCT ACTTCCACGA CCTGGCCAAG
GCGCAACTGC GCGCCGACTC CAGCGCCGCG GAACGCCTGC AGCGGCACGC CGCCGAACTA
CGGGTCGAAC TACCCGCCCG GGAGCGGCGC CGCGAGGAGC GCGCCCAGCG GGAGATGGAC
GGCATGGGCA CGGCCGAGCG CTGGCACGAG GAGCAGCGCA GCCGCGTGTT CGAGAAGCGG
GTCAACCCGA ACCGGACCGA CGGCCAGGGC GGCTACTTCG TGCCGCCGCT GTGGCTGATC
GACCAGTACA TCGACCTGCC GCGCTTCGGT CGGCCGATCG CCAACGCCGT GCGCAACATG
GCACTGCCGG GCGGCACCGA CTCCGTGAAC CTGCCGAAGG TCGCCACCGG CACGTCAACC
GCCGCGCAGA CCGCCGATGG TGCCCCGGTG ACCAGCACCG ACATGACCGA TACCAGCGTG
TCCGCATCGG TCTACACGGT CGCCGGCCAG CAGGACGCGT CCATGCAGCT ACTCGACCAG
TCCCCGGCGC CGGGCTTCGA CGAAATCATC TTCGCCGACC TACTCGCCGA CCTCGCGGTT
CGGCAGGACG TGTACGTGAT CAACGGCTCC GGGACTGCCG GGCAGCCGAC CGGCATCCTC
AACGTCAGCT CGCCGAACGC GATCACCTAC ACGGACGCGT CGCCGACGCT GCCGGAGATG
TACGTGCCGT GGGTCCAGTC GGTCTCCCAG ATCTTCACCA ACCGCAAGCG GCCCGCCACA
GCCACCTTTG CGTTGCCGAA GATCTGGTTC TGGGCGACTG CCGGTCTCGA TACCACAAAC
CGGCCGCTGA TCCAGCCGTC ACAGGAGGCG CCGTTCAACC CCATGGCCTT ACAGACCGGC
GAAATCGCTG AGGGCCCGGT CGGCAAACTG ACCGTCGGCA CACCGGTGAT CCTCGACGGC
AACATCCCGG AGAACCTCGG CGCCGGCACC GACGAAACGC GGATCATCAC GCTACGCACC
TCCGACCTGT ACCTGTGGGA GGGCGCGATC CAGACCCGTG TCCTCACCGA GGTGCTGTCG
GGGACGCTGC AGGTCCGCTT CCAGGTGTAC CGGTACGCGG CGTTCATGGC CACCCGGCTA
CCGAAGGCGA TTTCGATCGT CTCGGGCACC GGCATGATCC CGACCTCCGG CTACTGA
 
Protein sequence
MLEYLRAQLV QLRERRAALR AELDAVISGA RSASRDLTDD EQARLVAGTE QLRKLNGDED 
GLNSQIREAE ETEHREQVAA GARAESGQVG EHRTGGAVVT SEPQVYGNGS GNSYFHDLAK
AQLRADSSAA ERLQRHAAEL RVELPARERR REERAQREMD GMGTAERWHE EQRSRVFEKR
VNPNRTDGQG GYFVPPLWLI DQYIDLPRFG RPIANAVRNM ALPGGTDSVN LPKVATGTST
AAQTADGAPV TSTDMTDTSV SASVYTVAGQ QDASMQLLDQ SPAPGFDEII FADLLADLAV
RQDVYVINGS GTAGQPTGIL NVSSPNAITY TDASPTLPEM YVPWVQSVSQ IFTNRKRPAT
ATFALPKIWF WATAGLDTTN RPLIQPSQEA PFNPMALQTG EIAEGPVGKL TVGTPVILDG
NIPENLGAGT DETRIITLRT SDLYLWEGAI QTRVLTEVLS GTLQVRFQVY RYAAFMATRL
PKAISIVSGT GMIPTSGY
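
Both sequences are displayed in space-separated blocks of ten residues, so they need to be cleaned before any length or composition check. The sketch below shows one way to do that in Python. `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first and last displayed lines of the gene sequence are used as sample input; the full sequence would be pasted in the same way.

```python
def clean_sequence(text: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last displayed lines of the gene sequence, copied from the record.
first_line = "ATGTTGGAGT ACCTTCGCGC GCAGCTTGTC CAGCTGCGCG AGCGGCGGGC CGCGCTGCGT "
last_line = "CCGAAGGCGA TTTCGATCGT CTCGGGCACC GGCATGATCC CGACCTCCGG CTACTGA"

print(len(clean_sequence(first_line)))            # 60
print(len(clean_sequence(last_line)))             # 57
print(clean_sequence(first_line).startswith("ATG"))  # True
print(clean_sequence(last_line).endswith("TGA"))     # True
```

Applied to the full gene sequence, `len(clean_sequence(...))` can be compared with the listed gene length of 1497 bp, and `gc_fraction` with the listed GC content of 69%.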