Gene Sare_3890 details


Gene Information

Locus tag: Sare_3890
Symbol:
ID: 5705828
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4430597
End bp: 4431796
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 71%
IMG OID: 641273315
Product: hypothetical protein
Protein accession: YP_001538672
Protein GI: 159039419
COG category: [S] Function unknown
COG ID: [COG4198] Uncharacterized conserved protein
TIGRFAM ID:
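
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the gene is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


print(gene_length(4430597, 4431796))  # 1200
print(protein_length(1200))           # 399
```

Under these assumptions the computed values agree with the Gene Length (1200 bp) and Protein Length (399 aa) fields.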


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.282654
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGTCG CCCATCCGAT CGCCCGGGCC TGGATCACCA CCGGCGGCAC CGGCGCGCAG 
AACTACGACG AGTTCGCCGA CGACGCGGAG ATCACCGCGA TCATCGAGGC GAATCCGCAC
AGTGCCCTCG GCATCGAGAT GCCGCACCGG GCCCCGGAGA GCCGGGGGAA GGCCTTCCTC
GACGCGCTGC CGGACGCGGC GGCCCGGCTG GCCGAGGCCA AGGCGGCTGG CAGCTACACA
CCTGCCGAGC AGGTGGTGGT CCTCTACCGA ATCAGCGCGC CGGACGAGGA GCCCGGGTAC
GGGCTGTTCG TCATGGTCGA CACCGACCAG ATCTCGACCA GCGCCGACGA GCCGGGTCTG
GTCATCCGTA ACGAGGACGT GTTCATCTCC AAGGTGCGCG AGCGGGTCGC CCTCGCCGAG
ACCCTGGGGC ACCTGCTGTC GCCGGTGCTC CTGCTGCAGA CCGGCCGCGG CGACGAACTG
CACGCCGCCC TCGCCGCAGC CACGGACCGG GCCGGGGTGC CGGCCGCCAC CGACACCGAT
CAGGCCGGGC GCACGCATGC AGTGTGGCTG GTCGGTCCAG GCCGAGAGCA GGATGAGCTG
ACCGCCCTGG CGGGCGGTGG CGAGTTGGTC GTCGCCGACG GCAACCACCG TAGTCTCGCG
GCCCAGACCG GCGGGCTACC ACGCTTCCTG GCCGTGGTCA CCACGCCGGC CTCGGTCGCC
ATCGCGCCGT ACAACCGGCT GGTCGAACAG CTCACCACCA CCCCAGACGA ACTGGTCGAC
CGGCTTCGCA CCGCCGGCGC CCAGGTCGAG CCGATCGACG CGCCGGTCGA GGTCCCGGCA
GCGGGCGGCA CCGTCCACCT CCGGCTACCC GATGCCGCGT ACGCGGTACG CCTGCCTCGG
GTGGGCGCCG GACGCCTGGA GAACCTGGAC CATGCCCTGG TAGAGCGGTT GCTGCTGCGG
GACGCGTTGG GGCTGGAACC GGGCGACAAG CGGATCATCT ACGTGGGCGG CGACTACCCG
GCGACTTGGC TTTCCGGTGA GGTCGACGCC GGACGAGCCG AACTGGCCGT CCTCGTCGCG
CCGGTGACCG TGGACGACTT CGTCGCGGTG AACCTGGCGC GGGAGAAGAT GCCACGCAAG
AGCACCTGGT TCACCCCGAA GGCCCGCGGC GGCCTGGTCG TCGCCGAGCT GGTGTCCTGA
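
A GC content figure such as the one listed under Gene Information can be recomputed from a sequence printed in this 10-base block layout. This is a small sketch, assuming GC content is defined as the fraction of G and C bases over all bases; the function name `gc_content` is hypothetical, and how the database rounds the reported percentage is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C bases in a DNA string; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# First two 10-base blocks of the gene sequence above.
print(gc_content("ATGACGGTCG CCCATCCGAT"))  # 0.6
```

Passing the full gene sequence, with its line breaks and spaces left in place, gives the value for the whole gene.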
 
Protein sequence
MTVAHPIARA WITTGGTGAQ NYDEFADDAE ITAIIEANPH SALGIEMPHR APESRGKAFL 
DALPDAAARL AEAKAAGSYT PAEQVVVLYR ISAPDEEPGY GLFVMVDTDQ ISTSADEPGL
VIRNEDVFIS KVRERVALAE TLGHLLSPVL LLQTGRGDEL HAALAAATDR AGVPAATDTD
QAGRTHAVWL VGPGREQDEL TALAGGGELV VADGNHRSLA AQTGGLPRFL AVVTTPASVA
IAPYNRLVEQ LTTTPDELVD RLRTAGAQVE PIDAPVEVPA AGGTVHLRLP DAAYAVRLPR
VGAGRLENLD HALVERLLLR DALGLEPGDK RIIYVGGDYP ATWLSGEVDA GRAELAVLVA
PVTVDDFVAV NLAREKMPRK STWFTPKARG GLVVAELVS
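
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The example below assumes the standard codon assignments, which translation table 11 shares with the standard code for internal codons (table 11 differs mainly in the start codons it permits; this gene begins with ATG, so that difference does not matter here). The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("ATGACGGTCG CCCATCCGAT C"))  # MTVAHPI
```

The output matches the first seven residues of the protein sequence, and the final codons of the gene (GTG TCC TGA) translate to VS followed by a stop, consistent with the last residues listed.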