Gene Sare_4342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4342 
Symbol 
ID5708410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4909867 
End bp4911129 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content66% 
IMG OID641273764 
Producthypothetical protein 
Protein accessionYP_001539114 
Protein GI159039861 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0527256 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGTTG TTCAGTCCTG CGCGAGATGC CGGACCGTGC TCATTCCCGG CCAGCAGGGG 
TGCGTGCGAT GTGGCCTCAC GGCCGCCGAG CCGACACGGG AGTGCCCCGC CTGTCAACGT
CCCACCGCCG TGGATGCGCA GTACTGCCCT GCCTGCGGAG AACAGCTGAG GCAATCGTCA
CCGATGCTCG CCGCCGACCT GGCTACGGCA CCGGCGCCGG GCACTGACCC GTCCGCCCCA
TTGACGGCAG CGCCCGCCGT GTTCGCCACA GTGCGCAAAC CCCGTCGCTG GGTATATCTG
CTCACCGGCG TCGCACTGGC AGTGCTGCTG ACCGGCATCA CGGGCCTCTA CGCGGTGCAG
AGCATCGTCT ACACCCCCGA GCGTGTGGTG ACGGGCTACT TTGCCGCACT GTCCGAACGG
GACGCCGCGG CGGCGCTGTC ATTCCTTGAG GAGCCGCGCA GCGAGATCCC GAATCGCCCC
TTGGAACTGC CGATGGTCCC GTTGACCACG TCCTATCAGC CGCCATCGGG GGCAAAGGTC
ACGTCGATCG GGGGCCTGAG CGAGGCCGAG TTAGAGGGCA GCCCGCCTGC GGAGAACAGT
GATGACTGGC GTTCGGCGAG GGTTACGTAC AGGGTCGGCG ACCGTACCTA TCGGGACGTG
CTCTATCTGC ACCGGCAGGA ACGGAAAGAG TTAGGCCTCT TCCGTGACTG GCTGATCTAC
GGCGGGGTGA ATCAGCTGGC CGTACGCAAT CGGCCGGACA GTCCTGGCGT GCTTATCAAC
GGGCAGGCGG TACCGACCCG CGAGGGGTAC GCGCGGGCAC GCGCCTTTCC CGGTATCCAC
GAGGTGCGGC TGGCCGACGA CCCGTTGGTC GAGGTGGAGC CGGTGGTTCT GGAGGTTGGC
CTGGTACGGC CCGACAATGT ACTGCTCAAG CCGATTCTCA GGGAATCCGC GCGCAGCGAG
GTCGAGAGTC AGGTGAAGGC GTACCTGGAC GAGTGTGCCG AGAGCAGCGA CATGTCGCCG
AAAGGCTGCC CCTTCTCCGG TCCTCCGTTC GGGACCGCGA CGAACGTCAG GTGGACGATC
GACGCGTACC CGAAGCTTGA CATCCGGGCG ATCGACGGTG AGCTCACCGT CAGGGGCTGG
TCGGGACGTG CCTCCGTGAC GTGGACTGGT TCCGGCGGCA GGACGCACGA ATACGACAAC
CCCTTCGTTG TCACCGGTCG GGCCACAGTG ATCGACGGTA GGGTGACGTT CCTCAGCGAC
TGA
 
Protein sequence
MTVVQSCARC RTVLIPGQQG CVRCGLTAAE PTRECPACQR PTAVDAQYCP ACGEQLRQSS 
PMLAADLATA PAPGTDPSAP LTAAPAVFAT VRKPRRWVYL LTGVALAVLL TGITGLYAVQ
SIVYTPERVV TGYFAALSER DAAAALSFLE EPRSEIPNRP LELPMVPLTT SYQPPSGAKV
TSIGGLSEAE LEGSPPAENS DDWRSARVTY RVGDRTYRDV LYLHRQERKE LGLFRDWLIY
GGVNQLAVRN RPDSPGVLIN GQAVPTREGY ARARAFPGIH EVRLADDPLV EVEPVVLEVG
LVRPDNVLLK PILRESARSE VESQVKAYLD ECAESSDMSP KGCPFSGPPF GTATNVRWTI
DAYPKLDIRA IDGELTVRGW SGRASVTWTG SGGRTHEYDN PFVVTGRATV IDGRVTFLSD