Gene Sare_3706 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3706 
Symbol 
ID5705499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4260475 
End bp4262580 
Gene Length2106 bp 
Protein Length701 aa 
Translation table11 
GC content69% 
IMG OID641273126 
Producthypothetical protein 
Protein accessionYP_001538490 
Protein GI159039237 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones83 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCC CCGCTCGGTT CGAGTTCCGG GTGGAGATCC ATCTCGGGGC GTTGGGGTGG 
GTGGACATCA CCGCGGATGT GCGGGGCTCG GTAAAGATCA CACGGGGACG GACCGCCGAG
GGTCGCCGCG TCGACCGGGG TACCGCTTCG ATGCGGCTGG ACAACAGCAG CGGTAGCTAC
TCGCCACGGC GGCCGACCGG TGCCTACTAC GGCGTGATTG GCCGCAACAC TCCGCTGCGG
GTATCCGCTG GCCTCGCCGG TGGAACCCTT CACGACCGGT TCTATGGGGA GGTGTCGTCG
TGGCCGCCGC AGTGGTCGGT GACCGGGGTT GACCGGTATG TCGACATCAC TGCGTCGGGG
GTCCTGCGTC GCCTCGGGCA GGGAGCCTCG CCGCTCAGGT CCCCTCTGCA TCGGGCGATC
ACCGGGACCG GTCCGGTGTC GTACTGGCCG GTTGAGGACG GCTCCGGTTC CACCCAGGCC
GCGTCTGGGC TGCCTGGTGG CACCCCACTG ACCGCGACGG GTGAGATCGC CTGGGCATCT
GTGACATCGC CGGGTTCGTC ACCCCTGCCG GACTGGTCGC GGGACGCCGG TGGTCTGACC
GGTCCGGTCA CCGGCGTCAC GGATGGTGAC GATTGGGCTA TCCGGGTCCT GGTCCAGTTC
GGCACCGGGG CGACCTGGGA CGTGCTGGTG GCAGCGGTGA CCGGCGACAG CATCTACGAC
GAGCTGCGGA TAGAGCTGTC GGCGTCGTCC GTGCTGGTGG CAGCGGTGGC CTACGGGCCC
GATTCATCTA GCCTTACCTA TATTCTCACT GACTTTACCG ACTACAGCGA CGAGCTGCCG
CATTGGCTTG AGGTGGGGGC GGCCGACGCC GGCGGCGGAA CCGTCACCTA CACGCTGCGG
ATTGACGGTG CCCTCCGGAG CTCCACCACC ACGGCCGGAT CACCGGGTGT GCCGCACCGT
CTACATATCG GCCCACGGAC GGATGGCACG GCCGCGCTGG GCCACATCGG TGTGTGGCGG
GCCCCATCCA CGGATACCTG GACAACCGTC GCTCCGGCCG TCACGGGACA CGCTGGCGAG
ACGGCTGGCC GGCGAATCGA GCGGCTGTGC CACGAGGAAG GCACCACCTG TCACATCGTG
GGCGACCCCG ACGACACCGT GACGATGGGC GCGCAACCCA CGGGCACGCT GCTAGAGCTA
ATCAGGCAGT GCGAGGACGT TGACCAGGGC ATCCTCTACG AGCCGCGTGA GGTGCTGGGG
CTGGCCTATC GAACCGTGCG GTCCCGATAC AACCAGCCCG TCACCCTCGC GTTGACCTTT
GGCGCGGACG GGGAGGTAGC GCCGCCGCTG GAGCCCGTCG ACGACGACCG GCACGTCCGC
AACGACGTTA CTGCCTCACG GTCCGGCGGG TCGTCATACC GGGCCGAGGT GACGTCGGGT
CCGCTGTCCA CGGCGGCGCC GCCGGATGGT GTCGGCCGCT ACGACCACCG AGTGACCGTC
AACGTCCCCT CCGACTCCCA CCTCGCCGAC CAGGCGTCAT GGCGCACCCA CCTCGGCACA
TGGGATGAGG CCCGCTACCC CACCATCCGC GTCGACCTGG CCGCCCTCAG CCATGCGGGC
AAACCCACGC TGATCACCGC CGCCGCCGCA GTGGACGTCG GGGACCGGCT GACCATCGGT
TCCCCGCCCG TGTGGCTGCC CCCGGACGCC ATCGATCAGC ACAGCGAGGG GTACACCGAG
ACCATCGACC AGTACACCTG GGATCTGCGG TTGGTATGCG TGCCCGCAGG TCCGTACGCG
GTGGCCGTCG TCGACGGCCC GCAACGCGTT GCGGCGGACG GCTCCACCAT CGCCGCCGTG
GACGCTGCCA CGTTGACGCT GACGATGACC TCGACCACCG AAAACGGCGC CTGGACAACA
GGCGCCGCGG ACTTTCCGCT GGACTTATTG ATCGGCGGCG GGGAACGCGT CACCGCCACC
GGCATCACCG GCACGGGACT TACCCAAACC GTGACCCTGT CCGGGCGGGC CGTCAACGGG
GTCGGCCGGG CGTGGCCGGA CGGCACCTCG GTGCAGGTCT GGGCGCCAGC GATAGCGGCC
TTGTGA
 
Protein sequence
MSAPARFEFR VEIHLGALGW VDITADVRGS VKITRGRTAE GRRVDRGTAS MRLDNSSGSY 
SPRRPTGAYY GVIGRNTPLR VSAGLAGGTL HDRFYGEVSS WPPQWSVTGV DRYVDITASG
VLRRLGQGAS PLRSPLHRAI TGTGPVSYWP VEDGSGSTQA ASGLPGGTPL TATGEIAWAS
VTSPGSSPLP DWSRDAGGLT GPVTGVTDGD DWAIRVLVQF GTGATWDVLV AAVTGDSIYD
ELRIELSASS VLVAAVAYGP DSSSLTYILT DFTDYSDELP HWLEVGAADA GGGTVTYTLR
IDGALRSSTT TAGSPGVPHR LHIGPRTDGT AALGHIGVWR APSTDTWTTV APAVTGHAGE
TAGRRIERLC HEEGTTCHIV GDPDDTVTMG AQPTGTLLEL IRQCEDVDQG ILYEPREVLG
LAYRTVRSRY NQPVTLALTF GADGEVAPPL EPVDDDRHVR NDVTASRSGG SSYRAEVTSG
PLSTAAPPDG VGRYDHRVTV NVPSDSHLAD QASWRTHLGT WDEARYPTIR VDLAALSHAG
KPTLITAAAA VDVGDRLTIG SPPVWLPPDA IDQHSEGYTE TIDQYTWDLR LVCVPAGPYA
VAVVDGPQRV AADGSTIAAV DAATLTLTMT STTENGAWTT GAADFPLDLL IGGGERVTAT
GITGTGLTQT VTLSGRAVNG VGRAWPDGTS VQVWAPAIAA L