Gene Sare_4645 details


Gene Information

Locus tag: Sare_4645
Symbol:
ID: 5706232
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5264816
End bp: 5265946
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 66%
IMG OID: 641274046
Product: hypothetical protein
Protein accession: YP_001539393
Protein GI: 159040140
COG category:
COG ID:
TIGRFAM ID:
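
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are inclusive and that the gene length counts the stop codon; both assumptions agree with the values listed here but are not stated by the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 5264816
end_bp = 5265946
gene_length_bp = 1131
protein_length_aa = 376

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the unspliced CDS is a whole number of codons and the
# final codon is the stop codon, which adds no residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)
print(remainder == 0 and expected_protein_aa == protein_length_aa)
```

Both checks hold for this record: the span is 1131 bp, which is 377 codons, or 376 residues plus a stop codon.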


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0894942
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGGCGG TGCTGCAGGT CGTGGGCTTG GGGTCGATGT GGATGGTGGT AGCCCTGCTT 
CTCCCGCAGG CCTTGGGCGA TGCCAAACGG CGCATGCTGT GGGCGGTTGT CCTGCTGTTC
GCGATCGAGC TGACGCTGTA CCGGCCAGAG GTGCAGGCAC CGCTGTACGA CGTGATCAAC
GGACATGTCG TCTTCGTGGC GGTGCATCTG GTGAGCGTCG GCAAAGGGGT CGGCGTGCTG
TACCTGCTGC TCGTGCTGGT ACAGCGCCCA CGTTTTCGGC TACCGGTAGC GGCATCGGGG
GTGGCGGTCT CAGCCGCGAT GATCGCTATC TATGCGGCGG CTCGGCCGGA GCCAGCGACG
GTTGACATCC CACCGGAGAT ACCGCTGGTC TACTGGCACA TTCTTGCGGT GTTCCACACG
GTTGCGCATC TGCTGGCAGT TGGGTTGTGT TGGCACGCGA GTCGGCTTGT CGCGCCGAGA
GCGATGCGCA TCAGCCTGAT GGCGTTGACG GGTGGGCTGT TGCTGGCATG CCTGCCGTGG
GCGTTCAACC TCGGCTGGCT CCTCAGCGAT GACACCGCCT GGCTCGCTCC GATCGGACCG
ATTGACACGG TGACCGGGCT GTTCTTCGCC TTCGGTGCCG CGTTGCCGCT GGCGGCGTCG
GTGCGACGGG CAGTGCGGCA CGACCGGGCA ATACGTCAGC TGGAGCCGCT GTGGCGAGAG
CTCACCTCCG TGGTACCCGA CGTTGTCTTC GAAGCGGTCC GGCCCGGGCT CGGTACGCGT
CAGCGTCGGC TGCGGCTGTA TCGACGCGTG GTTGAGGTTC GAGACGCGAT GCTGGTGCTG
CGTGAGTATG TCACTGCTGA TGATCTACGC GGGGCGCAGG AACATGTTGC CGCCGAGTTG
CCCGACGAGC ATCGACGGGA GGCCGCGGCC ACCGCCTGCT GGCTGGCCGC CGCGGTAGCT
GCGAAGTCGC GTGGCGACGC GCCGATGGTG CAGCAGGAGG ACCTGACGAG CGCCCCCGGT
GACGATCTCG ACGAAGAGGT TACGCAGCTA CTGGAGGTAG CGCAGTGGTA CCGCTCGTCG
CTGGTGAGTC GGTATCGGAC CGGACTGCCG CCGGTCACCG CGTCCCAGTA G
 
Protein sequence
MTAVLQVVGL GSMWMVVALL LPQALGDAKR RMLWAVVLLF AIELTLYRPE VQAPLYDVIN 
GHVVFVAVHL VSVGKGVGVL YLLLVLVQRP RFRLPVAASG VAVSAAMIAI YAAARPEPAT
VDIPPEIPLV YWHILAVFHT VAHLLAVGLC WHASRLVAPR AMRISLMALT GGLLLACLPW
AFNLGWLLSD DTAWLAPIGP IDTVTGLFFA FGAALPLAAS VRRAVRHDRA IRQLEPLWRE
LTSVVPDVVF EAVRPGLGTR QRRLRLYRRV VEVRDAMLVL REYVTADDLR GAQEHVAAEL
PDEHRREAAA TACWLAAAVA AKSRGDAPMV QQEDLTSAPG DDLDEEVTQL LEVAQWYRSS
LVSRYRTGLP PVTASQ
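
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes NCBI translation table 11 (the table named in the record), which uses the standard amino-acid assignments and accepts GTG, among other codons, as a start codon that is read as methionine. The function and constant names are hypothetical, and only the first 60 bases of the gene sequence are used to keep the example short.

```python
# Amino-acid assignments for NCBI translation table 11, with codons
# enumerated in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Initiation codons permitted by table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        residue = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            residue = "M"  # alternative start codons still encode Met
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence from this record (60 bases).
first_60 = "GTGACGGCGG TGCTGCAGGT CGTGGGCTTG GGGTCGATGT GGATGGTGGT AGCCCTGCTT"
print(translate(first_60))  # MTAVLQVVGLGSMWMVVALL
```

The output matches the first 20 residues of the protein sequence listed above. The leading GTG would be read as valine anywhere else in the reading frame, which is why the start codon is handled separately.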