Gene Sare_4111 details


Gene Information

Locus tag: Sare_4111
Symbol:
ID: 5707662
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4671274
End bp: 4672287
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 71%
IMG OID: 641273539
Product: hypothetical protein
Protein accession: YP_001538892
Protein GI: 159039639
COG category: [T] Signal transduction mechanisms
COG ID: [COG3480] Predicted secreted protein containing a PDZ domain
TIGRFAM ID:
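
The length fields in this record are consistent with its coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the page does not state this) and that the coding sequence ends in a single stop codon. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of a CDS: number of codons minus one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4671274, 4672287)
print(length)                     # 1014
print(protein_length_aa(length))  # 337
```

Both values match the Gene Length and Protein Length fields of the record.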


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.404702
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0634839
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGACGTC GCGGCCTGAC CGTCCTGCTC GGTGCCCTGT TCACTGCTCT GCTCGGCATC 
GGCGTGCTCG CAGCACCCAT CCCGTACGTG GTGCTGGGCC CCGGTCCGAC CGTCGACACG
CTGGGCACCG AGGACGGTAC GGAGGTCATC CAGGTCACCG GCCGGGAGAC CTCCGACTCG
ACTGGGGAGC TCCGGTTGAC CACGGTGGGG GTGCAGCCCT CGGTCAAGCT GCGCACGGCC
ATCCAGGGGT GGTTCTCCGA CGACGAGGCG GTGGTGCCGC GCGAGTTGGT GTACCCGCCG
GGGGAGAGCC GGGAGGAGGT CGAGGAACGC AACGCGGAGG ACTTCAAGGT CTCCCAGACC
AGCGCGGAGA CGGTGGCTCT GCGTGAGCTC GGGTTCCCGG TGCGGGTGGT GGTCAAGACG
GTGGCCGAGG ACGGGCCGTC GGTGGGCCTG CTCCGCCCCG GTGACGTGGT GGACTCGGTC
AACGGGCAAC CCGTCCCGGT GGCCTCCCGG CTGACCGAGT TGATCCGGGC CGAGCCGCCC
GGCGCCACCC TCGAGATCGG CTACATCCGG GACGGGGCTC CCGGGACCGC GCGGATCACC
AGTCAGGAGA AGGACGGCCG GCCCCGGATC GGGGTCGGAA TCGAGCAGCA GCAGCCGCAC
CCGTTCACAC TGACCATCGA CCTGGAGGAC ATCGGTGGCC CGAGTGCCGG GCTCATGTTC
GCCCTCGGCA TCATCGACAA GCTGACGCCG GATGACCTGA CCGGTGGTCA GATCATCGCC
GGCACCGGCA CGATCGACGA CGAGGGCCGG GTCGGCCCGA TCGGGGGCAT ACCCCAGAAG
CTGGTCGGCG CCAAGGACGC CGGCGCGACC GCCTTCCTGG TTCCGGCCGA CAACTGTGCC
GAGGCCGTCC GCAATCCACA ACCCGGCCTG CCGTTGCTCA AGGTGGCGAC GCTGGACGAG
GCGCTGACCG CCCTTGAGGC CCTGCGAGCG GGGGGCGAAC CGGCCCGCTG CTGA
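
A GC content figure such as the one in the Gene Information section can be recomputed from a gene sequence in this layout. This is a minimal sketch, assuming GC content means the percentage of G and C among the A, C, G and T bases, with the spaces and line breaks of the 10-base blocks ignored. The function name `gc_percent` is hypothetical, and the example input is only the first two blocks of the sequence, not the whole gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    # Keep only nucleotide letters so block spacing and newlines are ignored.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence (13 of 20 bases are G or C).
print(gc_percent("ATGAGACGTC GCGGCCTGAC"))  # 65.0
```

Passing the full gene sequence to the same function gives the value for the whole gene.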
 
Protein sequence
MRRRGLTVLL GALFTALLGI GVLAAPIPYV VLGPGPTVDT LGTEDGTEVI QVTGRETSDS 
TGELRLTTVG VQPSVKLRTA IQGWFSDDEA VVPRELVYPP GESREEVEER NAEDFKVSQT
SAETVALREL GFPVRVVVKT VAEDGPSVGL LRPGDVVDSV NGQPVPVASR LTELIRAEPP
GATLEIGYIR DGAPGTARIT SQEKDGRPRI GVGIEQQQPH PFTLTIDLED IGGPSAGLMF
ALGIIDKLTP DDLTGGQIIA GTGTIDDEGR VGPIGGIPQK LVGAKDAGAT AFLVPADNCA
EAVRNPQPGL PLLKVATLDE ALTALEALRA GGEPARC
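
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the NCBI base ordering (T, C, A, G). Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts, so the same table is used here. The helper `translate` is hypothetical and stops at the first stop codon. It does not apply the rule that an alternative start codon is read as M, which is not needed for this gene because it begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in the order TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence up to (not including) the first stop codon."""
    seq = "".join(dna.upper().split())  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGAGACGTC GCGGCCTGAC CGTCCTGCTC"))  # MRRRGLTVLL
```

The output matches the first ten residues of the protein sequence above. The final twelve bases of the gene, GCCCGCTGCTGA, translate to ARC followed by the stop codon, which agrees with the end of the protein.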