Gene Sare_3244 details

Gene Information

Locus tag: Sare_3244
Symbol:
ID: 5705395
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3737672
End bp: 3738781
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 68%
IMG OID: 641272672
Product: band 7 protein
Protein accession: YP_001538039
Protein GI: 159038786
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0330] Membrane protease subunits, stomatin/prohibitin homologs
TIGRFAM ID:
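
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short Python sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the coding sequence ends in a single stop codon that is not counted in the protein length; neither assumption is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 3737672
END_BP = 3738781


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len):
    """Length in aa, assuming the final codon is an uncounted stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1110
print(protein_length(length_bp))  # 369
```

Both results agree with the Gene Length and Protein Length fields of this record.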


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACTTCC TGTTGCCGGT CCTTTTGATA GCTGTGGCGG TCATCGGCGT GGTGACCCTG 
GCCCAGGCGG TGCGGATCGT GCCGCAGCAG CGCCAGGATG TGGTGGAGCG GCTCGGCCGG
TACAAGCGCA CCCTGGACCC GGGGCTGAAC GTGCTGGTGC CGTTCATCGA CTCGGTGCGT
ACCAAGGTCG ACATGCGTGA GCAGGTGGTC AGCTTCCCGC CCCAGCCGGT CATCACCTCG
GACAACCTGG TCGTCTCGAT CGATACTGTC CTCTATTTCA AGGTTGTGGA CTCGGTTCGC
GCCACGTACG AGATTTCGCA TTTTCTCCAG GCCATCGAGC AGCTCACGGT GACCACGTTG
CGTAACGTCA TCGGTTCTCT TGATCTGGAG CGGGCGCTGA CCAGCCGGGA GGAGATCAAC
CGGCACCTGT CCGGCGTGCT GGACGAGACC ACCGGTAGGT GGGGGATCAA GGTGACCCGG
GTGGAGATCA AGGCGATCGA GCCGCCGCCG AGCATCCGGG ACTCGATGGA GAAGCAGATG
CGCGCCGAGC GGGACCGTCG GGCGGCGATC CTCAACGCGG AGGGGCACAA GCAGTCGCAG
ATCCTGACCG CCGAGGGCGA GAAGCAGGCG GCGGTCCTGC GCGCCGACGG TGACCGGCAG
GCCCGCATCC TTCAAGCTGA GGGGCAGGCC AAGGCGGTCC GTACCGTCTT CGACGCCATC
CACCAGGCAA ACCCGAGCCA GAAGGTGCTC GCCTATCAGT ACCTGCAGGC GCTGCCGCAG
ATCGCCAACG GCTCCGCCAA CAAGGTCTGG ATCGTCCCGG CCGAGCTGAC GAAGGCGTTG
GAGGGTATGG GCGGTGCGCT CGGGGGTCTG AGCCAGATGG CCGGTGACGC GCCGTCACCG
GAGGCGTCCG ACGGGGCGAG CCAGGTCGAG CGGGAGGCCG ACGAGGCCGC GCGTGCCGCA
GCGGATGCCG CGCGGGAGAT CCACGACGAG GTTCGCGTCG CCGAGGCCCA GGCTGCCGGG
AGCAAGGGGC CGAAGGGGTT GCCCGCACCC GAGCCGGTCT CCCCGGAGAG TCTGCGGACC
GACACCACCG AGCAGCGCGA GCGGGGCTGA
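
A GC content figure such as the one in the Gene Information section is the fraction of G and C bases in a sequence. The Python sketch below shows the calculation on the first 60-base row of the gene sequence only, so its result is not expected to match the whole-gene value. The function name gc_fraction is hypothetical, and skipping the spaces between 10-base groups is an assumption about the listing layout.

```python
def gc_fraction(sequence):
    """Return the fraction of G and C among the A/C/G/T characters."""
    # Keep only nucleotide letters; spaces and line breaks are ignored.
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return gc_count / len(bases)


# First row of the gene sequence, copied from the listing above.
first_row = "ATGGACTTCC TGTTGCCGGT CCTTTTGATA GCTGTGGCGG TCATCGGCGT GGTGACCCTG"
print(f"{gc_fraction(first_row):.1%}")
```

Passing the full 1110-base sequence to the same function would give the whole-gene figure.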
 
Protein sequence
MDFLLPVLLI AVAVIGVVTL AQAVRIVPQQ RQDVVERLGR YKRTLDPGLN VLVPFIDSVR 
TKVDMREQVV SFPPQPVITS DNLVVSIDTV LYFKVVDSVR ATYEISHFLQ AIEQLTVTTL
RNVIGSLDLE RALTSREEIN RHLSGVLDET TGRWGIKVTR VEIKAIEPPP SIRDSMEKQM
RAERDRRAAI LNAEGHKQSQ ILTAEGEKQA AVLRADGDRQ ARILQAEGQA KAVRTVFDAI
HQANPSQKVL AYQYLQALPQ IANGSANKVW IVPAELTKAL EGMGGALGGL SQMAGDAPSP
EASDGASQVE READEAARAA ADAAREIHDE VRVAEAQAAG SKGPKGLPAP EPVSPESLRT
DTTEQRERG
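
The protein sequence corresponds to the gene sequence read codon by codon. As a hedged illustration, the Python sketch below translates the first and last rows of the gene sequence and compares them with the matching ends of the protein sequence. It relies on translation table 11 assigning the same amino acids to codons as the standard genetic code (the two tables differ in their permitted start codons); the helper names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the listing above.
# The last row starts in frame because the 1080 bases before it are a
# multiple of three.
first_row = "ATGGACTTCC TGTTGCCGGT CCTTTTGATA GCTGTGGCGG TCATCGGCGT GGTGACCCTG"
last_row = "GACACCACCG AGCAGCGCGA GCGGGGCTGA"

print(translate(first_row))  # MDFLLPVLLIAVAVIGVVTL
print(translate(last_row))   # DTTEQRERG
```

The two outputs match the first 20 and last 9 residues of the protein sequence listed above.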