Gene Sare_1022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1022 
Symbol 
ID5708183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1144357 
End bp1145592 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content67% 
IMG OID641270539 
ProductErfK/YbiS/YcfS/YnhG family protein 
Protein accessionYP_001535924 
Protein GI159036671 
COG category[S] Function unknown 
COG ID[COG1376] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.575167 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAGGT TCACGCGAGC CGGGGCGGTC GTGGGCGCCT TCGGCCTGGT GGCAGCACTG 
GCACTGGCCG GGTGCGGCAA CGACCGGGAG CAGGCGACCT TCGTCGACGG GGCGACGCCA
GCCGAGCGCG GGTCCCCCCG AGAGCCGGTC GCGCCACTTT CCGTAAGCCC GGCGGAAGGG
GCGGCGAAGC AACCGGTAAG TGCGGAGATC AGCGCCACCG ACCTCGCGGA CGGGTCAAGC
TTGTCCGCCG TCACCCTGGC TACCGCCGAC GGTACGGCGG TCGAGGGTCA ACTGCGCGCT
GACGGCTCCT CGTGGGTGCC ATCGACGCCG TTGGAGTACG GCGCCAGCTA CACCGCGACG
GTCACCGCGA CCACTGCCAA CGGCGGTACC CGTGCAGGGC ACAGCACGTT CACGACCATG
GCGAAGCCGA AGTCGATGAT CAGCTCCGGT CTCTACCTCT TCGACGACAG GGAGTACGGC
GTGGCGATGC CGGTGGTCGT CGAGTTCCAC CCCGGCATCC CGGAGCAGGA TCGGGCGGCG
GTGCAGCGGC GAATGTTCGT GCGAACCGAC CCGGTGCAGC CGGGCGCCTG GCACTGGGTC
AACGGCAACC AGGCGTACTA TCGTGCCCCG GAGTACTGGC GTCCGGGCAC GACTCTGCAC
GTCCGGGTCG CGTTGGCCGG TGTTCCGTTG AGCAACGGCC GGTACGGCAA CGTCGACCGG
GTCGCAACGG TCAAGATCGG CCGGGCCTTC GAAATGAAGG TCGACAACGC CAGCAAACAG
ATGACGGTGT ACGAGGACGG TCAGGTGATC CGCACCCTGC CAGTGAGCCT GGGCGCGAAG
AAGACCCCCT CCTCCAGCGG CATGATGGTG ATCATGGAGA AGAAGGAGTC CACCGTCTTT
GACACCCGTG ACGAACCGGA CCCGGACAAC CGCTACGTCA CCGAGATCGA CTACGCCCAG
CGGCTCACCT GGAACGGCGA GTACATCCAC GCCGCACCCT GGTCAGAGCA TGTGCAGGGG
CGGCAGAACG TCTCGCACGG CTGCGTCAAC ATCTCGACTG CCCATGCCCG GTGGTTGTTC
AGCAAGACAA AGATCGGCGA CCCGATCGCC GTCGCCGGCA CCGAGCGACA GCTGACGGCC
GGCAATGGGT GGACGGCGTG GAACCTGAGC TGGCCGGAGT TCGTCAAGGG CAGTGCCTTG
TCGGTACCAG AGGGGGGCGC GGGCTCGCCC TTCTGA
 
Protein sequence
MGRFTRAGAV VGAFGLVAAL ALAGCGNDRE QATFVDGATP AERGSPREPV APLSVSPAEG 
AAKQPVSAEI SATDLADGSS LSAVTLATAD GTAVEGQLRA DGSSWVPSTP LEYGASYTAT
VTATTANGGT RAGHSTFTTM AKPKSMISSG LYLFDDREYG VAMPVVVEFH PGIPEQDRAA
VQRRMFVRTD PVQPGAWHWV NGNQAYYRAP EYWRPGTTLH VRVALAGVPL SNGRYGNVDR
VATVKIGRAF EMKVDNASKQ MTVYEDGQVI RTLPVSLGAK KTPSSSGMMV IMEKKESTVF
DTRDEPDPDN RYVTEIDYAQ RLTWNGEYIH AAPWSEHVQG RQNVSHGCVN ISTAHARWLF
SKTKIGDPIA VAGTERQLTA GNGWTAWNLS WPEFVKGSAL SVPEGGAGSP F