Gene Sare_4505 details


Gene Information

Locus tag: Sare_4505
Symbol:
ID: 5707026
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5092030
End bp: 5093199
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 67%
IMG OID: 641273919
Product: radical SAM domain-containing protein
Protein accession: YP_001539268
Protein GI: 159040015
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR00423] radical SAM domain protein, CofH subfamily
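
The coordinates and lengths in this record are mutually consistent, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based inclusive positions and that the 1170 bp CDS ends with a stop codon; the record states neither convention explicitly. The function names are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(5092030, 5093199)
print(length_bp)                  # 1170
print(protein_length(length_bp))  # 389
```

Both values agree with the Gene Length and Protein Length fields of the record.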


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.325974
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0287993
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGCCG GACTCAAGCG CGAGCTCGAA GCGAAGGTGT ACGCCGGCGA GCGGTTGACC 
CGCGCGGACG GGATCGCCCT CTACGAGAGC GACGACCTGA CCTGGTTGGG TCGGCTCGCG
CACCACCGGC GTACCGAGTT GGCCGGCGAC CGGGTGATGT TCAACGTCAA CCGGCACCTG
AACCTGACCA ACGTCTGCAG CGCAAGTTGT GCGTACTGCT CGTTCCAGCG CAAGCCGGGG
GAGAAGGACG CGTACACGAT GCGGATCGAC GAGGCGGTCC GCAAGGCCAA GGAGATGGAG
GACGAGCAGC TCACCGAGCT GCACATCGTC AACGGCCTGC ACCCCACCCT GCCGTGGCGG
TACTACCCGA AGGTGCTCCG TGAGCTGAAG GCGGCCCTGC CGAACGTGCG ACTCAAGGCG
TTCACCGCGA CCGAGGTGCA GTGGTTCGAG AAGATCAGCG GCCTGAGCGC CGACGAGATC
CTCGACGAGT TGATGGACGC CGGCCTGGAG TCGTTGACCG GCGGCGGTGC GGAGATCTTC
GACTGGGAGG TCCGGCAGCA CATCGTCGAC CACGCCTGCC ACTGGGAGGA CTGGTCGCGC
ATCCACCGGC TGGCGCACGG CAAGGGTATG AGGACGCCGT CGACCATGCT GTATGGCCAC
ATCGAGGAGC CTCGGCACCG TGTCGACCAC GTGCTGCGAC TGCGCGAGCT GCAGGACGAG
ACGAACGGCT TCGCGGTCTT CATCCCGCTG CGCTACCAGC ACGACTTCGT CGACTCGGCA
GACGGCAAGA TCCGTAACCG GATCCAGGCG CGCACGACGA TGGCCTCGCC GGCGGAGTCG
CTGAAGACCT TCGCGGTGTC CCGGCTGCTC TTCGACAACG TCCCGCACGT CAAGTGCTTC
TGGGTGATGC ACGGACTCTC GGTCGCCCAG CTGTCGCTGA ACTTCGGTGT GGACGACCTG
GACGGCTCGG TGGTGGAATA CAAGATCACG CATGACGCGG ACTCGTACGG CACCCCGAAC
ACCATGCACC GGGCCGACCT GCTGAACCTG ATCTGGGACG CCGGCTTCCG CCCGGTCGAA
CGCAACACCC GGTACGAGGT GGTGCGCGAG TACGACGCCG CGCCCTCGCT CGCCGAGCGC
CGCGCCGAGC CGCAGCAGGT CTGGGCCTGA
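
A GC content figure like the 67% listed in this record is typically the fraction of G and C bases in the nucleotide sequence. The sketch below shows one way to compute it from a sequence laid out in space-separated blocks of ten bases, as shown here. How the database itself derives or rounds the value is not stated, so the helper `gc_content` is a hypothetical illustration and is demonstrated only on the first ten bases of the gene.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases; whitespace and line breaks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First block of the gene sequence: 7 of its 10 bases are G or C.
first_block = "ATGGACGCCG"
print(gc_content(first_block))  # 0.7
```

Passing the full gene sequence, with its spaces and line breaks, to the same function gives the gene-wide fraction.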
 
Protein sequence
MDAGLKRELE AKVYAGERLT RADGIALYES DDLTWLGRLA HHRRTELAGD RVMFNVNRHL 
NLTNVCSASC AYCSFQRKPG EKDAYTMRID EAVRKAKEME DEQLTELHIV NGLHPTLPWR
YYPKVLRELK AALPNVRLKA FTATEVQWFE KISGLSADEI LDELMDAGLE SLTGGGAEIF
DWEVRQHIVD HACHWEDWSR IHRLAHGKGM RTPSTMLYGH IEEPRHRVDH VLRLRELQDE
TNGFAVFIPL RYQHDFVDSA DGKIRNRIQA RTTMASPAES LKTFAVSRLL FDNVPHVKCF
WVMHGLSVAQ LSLNFGVDDL DGSVVEYKIT HDADSYGTPN TMHRADLLNL IWDAGFRPVE
RNTRYEVVRE YDAAPSLAER RAEPQQVWA
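
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. A minimal sketch of that translation is given below. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs from the standard code only in the start codons it permits), and it stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative, and handling of alternative start codons is deliberately left out.

```python
BASES = "TCAG"
# Amino acids in TCAG order for codon positions 1, 2, 3; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last 30 bases of the gene sequence in this record.
print(translate("ATGGACGCCG GACTCAAGCG CGAGCTCGAA"))  # MDAGLKRELE
print(translate("CGCGCCGAGC CGCAGCAGGT CTGGGCCTGA"))  # RAEPQQVWA
```

The two outputs match the first ten and last nine residues of the protein sequence above.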