Gene Strop_4085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_4085 
Symbol 
ID5060567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4644192 
End bp4645361 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content66% 
IMG OID640476346 
Productradical SAM domain-containing protein 
Protein accessionYP_001160893 
Protein GI145596596 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.181055 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCCG GACTCAAGCG CGAGCTCGAA GCGAAGGTGT ACGCCGGCGA GCGGCTGACC 
CGCGCGGACG GGATCGCCCT CTACGACAGC GACGACCTGA CCTGGCTGGG GCGGCTCGCG
CACCACCGGC GGTCCGAGCG CAACGGCGAC CGGGTGATGT TCAACGTCAA CCGGCACCTG
AACCTCACCA ACGTCTGCAG CGCCAGCTGT GCGTACTGCT CGTTCCAGCG CAAGCCGGGA
GAGAAGGACG CGTACACGAT GCGGATCGAC GAGGCGGTCC GCAAGGCCAA GGAGATGGAG
GACGAGCAGC TCACCGAGCT GCACATCGTC AACGGGCTGC ACCCCACCCT GCCGTGGCGC
TACTACCCGA AGGTGCTCCG TGAGCTGAAG GCGGCGCTGC CGAACGTACG GCTCAAGGCG
TTCACCGCGA CCGAGGTGCA GTGGTTCGAG AAGATCAGCG GCCTGAGCGC CGACGAGATC
CTGGACGAGC TGATGGACGC CGGCCTGGAG TCGTTGACCG GCGGCGGTGC GGAGATCTTC
GACTGGGATG TGCGGCAGCA CATCGTTGAC CACGCCTGCC ACTGGGAGGA CTGGTCCCGC
ATCCACCGGC TGGCGCACAA CAAGGGCATG AAGACCCCGT CAACCATGCT GTACGGACAC
ATCGAGGAGC CCCGGCACCG TGTTGACCAC GTGCTGCGGC TGCGTGAGTT GCAGGACGAG
ACCAACGGCT TCGTGGTCTT CATCCCGCTG CGCTACCAGC ATGACTTCGT CGACTCGGCG
GACGGCAAGA TCCGTAACCA GATCCAGGCG CGGACGACGA TGGCCTCGCC GGCGGAGTCA
CTGAAGACGT ACGCGGTGTC CCGGCTGCTG TTCGACAACG TCCCGCACGT GAAGTGCTTC
TGGGTGATGC ACGGGCTCTC GGTCGCCCAG ATGTCGCTGA ACTTCGGCGT GGACGACCTG
GATGGCTCGG TGGTGGAATA CAAGATCACC CATGACGCCG ACTCGTACGG CACGCCGAAC
ACCATGCGCC GGGCCGACCT GTTGGACCTG ATCTGGGATG CCGGGTTCCG CCCGGTCGAG
CGCAACACCC GGTACGACGT GGTGCGCGAG TACGACGCCG CGCCCTCGCT CGCCGAGCGT
CGTGCCCAGC CGCAGCAGGT GTGGGCCTGA
 
Protein sequence
MDAGLKRELE AKVYAGERLT RADGIALYDS DDLTWLGRLA HHRRSERNGD RVMFNVNRHL 
NLTNVCSASC AYCSFQRKPG EKDAYTMRID EAVRKAKEME DEQLTELHIV NGLHPTLPWR
YYPKVLRELK AALPNVRLKA FTATEVQWFE KISGLSADEI LDELMDAGLE SLTGGGAEIF
DWDVRQHIVD HACHWEDWSR IHRLAHNKGM KTPSTMLYGH IEEPRHRVDH VLRLRELQDE
TNGFVVFIPL RYQHDFVDSA DGKIRNQIQA RTTMASPAES LKTYAVSRLL FDNVPHVKCF
WVMHGLSVAQ MSLNFGVDDL DGSVVEYKIT HDADSYGTPN TMRRADLLDL IWDAGFRPVE
RNTRYDVVRE YDAAPSLAER RAQPQQVWA