Gene Strop_4068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_4068 
Symbol 
ID5060550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4626526 
End bp4627710 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content69% 
IMG OID640476329 
Productradical SAM domain-containing protein 
Protein accessionYP_001160876 
Protein GI145596579 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.18733 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCGGG AGATCAACGA CATCCTGCAA CGCGGCGCGG ACGGTGGGCG GATCACGCCC 
GACGAGGCCC TGCTGCTCTA CACCGAAGCG CCCTTTCACG CGCTGGGTGA GGCGGCCGAC
GCGGTACGGC GGCGGTGGTA CCCGGACAAC ATCGTCACGT ACCTGATCGA CCGCAACATC
AACTACACGA ACGTCTGCGT GACGGCGTGC CGGTTCTGCG CCTTCTACCG TGCGCCCAAG
CACCGGGAGG GCTGGACCCA CCCGACCGAG GAGATCCTGC GCCGTTGCGG CGAGGCGGTT
GAGCTGGGTG CCACCCAGGT GATGTTGCAG GGTGGGCACC ATCCCGACTA CGGGGTGGAG
TACTACGAGG AGCTCTTCTC CTCGGTGAAG CGGGCGTACC CGCAGCTCGC CATCCACTCG
ATCGGCCCGA GCGAGATCCT GCACATGGCG AAGGTGTCCG GCGTGGGTCT GACCGAGGCC
ATCACCCGCA TCAAGGCGGC TGGCCTGGAC TCGATCGCGG GCGCCGGCGC CGAGATGCTG
CCCGCCCGGC CGCGGAAGGC GATCGCGCCG CTGAAGGAGT CCGGGGAGCG CTGGCTCGAG
GTGATGGAGC TCGCCCACCA GCAGGGCGTC GAGTCGACCG CGACGATGAT GATGGGAACC
GGTGAGACCG CCGCTGAGCG GATCGAGCAC CTCCGGATGA TCCGCGACGT GCAGGACCGG
ACGCGGGGCT TCCGGGCGTT CATCCCGTGG ACCTACCAGC CGGAGAACAA CCACCTCAAG
GGCCGGACCC AGGCCACCAC CCTGGAGTAC CTGCGGTTGG TGGCGGTGTC CCGGCTCTTC
TTCGAGACGG TGCCGCACCT CCAGGCGTCG TGGCTCACCA CCGGCAAGGA TGTCGGCCAG
CTCGCGCTGC ACATGGGGGT TGACGATCTG GGCTCGATCA TGCTGGAGGA GAACGTCATC
TCCTCGGCGG GCGCGAGGCA CCGTTCGAAC CTGCACGACC TGATCGGAAT GATCCGCTCG
GCGGACCGGA CCCCCGCCCA GCGGGACACC CACTACCGCC GGCTCGCTGT GCACCACACT
CCCGCGGACG ACCCGCGGGA CGACCGGGTG GTGTCGCACT TCTCGTCGAT TGCCCTGCCA
GGCGGTGGCG CCGGGAAGAC GCTGCCACTG GTCGACGCCG GCTGA
 
Protein sequence
MSREINDILQ RGADGGRITP DEALLLYTEA PFHALGEAAD AVRRRWYPDN IVTYLIDRNI 
NYTNVCVTAC RFCAFYRAPK HREGWTHPTE EILRRCGEAV ELGATQVMLQ GGHHPDYGVE
YYEELFSSVK RAYPQLAIHS IGPSEILHMA KVSGVGLTEA ITRIKAAGLD SIAGAGAEML
PARPRKAIAP LKESGERWLE VMELAHQQGV ESTATMMMGT GETAAERIEH LRMIRDVQDR
TRGFRAFIPW TYQPENNHLK GRTQATTLEY LRLVAVSRLF FETVPHLQAS WLTTGKDVGQ
LALHMGVDDL GSIMLEENVI SSAGARHRSN LHDLIGMIRS ADRTPAQRDT HYRRLAVHHT
PADDPRDDRV VSHFSSIALP GGGAGKTLPL VDAG