Gene Sare_4467 details

Gene Information

Locus tag: Sare_4467
Symbol:
ID: 5708342
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5048597
End bp: 5049787
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 68%
IMG OID: 641273883
Product: radical SAM domain-containing protein
Protein accession: YP_001539232
Protein GI: 159039979
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR00423] radical SAM domain protein, CofH subfamily
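
The start and end coordinates, gene length, and protein length listed in this record are mutually consistent. The sketch below checks that arithmetic. It assumes inclusive, 1-based coordinates and that the gene length includes a single stop codon; the function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record for Sare_4467.
length = gene_length(5048597, 5049787)
print(length, "bp ->", protein_length(length), "aa")
```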


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.82006
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGGTGA GCCGGGAGAT CGACGACATC CTGCAACGCG GCGCGGACGG CGGGCGGATC 
ACGCCCGAGG AGGCCCTGCT GCTCTACACC GATGCGCCCT TCCACGCGCT GGGTGAGGCC
GCCGACGTGG TCCGTCGGCG ACGGTATCCG GACAACATCG TCACCTACCT GATCGACCGC
AACATCAACT ACACGAACGT CTGCGTGACG GCGTGCCGGT TCTGCGCCTT CTACCGCGCA
CCCAAGCACC GAGAGGGTTG GACCCACTCG ACGGAGGAGA TCCTGCGTCG CTGTGGCGAG
GCGGTCGGGC TGGGCGCCAC CCAGGTGATG CTGCAGGGTG GGCACCATCC CGACTACGGC
GTGGAGTACT ACGAGGAGCT GTTCTCCTCG GTGAAGAGGG CGTACCCGCA GCTCGCCATC
CACTCGATCG GCCCGAGCGA GATCCTGCAC ATGGCGAAGG TCTCCGGTGT GAGCCTGACC
GAGGCCATCA CCCGGATCCA GGCTGCTGGC CTGGACTCGA TCGCCGGTGC CGGCGCCGAG
ATGCTGCCGG CCCGGCCGAG GAAGGCGATC GCCCCGCTGA AGGAGTCCGG TGAGCGCTGG
CTCGAGGTGA TGGAGCTGGC CCACCAGCAG GGCATCGAGT CGACCGCGAC GATGATGATG
GGCACCGGTG AGACCGCCGC GGAGCGGATC GAGCACCTTC GGATGATCCG TGACGTGCAG
GATCGCACGC GGGGTTTCCG GGCGTTCATC CCGTGGACCT ACCAGCCGGA GAACAACCAC
CTCAAGGGCC GGACCCAGGC CACCACCCTG GAGTACCTGC GGCTGGTGGC GGTGTCCCGG
CTGTTCTTCG AGACCGTGCC GCATCTCCAG GCGTCGTGGC TGACCACCGG CAAGGACGTC
GGCCAGCTCG CCCTGCACAT GGGCGTCGAC GATCTGGGTT CGATCATGTT GGAGGAGAAC
GTCATCTCCT CGGCCGGGGC CCGACACCGT TCGAACCTGC ATGAGCTGAT CGGGATGATC
CGGTCGGCGG ACCGGATCCC CGCCCAACGG GACACCCACT ACCACCGGCT CGTCGTGCAC
CGGACGCCCG CTGACGACCC CACGGACGAC CGGGTCGTGT CGCACTTCTC CTCGATCGCC
CTGCCGGGTG GCGGCGCCGG GAAGGCGTTG CCACTGGTGG ACGCCGGCTG A
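
The gene sequence is laid out in space-separated blocks of ten bases, so the layout has to be stripped before counting anything. A minimal sketch of that cleanup and a GC-fraction calculation follows. The helper names `strip_layout` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here as sample input.

```python
def strip_layout(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence, ignoring layout whitespace."""
    bases = strip_layout(seq)
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First line of the gene sequence for Sare_4467, as sample input only.
first_line = "GTGACGGTGA GCCGGGAGAT CGACGACATC CTGCAACGCG GCGCGGACGG CGGGCGGATC"
print(len(strip_layout(first_line)), round(gc_fraction(first_line), 3))
```

Running `gc_fraction` over all twenty lines of the gene sequence gives a figure that can be compared with the GC content listed in the record.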
 
Protein sequence
MTVSREIDDI LQRGADGGRI TPEEALLLYT DAPFHALGEA ADVVRRRRYP DNIVTYLIDR 
NINYTNVCVT ACRFCAFYRA PKHREGWTHS TEEILRRCGE AVGLGATQVM LQGGHHPDYG
VEYYEELFSS VKRAYPQLAI HSIGPSEILH MAKVSGVSLT EAITRIQAAG LDSIAGAGAE
MLPARPRKAI APLKESGERW LEVMELAHQQ GIESTATMMM GTGETAAERI EHLRMIRDVQ
DRTRGFRAFI PWTYQPENNH LKGRTQATTL EYLRLVAVSR LFFETVPHLQ ASWLTTGKDV
GQLALHMGVD DLGSIMLEEN VISSAGARHR SNLHELIGMI RSADRIPAQR DTHYHRLVVH
RTPADDPTDD RVVSHFSSIA LPGGGAGKAL PLVDAG
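
The protein begins with M although the gene begins with GTG, which is consistent with GTG acting as an alternative start codon under translation table 11. The sketch below illustrates this on the first ten codons. It assumes the table 11 amino acid assignments match the standard code and that the listed start codons are the ones table 11 permits; `translate` is an illustrative helper, not a database tool.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# assumed to be shared by translation table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Start codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(seq: str) -> str:
    """Translate a CDS, reading the first codon as Met if it is a start codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if residue == "*":  # stop codon ends translation
            break
        if i == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is read as methionine
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("GTGACGGTGA GCCGGGAGAT CGACGACATC"))  # MTVSREIDDI
```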