Gene Caci_8066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_8066 
Symbol 
ID8339444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp9360567 
End bp9361727 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content66% 
IMG OID644961151 
ProductRadical SAM domain protein 
Protein accessionYP_003118730 
Protein GI256397166 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.437454 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTCCG GGCTCAAGCG GGATCTCGAG GACAAGGTCC TCTCCGGCTC GCGGCTTTCC 
TTCGAGGACG GCGTGGCGCT GTACGACACC GATGAGGTGG CCTGGCTCGG CGAACTCGCC
CACGAGATGC GGACCCGCAA AAACGGCGAC AAGGTCTTCT TCAACGTCAA CCGCCACCTG
AACATGACGA ACGTCTGTTC GGCGTCGTGC GCGTACTGCT CCTTCCAGCG CAAGCCGGGG
GAGAAGGACG CCTACACGAT GCGCATTGAG GAAGCGGTCC GCCTGGCCAA GGACATGGAG
CCGGACGGCA TCACCGAGCT GCACATCGTC AACGGCCTGC ACCCGACGCT GCCGTGGCGC
TACTACCCGA AGTCGATCAG CGAGCTCAAG GCGGTGCTGC CCGGCGTCTC GATCAAGGCC
TTCACCGCCA CCGAGATCCA CTGGTTCGAG AAGATCTCCG GCCTGAGCGC GGAGGAGATC
CTCGACGAGC TGATCGAGGC CGGTCTGGAG TCCCTGACCG GCGGCGGCGC CGAGATCTTC
GACTGGGAGA TCCGCTCGCA GATCGTGGAC CACGCCACCC ACTGGGAGGA CTGGTCCCGC
ATCCACCGCC TCGCGCATGC CAAGGGCCTG CGCACCCCGG CCACCATGCT CTACGGCCAC
ATCGAGGAGC CCCGGCACCG CGTGGACCAC GTGCTCCGGC TGCGCGAGCT CCAGGACGAG
ACCGGCGGCT TCGCCGTCTT CATCCCCCTG CGCTTCCAGC ACGACTTCCA CGACAGCAAG
GACGGCAAGG TCCGCAACCG CCTCATGAAC CAGCCGATGG CCACCGGCGT CGAGGCACTG
AAGACCTTCG CCGTCTCCCG CCTGATGCTC GACAACTTCG ACCACGTGAA GTGCTTCTGG
GTCATGCACG GCCTGTCCAC CGCCCAGCTC GCGCTCAACT ACGGCGCCGA CGACCTGGAC
GGCTCGGTGG TCGAGTACAA GATCACCCAC GACGCCGACG ACTACGGCAC CCCGAACAAG
ATGACCCGCG AAGACCTCCT CGAGCTGATC CGCGACGCCG GCTTCACCCC GGTCGAGCGC
AACACCCGCT ACGAGATCAT CCGCGAGTAC GACGGCCCGG AGCCGGCCCG CCGCGAAGAG
CCGCAGCTGA TGACCTTCTG A
 
Protein sequence
MDSGLKRDLE DKVLSGSRLS FEDGVALYDT DEVAWLGELA HEMRTRKNGD KVFFNVNRHL 
NMTNVCSASC AYCSFQRKPG EKDAYTMRIE EAVRLAKDME PDGITELHIV NGLHPTLPWR
YYPKSISELK AVLPGVSIKA FTATEIHWFE KISGLSAEEI LDELIEAGLE SLTGGGAEIF
DWEIRSQIVD HATHWEDWSR IHRLAHAKGL RTPATMLYGH IEEPRHRVDH VLRLRELQDE
TGGFAVFIPL RFQHDFHDSK DGKVRNRLMN QPMATGVEAL KTFAVSRLML DNFDHVKCFW
VMHGLSTAQL ALNYGADDLD GSVVEYKITH DADDYGTPNK MTREDLLELI RDAGFTPVER
NTRYEIIREY DGPEPARREE PQLMTF