Gene Cmaq_1416 details


Gene Information

Locus tag: Cmaq_1416
Symbol:
ID: 5709312
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 1494760
End bp: 1495836
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 46%
IMG OID: 641275926
Product: radical SAM domain-containing protein
Protein accession: YP_001541231
Protein GI: 159041979
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR00423] radical SAM domain protein, CofH subfamily
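
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1494760
END_BP = 1495836


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1077
print(protein_length(length_bp))  # 358
```

Both results agree with the Gene Length (1077 bp) and Protein Length (358 aa) fields.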


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.150894
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCAGG CATTGGAGAA GGCCCTTAGT GGTGAAGAGT TGAAGCCCCC TGATATTGAG 
GAGCTTCTTG AATTAAACCT ATGGGACCTA GGCATCGCTG CAGGGGAGTT AACTAGGAGG
TTGTTTAATG GTGTGGTGAC CTTCATACCC AATCTAATAC TGAACTACAC CAACGTCTGC
ACAATAGCCT GCAGCTTCTG CGCCTTCTAT AGGCCACCTA GGCACCCTGA GGCATACACC
ATGAGTGTGG ATTACGCCGT TAAGCTTGTT ACAGAGGTTG ATTCAAGATT CGGTATTAGG
CAGGTTCTAG TGCAGGGTGG TATTAATCCT GAATTAGGCA TTGAGTACTA CGAGGAATTA
TTCAAGACCC TTAAGGCTAA GCTACCCCAC GTGGCTATTC ACGGTTTAAG CCCCATTGAG
GTTGATTACC TAGCTAGGAA GCATAGAATG AGTTACAGGG AGGTTCTTGA GAGGCTTAAG
GCAGCTGGAA TGGATACGCT AGCTGGGGGT GGCGGGGAGA TTCTGGTGGA TGAGGTTAGG
AGGATTATTG CACCACATAA GATTAGTGCT GAAACCTGGT TGAGAGTAAT GGAGATTGCC
CACGGGTTAG GCATAATGAG TAACGCAACA ATGATGTACG GGCACGTGGA GTCTAAGGCG
CATTGGGCTG AGCACCTATA CAGGATTATT AGCCTACAGA GGAGGACTCA TGGATTCCTA
TCCTTCACCG CGTGGAATTT CGAGCCAGGT AACAGTGAGT TAACTAATAA GGTTCCTTAC
CCATTAACAT CAGCCACATT ACTGAGGGTG GTTGCCGTGG CTAGGCTTGT GTTTAAGGGT
GAGTTACCTA ATATTCAATC AAGTTGGTTA ACTAATGGTC TTGATACTGC TCAATTAGCC
CTCAAATTCG GTGCAAACGA CTTCGGTGGA ACACTATACG AGGAGAGAGT AATACCAGCA
ACCGGGTTAA GTAAACCAGT ATTCACAAGG GATTACGTAA TCAACATGAT TAGAAGCCTA
GGATACAAAC CAGCGGAACG CGACAACTGG TATAGGGTGT TAAAACTATA CGACTAA
 
Protein sequence
MSQALEKALS GEELKPPDIE ELLELNLWDL GIAAGELTRR LFNGVVTFIP NLILNYTNVC 
TIACSFCAFY RPPRHPEAYT MSVDYAVKLV TEVDSRFGIR QVLVQGGINP ELGIEYYEEL
FKTLKAKLPH VAIHGLSPIE VDYLARKHRM SYREVLERLK AAGMDTLAGG GGEILVDEVR
RIIAPHKISA ETWLRVMEIA HGLGIMSNAT MMYGHVESKA HWAEHLYRII SLQRRTHGFL
SFTAWNFEPG NSELTNKVPY PLTSATLLRV VAVARLVFKG ELPNIQSSWL TNGLDTAQLA
LKFGANDFGG TLYEERVIPA TGLSKPVFTR DYVINMIRSL GYKPAERDNW YRVLKLYD
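
The relationship between the two sequences above can be illustrated with a minimal translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code (the tables differ in permitted start codons, which this sketch does not model). The helper names `clean`, `translate`, and `gc_percent` are hypothetical and chosen only for illustration.

```python
from itertools import product

# Codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from this record.
first_line = "ATGAGTCAGG CATTGGAGAA GGCCCTTAGT GGTGAAGAGT TGAAGCCCCC TGATATTGAG"
print(translate(first_line))  # MSQALEKALSGEELKPPDIE
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.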