Gene Cmaq_0539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_0539 
Symbol 
ID5709802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp572734 
End bp573864 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content47% 
IMG OID641275043 
Productradical SAM domain-containing protein 
Protein accessionYP_001540373 
Protein GI159041121 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1180] Pyruvate-formate lyase-activating enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00688869 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.395603 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGGAGG CTGAGCTACA GCGCAACTTA AGTGGAGGAG TAGTAATGTG TACTGCATGT 
GCTAGGCGGT GTAGGCTTAA TGATGGACAA GTAGGATTCT GTGGAGTTAG GGCTAACTTC
GGTGGTAGAT TATTCCTTGT TGTTTACGGT AAATTATCAG CCATAGCTGT TGATCCAATT
GAGAAGAAGC CCCTCTTCCA CTTTAACCCA GGCTCAAGCG TCTTATCAAT GTCCACCTAC
GGCTGCTCCT GGGCCTGTCA ATTCTGCCAA AACTTCGATA TTAGTCAAAG ACGCCTATGG
GAGGGTTTCG AGGTGACGCC TGAGCTTATT ATTGAATTAG CTGAGAGTTA CGGGGTACAG
GGCTTAACCT ATACTTACAA TGAACCCTCA GTCTTCGCCG AGTTCGCGCA TGACGTTGGC
TTATTAGCTA AGAAGAAGGG ATTATTCAAC ACATTCGTCA CCAACGGTTA CTTAACTGAT
GAGACTGTGG ACTACTTATC AAAGTTCCTT GACGCAGCCA CGGTTGACAT TAAGGGTAAT
GCGAATAAGG AGTTCCTGAG GAAGTACTCA ATGGTTCCTG ACCCTGAGCC AATATTCCAA
TCGATCAAGG AGATGAGGGA TAAGGGGATT CACGTGGAGA TAACTGACTT AGTTGTCCCA
GAGATTGGGG ATAGGCTGGA GGACGCTGAG GTAATGCTTA AGAGAATCAT GGATTACCTA
GGACCCGACG TATCAATACA CTTCCTCAGA TTCCACCCAG ACTACAAGCT CAGTAACCTA
CCTCCAACAC CTGTTAAGAC TCTTGAGAAG CATGCTGAGT TAGCGAGGAG GATGGGGTTC
AGGTACGTTT ACCTAGGTAA TGTCCCCGGG CATAAGCTTG AGAACACCTA CTGCCCTAAC
TGCGGTAACG TGGTTATCAG GAGGTATGGA TTCCAGATAC TTGAGGTTAA CTTAACTGAG
GATAATAGGT GCAGGTTCTG TGGGGCTAAG ATCAACATTG ATGGTAAAGT TTGGCCAACG
TGGAGGGAGG ATAGATTCGC CTACGTCCCA ATCCACTTAT TCACAAAGTA CACTAAGGTT
ACTAAATCAG ACGTGGAGGC TATTAGAGGT AGGTTAAGGC AGGGTGAGTA G
 
Protein sequence
MKEAELQRNL SGGVVMCTAC ARRCRLNDGQ VGFCGVRANF GGRLFLVVYG KLSAIAVDPI 
EKKPLFHFNP GSSVLSMSTY GCSWACQFCQ NFDISQRRLW EGFEVTPELI IELAESYGVQ
GLTYTYNEPS VFAEFAHDVG LLAKKKGLFN TFVTNGYLTD ETVDYLSKFL DAATVDIKGN
ANKEFLRKYS MVPDPEPIFQ SIKEMRDKGI HVEITDLVVP EIGDRLEDAE VMLKRIMDYL
GPDVSIHFLR FHPDYKLSNL PPTPVKTLEK HAELARRMGF RYVYLGNVPG HKLENTYCPN
CGNVVIRRYG FQILEVNLTE DNRCRFCGAK INIDGKVWPT WREDRFAYVP IHLFTKYTKV
TKSDVEAIRG RLRQGE