Gene Cmaq_0022 details

Gene Information

Locus tag: Cmaq_0022
Symbol:
ID: 5709871
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 35642
End bp: 36655
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 43%
IMG OID: 641274525
Product: 5-formaminoimidazole-4-carboxamide-1-(beta)-D-ribofuranosyl 5'-monophosphate synthetase-like protein
Protein accession: YP_001539866
Protein GI: 159040614
COG category: [R] General function prediction only
COG ID: [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases)
TIGRFAM ID:
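
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; both assumptions are consistent with the reported values but are not stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 35642
end_bp = 36655

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the "Gene Length" field
print(protein_length, "aa")   # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the 1014 bp and 337 aa reported in the record.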


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.653598
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATATTG AGGCTACTTT AAAGAACTAT GACTTAAGTA AATTAAACGT AGTCACAATA 
GCCAGTCACT CATCGCTCCA AATATTGAGG GGGGCTAAGA GGCATGGGTT AGGTACCGTG
GCTGTGGCTA AGCCTGGGTC GGGTTGGTTT TACAGGCGTT TTAACTTCAT AGATAATGTT
ATTGAAATTG ATTTAGGCAG TATGGAGCAA CTTGCAGGTG ACTTGGTTAA GAATAATGCA
ATACTCATAC CCCACGGTAG CTACGTGGAG TACGTTGGGT GGAGGAGGGC ATTAAGCATG
CCTATTCCAA CCTTCGGTAA CAGGTACATT ATTGAATGGG AGGCTGATCA GAGGAAGAAG
ATGAGGCTAC TGGAGTATGC TGGAATACCC ATACCTAGGT CATTTAATGA CCCGACCCAA
GTCGATAGGC CTGTTATAGT TAAGTTATCT GGTGCAAAGG GTGGTAGGGG TTACTTCATA
GCTAAGGATG CCGGTGAACT TGCAGGTAAA TTAAGCAGTA TTAATACGGA TGATTACATA
ATACAGGAGT ACGTGATTGG TGTACCAGCC TACTACCACT ACTTCGACTC TAAGGTATAT
GATCGTGTTG AATTGTTTGG AATGGATTTA AGGTATGAGA GTAACGTTGA CGGTAGATTA
TTCAACCTAG CTGAACCAAC CTTCGTAGTT ACTGGTAATA TTCCACTGGT TCTAAGGGAG
TCCCTACTAC CCACGGTTCA GAAGTATGGT GAAGACTTCT CAAGGGCTGT TGCGGAATTA
GTGCCACCGG GTATGATAGG GCCGTATAGC TTAGAGTCAA TAATTAAGGA TGACTTATCA
ATAGTGGTTT TCGAATTCTC AGGTAGGATT GTTGCAGGTA CGAACGTATA CATGGGTGTA
GGTAGCCCAT ACTCAGTACT GTACTTTAAT GAACCAATGG ACATGGGGGA GAGGATAGCC
CATGAGATAG TGAATGCTGT TAAAAGAGGT AAATTAATCA ATGTATTAAC ATAG
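
A GC content figure such as the one reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted exactly as printed here, in space-separated blocks of ten bases; `clean` and `gc_content` are hypothetical helper names, not part of any database tool. For brevity only the first line of the gene sequence is used, so the printed value describes that 60 bp fragment and not the full gene.

```python
def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, copied as printed in this record.
first_line = "ATGAATATTG AGGCTACTTT AAAGAACTAT GACTTAAGTA AATTAAACGT AGTCACAATA"

print(len(clean(first_line)), "bases")
print(f"GC content of this fragment: {gc_content(first_line):.1f}%")
```

Passing the complete gene sequence to `gc_content` instead would give a value to compare with the GC content field.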
 
Protein sequence
MNIEATLKNY DLSKLNVVTI ASHSSLQILR GAKRHGLGTV AVAKPGSGWF YRRFNFIDNV 
IEIDLGSMEQ LAGDLVKNNA ILIPHGSYVE YVGWRRALSM PIPTFGNRYI IEWEADQRKK
MRLLEYAGIP IPRSFNDPTQ VDRPVIVKLS GAKGGRGYFI AKDAGELAGK LSSINTDDYI
IQEYVIGVPA YYHYFDSKVY DRVELFGMDL RYESNVDGRL FNLAEPTFVV TGNIPLVLRE
SLLPTVQKYG EDFSRAVAEL VPPGMIGPYS LESIIKDDLS IVVFEFSGRI VAGTNVYMGV
GSPYSVLYFN EPMDMGERIA HEIVNAVKRG KLINVLT
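
The relationship between the two sequences can be checked by translating the gene sequence codon by codon. The sketch below builds a codon table from the usual T, C, A, G ordering; translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as starts, so one table serves for this check. The `translate` function is a hypothetical helper written for this example, and only the first line of the gene sequence is translated.

```python
from itertools import product

# Codon table in T, C, A, G order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    usable = len(bases) - len(bases) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[bases[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence, copied as printed in this record.
first_line = "ATGAATATTG AGGCTACTTT AAAGAACTAT GACTTAAGTA AATTAAACGT AGTCACAATA"

print(translate(first_line))
```

The output can be compared with the first two blocks of the protein sequence above. A codon containing a character other than A, C, G, or T raises a `KeyError` in this sketch, which is acceptable for a quick consistency check but would need handling for sequences with ambiguity codes.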