Gene Cmaq_0708 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_0708 
Symbol 
ID5708597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp742068 
End bp743183 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content43% 
IMG OID641275206 
Productxylose isomerase domain-containing protein 
Protein accessionYP_001540534 
Protein GI159041282 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4952] Predicted sugar isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.00278763 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000165231 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCATGA GTATAAGTAG GTTAAAGTCA ACCTACTCAG GTTTCATAGA GGGTAAGACT 
GTTGATCAAT TTTTCAGTGA GTATAATGTT AAGTTCGCTG CAGGTACATG GACTGCTGGG
GATTTCTCAG ATAGATTCAA TAGAAGTGGT TACTTCCCGA ATCTACCAAG AGGGTTAGTG
GATCAGTTAA GGAGGGTGAG GGAGTCTGGG ATTGAGGGTG TCGTGCCAAT AGATGCTCAA
TTCCTTGACG ATAATCTTAA GGTTAAGGAG GACTTGATAA ATGAAGTTAA GGCAACTGCA
AGTGAATTAG GCCTTAAGAT AGCTGGGTTA GGTATGGATA TTTCAGGTTT CCACGTGTTT
AAACTAGGTT CATTAACTAA CCCTGACCCT AAGGTGAGGG AACTGGCCTT AAGCACCCTT
ACCCAGAGCC TTGAAATAGC CCGTATGCTT GGCCTGGATT CAGTATCACT TTGGCTTGGT
CCAGATGGCT GGGATTACAG TCTTGAGTCT AATTACGGTA AGAAGATTAA GGAACTTTAC
GAGGGGTTAC TTACCCTAGG TAAGGAGGCG CATAGGTTGG GCATTAGGTC TTTTGGACTT
GAGGCTAAGC CCAAGGAGCC TAGGGAGGGT AATTTAATCA TACCCACATC CCATGTTTCA
ATAATGCTCG CTAACAGGCT AAATAATGAT TTAGGTGTAA AACTGTTTGG AATAACCATA
GACTATGGTC ATGAATTAAT GTACGCCGTT GAACCAGCCT ACACAGTCTA CTTAGCTAAG
GAGCAGGGAG TCTCAGTGGC CACGGTGCAT ATTAATACAG CTAAGTGGCA TAGTAATGAT
GAGGACAGGG TTGTTGGAAC TGGGGACGCA TGGCACTTCG TAGACTTCCT ATACGCACTA
CTGGACACTG GTTACTCAGG CTGGTTTACC CTGGATCAAT TCACTTATAG GCTTAATCCA
GTGGATGGGT TAAGGTTATC TAAGGAATTA TTCGCTAACC TGTATAAGAA GGCTCTGGCA
CTATACTTAT CTAGGGATGA GTTTGAGAAC ATTAGGTCCA CGGGTGATCA AGCTAAGATA
CTTGACTACG TTAAGAGGAT AATGTACGGC TTATGA
 
Protein sequence
MSMSISRLKS TYSGFIEGKT VDQFFSEYNV KFAAGTWTAG DFSDRFNRSG YFPNLPRGLV 
DQLRRVRESG IEGVVPIDAQ FLDDNLKVKE DLINEVKATA SELGLKIAGL GMDISGFHVF
KLGSLTNPDP KVRELALSTL TQSLEIARML GLDSVSLWLG PDGWDYSLES NYGKKIKELY
EGLLTLGKEA HRLGIRSFGL EAKPKEPREG NLIIPTSHVS IMLANRLNND LGVKLFGITI
DYGHELMYAV EPAYTVYLAK EQGVSVATVH INTAKWHSND EDRVVGTGDA WHFVDFLYAL
LDTGYSGWFT LDQFTYRLNP VDGLRLSKEL FANLYKKALA LYLSRDEFEN IRSTGDQAKI
LDYVKRIMYG L