Gene Cmaq_1499 details


Gene Information

Locus tag: Cmaq_1499
Symbol:
ID: 5709140
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 1578317
End bp: 1580131
Gene Length: 1815 bp
Protein Length: 604 aa
Translation table: 11
GC content: 47%
IMG OID: 641276008
Product: X-Pro dipeptidyl-peptidase domain-containing protein
Protein accession: YP_001541313
Protein GI: 159042061
COG category: [R] General function prediction only
COG ID: [COG2936] Predicted acyl esterases
TIGRFAM ID: [TIGR00976] putative hydrolase, CocE/NonD family
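
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1578317
end_bp = 1580131

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the reported 1815 bp and 604 aa.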


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000124334
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.885769
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGATTG TGTCTGTTTT CAAGGCTGAT GAAATAAAAA TTGCTCCGAA TCAATTCAGC 
CTAGAACCCG AAAAGTTGCC GCCTAACGCG CAGATAAAGG GTAGACTTTT CGATGGTTAC
CCCGTGATAG TTAAAGAGGA GGTTTCTCCG CCAAAATACA GGATGAAGAT TCTGAAGGAT
ATTATGGTAA AGATGCGGGA TGGGGTGCAT CTTGCTGTCG ATATCTATCT GCCGGACGCG
GAGGGCGAGA AGTTTCCTTG CTTAGTTGCA TGGGGAATGT GGGGTAAGGA CAACCAGGAA
ACAGTGCTAT GGCTTAAAGA CCTTCCTCAA CCATATTACA CGAGTCCTTG GTGGGATGGA
AGCCTTGAGG CAGGAGACAT AGAGTACTTC GTCTCCAGGG GGTATGTTTA CGTGATACCG
GATCCAAGAG GGATAGGAAA ATCTGAAGGG GGGCCTCCGC GTACTCTGAT AGACCTCCAC
AAGCCTGAGG ATATTTACGA CCTCATAGAA TGGGTTGCAC AGCAGCCTTG GTGCAACGGG
AAGGTGGGGA TGATTGGCCC CAGCTCCTAC TCCCTTTCAC AATACATGAT TGCAACCAAT
AATCCCCCAC CGCATTTAGT TGCACTGTTC CCGATTGGTT CATTCTATCC TCCTGCGGAT
CCATTTACAG GGATGATAGA CCTTGCTCTA GCGGGCATAT TCCATGGTGG CCATATACAC
GATAGCTCGT TGCCGGTACG CCAATGGGGT CCACCGATGT CTCCAGAAAT ATTGCCCAAG
GATGAATTTG AAAAGAGACT TAAGGAGTTA CTGGAGCACC CTGACATTAA GTTCCATCCC
AAGGTTAGAT CATCACTGGT TTATCCGAGA GAACCCATCT TGTTTGATTA CCTGATGTCA
GCTTTTCATC CAACGCCTGT TAATGATAAT CTTGATAAGG TAACACTCCC AATATACATC
GGTGTTCCTG CCCCAGGGGG TGGAGGGGCA CGTGTGTATT GGTCTGGATT TGAGGCTTAC
AATAAGGTCC GTTCAAAATA CAAGAAATTC CTCATATTCA TCCCTGGTGA GTTCCCAAGA
CCGTTCGTAC ATATGCAATA CGAGATGATA AGGTGGTTTG ACTACTGGTT GAAGGGAATA
GACAACGGCA TCATGGATGA GCCACCGGTG AAGATCTTCA TGGGTGGGGT GAATAAGTGG
AAGTTTGAGG ATGATTGGCC ACCGAAGGAT ATTAAGTGGA TTAACCTCTA CTTAAGGAAG
GGGAATAAAT TATCCACTAT CCCTGAGAGT GATTCAAGAC CTGACGTGTT GTATCAACCT
ATGCCCCTCA AGGACCCCAC AGTCTACTCA CTAAACTACT ACACGGATCC ATTCACCGAG
GACACTGAGA TAGTAGGACC AACCGCCTTA CATTTAGAGG CGACTATTGA TCAAGATGAC
GTAAACTGGA TGATAACGGT AGTGGATGTA AGCCCAGACG GTAGCAAGCA ATTAATGACA
GAGGGCTGGC TCAGGGGTTC CTTTAGAGCT ATTGATGAAA ATAAGTCAAA GCCATGGGCT
CCAGTGCACA AGGTCCAGGA TCCAGTCCCT GTACCGAAGG GAGAGAAGGT GAAGTACGAC
ATCAACTTAA TGCCGATAAC ATGGGTCATC CAGAAGGGGC ACAGGATAGG TGTCATAATA
AGGACCCAGG ATGATATGTA TAGCCGTCTT GCAATTGGTG GCGTATACTT CCTACCAAGA
ATGGTGGATA CGGTAGTCAA TCTGCATCTG GGACCCAATA GCTACATCGT CCTACCTGTA
AGGAGCAAGG AATAA
 
Protein sequence
MSIVSVFKAD EIKIAPNQFS LEPEKLPPNA QIKGRLFDGY PVIVKEEVSP PKYRMKILKD 
IMVKMRDGVH LAVDIYLPDA EGEKFPCLVA WGMWGKDNQE TVLWLKDLPQ PYYTSPWWDG
SLEAGDIEYF VSRGYVYVIP DPRGIGKSEG GPPRTLIDLH KPEDIYDLIE WVAQQPWCNG
KVGMIGPSSY SLSQYMIATN NPPPHLVALF PIGSFYPPAD PFTGMIDLAL AGIFHGGHIH
DSSLPVRQWG PPMSPEILPK DEFEKRLKEL LEHPDIKFHP KVRSSLVYPR EPILFDYLMS
AFHPTPVNDN LDKVTLPIYI GVPAPGGGGA RVYWSGFEAY NKVRSKYKKF LIFIPGEFPR
PFVHMQYEMI RWFDYWLKGI DNGIMDEPPV KIFMGGVNKW KFEDDWPPKD IKWINLYLRK
GNKLSTIPES DSRPDVLYQP MPLKDPTVYS LNYYTDPFTE DTEIVGPTAL HLEATIDQDD
VNWMITVVDV SPDGSKQLMT EGWLRGSFRA IDENKSKPWA PVHKVQDPVP VPKGEKVKYD
INLMPITWVI QKGHRIGVII RTQDDMYSRL AIGGVYFLPR MVDTVVNLHL GPNSYIVLPV
RSKE
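
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the tool used to produce this record: it assumes the gene sequence is given on the coding strand, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons, which does not matter here because the gene starts with ATG). The helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence in the record.
first_block = "ATGTCGATTG TGTCTGTTTT CAAGGCTGAT GAAATAAAAA TTGCTCCGAA TCAATTCAGC"
last_block = "AGGAGCAAGG AATAA"

print(translate(first_block))  # MSIVSVFKADEIKIAPNQFS
print(translate(last_block))   # RSKE
```

Both fragments match the start and end of the protein sequence listed above. The last fragment can be translated on its own only because the preceding lines total 1800 bases, a multiple of three, so the reading frame is preserved.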