Gene Cmaq_0247 details


Gene Information

Locus tag: Cmaq_0247
Symbol:
ID: 5710100
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 281217
End bp: 282374
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 44%
IMG OID: 641274749
Product: type I phosphodiesterase/nucleotide pyrophosphatase
Protein accession: YP_001540085
Protein GI: 159040833
COG category: [R] General function prediction only
COG ID: [COG1524] Uncharacterized proteins of the AP superfamily
TIGRFAM ID:
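
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
# Values copied from the record above.
START_BP = 281217
END_BP = 282374


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1158
    print(protein_length(length))  # 385
```

Under these assumptions the computed values agree with the listed gene length (1158 bp) and protein length (385 aa).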


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGTGG TTAAGCCTGA TTACAGTGGA GGGAGTTTGC TTAACTTATC ATCATCCATA 
TGCGACTTCC TGGGTGTTAA GAATCAGCAC CCGAGGCTAA GGAGTATTGA TGGTGTTCAG
GGTAGGAGGA TACTGCTGCT TCTCATCGAT GGTTTAGGTT ATGTCCATTT AAAGCGATAC
TGCAGTAATT GTGAGGAGGC TAAGTGGCTT GGTAATGTTG AGGAGTTAAC TAGTGTATTT
CCATCAGTGA CCTCAACAGT CTTAACAACA TTATCAATGG GAGTACCCCC TGGGGTTCAT
GGTGTATTGG GCACGGTCAT GTATGTTAAG GAGGCTGGTA GCTTGGTTAA TACTTTAACA
ATGGGTTTAA TGCCTGATGG GAGGAGGGGG GAGTTGAGGG ATATTGGCTA TGACCCTAGG
GTTATCTTCT ACGGTGGCTC AACAATATTT GAGGAGGCTA AGTTAAATGG ATATAATTCA
CTGGTTATTA CCCCAAAGGG TATAAGCGGG GGCTTATCAG ACTTAATATA CAGGGGTACT
GAGGTTAAGG AGTACGTGAG CGTTTACGAT GCCTTAGTAC TAGCCTCCAG GGCCCTTGAG
AATAACACCC TCGTGTACGT TTACATACCC ACCCTGGATT CGATTCAACA TGAGTATGGC
CCAGAGTCCG AGGAGTATAG GGTTGCCTTA ATTGAGCTAC TGAATACACT AGGTAGGTTA
ATTAGGCATC TGCCTCAATC AACTACAGTA GTGTTAACTG CTGATCATGG TCAAGTCCAG
GTTGGTCAGG GTGATGTAGT GAACTTAAGG GTAATGACTA GGTTACTGGA TTCATTGTCA
GTGGCGCCTT ACGGTGAACC AAGGGCTCTT CAACTCAAGT TAAGTGACAA GTCACTTAAG
AATGAGGTTA AGGATGCCTT ATCCTCAATG GGTAGGAAGC TACTTATTTA CGATTCAAGT
GAAGTTAAGG AACTATTGGG TGGGGTTACT GAGTACACTG AACAGAGGAT GGGTGACCTA
TGGGTTATAC CACTCGACAC CACTGCCTTA ATCTACCTGT ATAGGCTTAA TGATGATAAG
GTGGCTAAGT TTAAAGGTCA TCACGCTGGT TTACTTGATT ACGAAATGAA GGTTCCCTTA
TCCATAATAA ACCTTTAA
 
Protein sequence
MSVVKPDYSG GSLLNLSSSI CDFLGVKNQH PRLRSIDGVQ GRRILLLLID GLGYVHLKRY 
CSNCEEAKWL GNVEELTSVF PSVTSTVLTT LSMGVPPGVH GVLGTVMYVK EAGSLVNTLT
MGLMPDGRRG ELRDIGYDPR VIFYGGSTIF EEAKLNGYNS LVITPKGISG GLSDLIYRGT
EVKEYVSVYD ALVLASRALE NNTLVYVYIP TLDSIQHEYG PESEEYRVAL IELLNTLGRL
IRHLPQSTTV VLTADHGQVQ VGQGDVVNLR VMTRLLDSLS VAPYGEPRAL QLKLSDKSLK
NEVKDALSSM GRKLLIYDSS EVKELLGGVT EYTEQRMGDL WVIPLDTTAL IYLYRLNDDK
VAKFKGHHAG LLDYEMKVPL SIINL
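
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines. This is a minimal illustration, not the pipeline that produced the record. It relies on the fact that NCBI translation table 11 assigns codons to amino acids in the same way as the standard code (the two tables differ in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Same assignments for NCBI tables 1 and 11; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three blocks and the final line of the gene sequence above.
    print(translate("ATGAGTGTGG TTAAGCCTGA TTACAGTGGA"))  # MSVVKPDYSG
    print(translate("TCCATAATAA ACCTTTAA"))               # SIINL
```

The two fragments translate to the first ten and last five residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the whole record to be checked the same way.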