Gene Cmaq_1140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1140 
Symbol 
ID5710190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1196809 
End bp1197984 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content46% 
IMG OID641275639 
ProductD-galactarate dehydratase/Altronate hydrolase domain-containing protein 
Protein accessionYP_001540957 
Protein GI159041705 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2721] Altronate dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.944509 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTGATG TGGCTGGCAA ATACCCAACC ATTAAGGCGT ACATTAGGCC TGATGGAAGC 
GTAGGGGTAA GGAACCACGT AGCCATAATA CCCATTGATG ACTTATCAAA CACAGCCGCC
TTAGGTGTGG CTAAGTTAAT TAGGGGTACT GTGGCTATAC CGCACCCATA CGGTAGATTA
CAGTTCGGTA GGGACTTGAA TCTACTATTC CATGTATTAT CCGGCACTGG GGCTAATCCC
AATGTATACG GCGCCATAGT TATTGGTATT GAGGATAATT GGGCTAATAA GGTTGCTGAT
GTGATAGCTA AGACGGGTAA GCCGGTTTAC GTATTCAGTA TTGAGGGTAA TGGTGATTTA
AAAACCATTG AGGCAGCGGC TAGGAAGGCT AAGGAGCTTG TTCAGGATGC TGGTGAGCAG
CAGAGAACTG AGGTTGATTT AACAGGCATA GTGTTCAGTA TAAAGTGCGG TGAATCAGAC
ACCACATCAG GCTTAGCATC AAACCCAGCA TTAGGGCATA CTGTGGATAA GTTGGTGGAT
TTAGGGAATA CGGTAATGTT TGGTGAAACC TCAGAGCTAA CGGGGGCTGA GGATATTGTA
GCCAGTCGCA TAAAGGATCC TGAGTTAAGG AGTAAATTCA TGAGGATTTA CAACGAGTAC
GTTGAGGTAA TTGAAAGGGA AGGCGTTGAC CTACTTGGTT CACAGCCAAC TGAAGGCAAT
ATTAAAGGCG GTTTATCAAC AATAGAGGAG AAGGCACTAG GCAACATTCA GAAATTAGGC
ACTAGGCCAA TTACGTGTGT TGCCGATTAC CTGGACCCAG TACCAAGGAA CGGGGGCTTA
TGCTTCGTTA ACACTTCATC AGCAGCCGCG GAGGCTGTGA CGTTATTCGC CGCTAAGGGC
TCTGTGCTTC ACTTCTTCAC CACAGGTCAA GGTAACGTGG TGGGTCATCC AATAATACCT
GTTATCAAGA TTTCAGCCAA CCCAAAGACC GTTAAGGCAA TGGGTGAGCA CATTGACGTT
GATGTATCAG ACCTACTGGA GCTTAAGGTT AGCCTACAGG AGGCTGGGGA TAGGATCTTC
AACTACGCGT TAAGGGTAAT GAATGGTAGA TTAACCGCAG CTGAGGCGCT TCAGCATGAT
GAATTCTCAC CCATTAAACT ATACGTGAGT GCGTGA
 
Protein sequence
MSDVAGKYPT IKAYIRPDGS VGVRNHVAII PIDDLSNTAA LGVAKLIRGT VAIPHPYGRL 
QFGRDLNLLF HVLSGTGANP NVYGAIVIGI EDNWANKVAD VIAKTGKPVY VFSIEGNGDL
KTIEAAARKA KELVQDAGEQ QRTEVDLTGI VFSIKCGESD TTSGLASNPA LGHTVDKLVD
LGNTVMFGET SELTGAEDIV ASRIKDPELR SKFMRIYNEY VEVIEREGVD LLGSQPTEGN
IKGGLSTIEE KALGNIQKLG TRPITCVADY LDPVPRNGGL CFVNTSSAAA EAVTLFAAKG
SVLHFFTTGQ GNVVGHPIIP VIKISANPKT VKAMGEHIDV DVSDLLELKV SLQEAGDRIF
NYALRVMNGR LTAAEALQHD EFSPIKLYVS A