Gene Cmaq_0230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_0230 
Symbol 
ID5710137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp264395 
End bp265783 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content43% 
IMG OID641274732 
Productglycoside hydrolase family protein 
Protein accessionYP_001540068 
Protein GI159040816 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5434] Endopolygalacturonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAATT CTCTTCCCTC AGGTAGGACG TATAATGTTG TTGAGTATGG TGCTGATCCT 
AAGGGTTTGG ATGATAGTAC TGGGGCTATA AATGAAGCTA TTACCCAAGC CAGTGAGACT
AGGGGTATTG TGTATATTCC TCCAGGCAAC TACTTATCAA GGAACATTAT TCTGAGGAGT
AATGTAATGT TACTCATTGA TAAGGGTGCT GTGGTTAAAT TCTCAACCGA TTACAAGTCC
TATCCAATAA TTGAGACTAG GAGAGAGGGG GTTCATCATT GTGGTGTTAT GCCGTTAATA
TTCGGTAAGG ATGTTAGGAA TGTTAGGATT ATTGGGGAGG GTGTGTTTGA TGGCCAGGGT
TACGCATGGT GGCCTATTAG GAGGTTTCGC GTTACTGAGG ATTACTGGAG GAGGCTTGTT
GAATCAGGGG GTGTTGTTGG TGATGATGGT AAAACCTGGT GGCCTACTAG GAATGCCATG
GAGGGTGCTG AGGCCTTCAG GAAAATAACC AGTGAAGGTG GGAAGCCGAG TACTGAGGAT
TGTGAGAGGT ATAGGGAGTT CTTTAGGCCT CAGCTTCTTC AACTCTATAA TGCTGAGAAC
GTGACCATAG AAGGGGTTAC GTTTAAGGAC TCACCCATGT GGACAATACA CATTCTCTAC
TCAAGGCATG TTACATTAAT AAACACTAGT AGTATTGCCC CAGATTACTC ACCAAACACT
GATGGTGTTG TCGTGGATTC CTCAAGTGAC GTTGAGGTAA GGGGCTGTAT GATTGATGTT
GGTGATGATT GCTTAGTCAT AAAGTCTGGT AGGGATGAGG AGGGTAGGAG GATTGGCATA
CCCTCAGAGA ATATTCACGC CTCAGGATGC TTAATGAAGA GGGGGCATGG TGGATTCGTT
ATTGGTAGTG AAATGTCAGG TGGTGTTAGG AATGTTTCAA TTCAGGATAG TGTATTCGAT
GGTACTGAGA GGGGTGTTAG GATTAAGACA ACTAGGGGTA GGGGTGGTTT AATTGAGAAT
GTTTACGTAA ACAACATCTA CATGAGGAAC ATAATTCATG AGGCAGTGGT AGTGGATATG
TTCTATGAGA AAAGGCCTGT TGAACCAGTA TCAGAGAGGA CGCCTAAGAT TAGGGGTGTG
GTTATTAGGA ACACATCATG TGATGGGGCA GACCAGGCGG TGCTAATAAA TGGGTTACCT
GAAATGCCCA TTGAAGACAT TATAATTGAG AATACTAGAA TAACATCAAA CAAGGGTATT
CACATTGAAA ACGCCTCAAG TATTAGGCTC AGTAATGTTA AGGTGAACTC AAGGGCGATA
CCAGTCATAA CCATGAGTAA CGTGAGAAAC ATAACGTTAG ACGACGTGAG CGGCTTATCC
ATGGAGTAA
 
Protein sequence
MINSLPSGRT YNVVEYGADP KGLDDSTGAI NEAITQASET RGIVYIPPGN YLSRNIILRS 
NVMLLIDKGA VVKFSTDYKS YPIIETRREG VHHCGVMPLI FGKDVRNVRI IGEGVFDGQG
YAWWPIRRFR VTEDYWRRLV ESGGVVGDDG KTWWPTRNAM EGAEAFRKIT SEGGKPSTED
CERYREFFRP QLLQLYNAEN VTIEGVTFKD SPMWTIHILY SRHVTLINTS SIAPDYSPNT
DGVVVDSSSD VEVRGCMIDV GDDCLVIKSG RDEEGRRIGI PSENIHASGC LMKRGHGGFV
IGSEMSGGVR NVSIQDSVFD GTERGVRIKT TRGRGGLIEN VYVNNIYMRN IIHEAVVVDM
FYEKRPVEPV SERTPKIRGV VIRNTSCDGA DQAVLINGLP EMPIEDIIIE NTRITSNKGI
HIENASSIRL SNVKVNSRAI PVITMSNVRN ITLDDVSGLS ME