Gene Cmaq_1386 details

Gene Information

Locus tag: Cmaq_1386
Symbol:
ID: 5709391
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 1461550
End bp: 1463226
Gene Length: 1677 bp
Protein Length: 558 aa
Translation table: 11
GC content: 45%
IMG OID: 641275897
Product: Beta-glucuronidase
Protein accession: YP_001541202
Protein GI: 159041950
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
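
The start and end coordinates, gene length, and protein length in this record agree with one another, and the relationship can be checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1461550, 1463226)
print(length)                  # 1677
print(protein_length(length))  # 558
```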


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTACTG GTGTTAGACC TAGGATACCT TTGGATGGTT TTTGGGGTTT CAGGCTTGAT 
CCTGGTAATG CCGGTGAGTC TAATGGTTGG TTTAAGGGTT TTGAGTCCAG TGATCATATT
TATGTTCCAG CCTCCTGGAA TGAGCAGAAT CCTGATTGGG ATGGTTACAG TGGTGTTGCA
TGGTACTTAA TGGACTTCTA CGTGCCTAGG GAGCTTAATG GCTTAACACC ATGGATAATC
TTCGAGGGTA GTGGTTACTT GAGTAGGGTT TGGGTTAATG GGGTATTTAT TGGTGAGCAT
GAGGGTTCAT TCACATTATT CAAATTCAAG ATACCCAACC TAAACTACGG TGGTTGGAAT
AGGGTCGTTG TTAAGATTGA TAACACATTG AAGCCCGATA ACATACCACC CGGTGAGGGT
TTGAATGCAA CCTACTTCGA CTTCTACCAT TACGGCGGTA TTCAGAGGCC TGTTTGGGTT
GAGTTCACCC ACGGCTGCTT CATTGATGAT TTAATCATTG CCACTAGGCA TGATGGATAC
CTTAAGGTTC AGCCAGTGGT TAATTGCTCA AGGCCCTTTA AACTTGAGTT AAGCCTACTG
GATAAGTCTG GTGGCGTGGT TTACAGTAGG GTTGTTGAGG GTGCCTTTGA GGATTATGTT
AAGGGTCTTG AACCATGGTC CATTGAACAC CCTGTACTCT ACACTCTTGA GGCTAGGTTA
ATTACTAATA ATGATGTTGA GGACACCGTC ACTGAGCGTA TTGGGTTTAG GACATTTGAG
GTTAAGGCTG GTGGATTCTA CCTTAACGGT GAATCAATCT TCCTTAAGGG TTTCGGTAGG
CATGAGGATT ACCCAATATT CGGTAGGGCA TTACCGGGGC CAGTGTTGAT TAGGGATTAC
TATAATATGA GGAGGATTAA CGCCAACTCC TTCAGGACAA GTCACTACCC ATACTCCAAC
GCCCACCTAG ACCTAGCTGA TGAATTCGGC GTACTCGTAA TCCTAGAGGC ACCATTGGTT
GGGCTTAGGG AGCATCACTT CAGTAACCGT GATTACCTTG AGAAGGCTAA GCGGGTTATT
AGTGAGATGA TTAGGCAGCA TAGGAATAGG CCCAGTGTGG TTATGTATAG TGTTGCTAAT
GAACCAAACA GTATTACTGA TGAGGCTAGG GTATTCCTAG GTGAGTTAAT GAACCATGTT
AAGTCCATTG ACCCCACTAG ACCAGTAATA TATACTTCAT TCAGGCACCT TGACGATAAG
GCGCTGGGTC TTGGTGATGC AGTGGCCTTA AACATATACT TCGGGTGGTA TAGTGATACT
GGTGATGTTG AAACTGGCGT GGCTAAGGCT GTTAAGCTTA TTGAGGAGGT TCACTCAAGG
TACCCTGATA AACCAATAAT AATAACGGAG TTTGGTGCTG AGGGGGTTAC TGGACTACAC
CATGACCCAC CAGTGGCTTG GAGTGAGGAG TACCAGGAAT TATTCCTAAG GAGGTATATT
GAGGAATTAT CAAAGAAACC ATACGTAAGG GGATTACACA TATGGAACTA CGCAGACTTC
AGGACCCCAC AGAACCCAAG GAGGGTGATA CTGAATACTA AGGGCTTATA CACCAGGGAT
AGGAGACCTA AGTTAGCCAC TAGGACTGTG GCTGAGTTAT TCTCAAAATT AAAGTAA
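
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. The sketch below shows one way to strip the whitespace and compute a GC percentage of the kind reported in the GC content field. The helper names are hypothetical, and how the source database rounds its value is not known, so the result may differ slightly from the listed figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Paste the full block-formatted gene sequence here; only the first
# three blocks of the record are used for illustration.
raw = "ATGGTTACTG GTGTTAGACC TAGGATACCT"
seq = clean_sequence(raw)
print(len(seq), gc_percent(seq))
```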
 
Protein sequence
MVTGVRPRIP LDGFWGFRLD PGNAGESNGW FKGFESSDHI YVPASWNEQN PDWDGYSGVA 
WYLMDFYVPR ELNGLTPWII FEGSGYLSRV WVNGVFIGEH EGSFTLFKFK IPNLNYGGWN
RVVVKIDNTL KPDNIPPGEG LNATYFDFYH YGGIQRPVWV EFTHGCFIDD LIIATRHDGY
LKVQPVVNCS RPFKLELSLL DKSGGVVYSR VVEGAFEDYV KGLEPWSIEH PVLYTLEARL
ITNNDVEDTV TERIGFRTFE VKAGGFYLNG ESIFLKGFGR HEDYPIFGRA LPGPVLIRDY
YNMRRINANS FRTSHYPYSN AHLDLADEFG VLVILEAPLV GLREHHFSNR DYLEKAKRVI
SEMIRQHRNR PSVVMYSVAN EPNSITDEAR VFLGELMNHV KSIDPTRPVI YTSFRHLDDK
ALGLGDAVAL NIYFGWYSDT GDVETGVAKA VKLIEEVHSR YPDKPIIITE FGAEGVTGLH
HDPPVAWSEE YQELFLRRYI EELSKKPYVR GLHIWNYADF RTPQNPRRVI LNTKGLYTRD
RRPKLATRTV AELFSKLK
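
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is illustrative rather than the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in which codons may act as start codons), and it builds the codon table from the usual TCAG ordering. The `translate` helper is a hypothetical name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGGTTACTG GTGTTAGACC TAGGATACCT"))  # MVTGVRPRIP
```

The output matches the first block of the protein sequence listed above.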