Gene Cmaq_1692 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1692 
Symbol 
ID5709110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1769660 
End bp1771891 
Gene Length2232 bp 
Protein Length743 aa 
Translation table11 
GC content42% 
IMG OID641276200 
ProductAlpha-glucosidase 
Protein accessionYP_001541505 
Protein GI159042253 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.90141 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATTTT CAATGAAAGT AAGCCTAATA AACGATGATG CATTAAAGGT AAGTATAATT 
AAAAGCGGCA GTAGGTACCG GGAATCCCCT GCTGTTGTTG TTAAGCCGAG TGTTGAATTA
GTAAGTGGTG AGAATAGGCT TGGTCCATGG CTTGTTAAGG TTGCTGAAGA TTCCATTAAT
GTAAGTGTAA ATAACATGAA TGCAACATTA AGGTTCAGTT ATAGTAATGA TCAAATAATA
GTGAGGGGTA ATTTAGGCCT CAATGATGCA GTTTATGGAC TTGGTGAAAA GGCGTTACCA
TTGAATAGGA AAAGGTTCAG GGTAACCATG TGGAACACTG ACGCCTATGG GTACAGGTAT
GGTTCAGATC CACTGTATGT ATCAATACCG TTCTTCATAA TTACTAATAA GAATGGGGCA
ATAGGCCACT TCGCTGATTC CACGGCTAAG GTAATTATTG ATCTTGGTGC AGAGAAGGAG
GATGAGTTCA CGGTTATTGT GAATGATTAT CAACTGGATT ACTACATTAT TAGGGGGCCT
AGGCTTAAGG ATGTGGTTAC TAGGTTCATT AACTTAACAG GTAAACCCAC CTTAATGCCT
AAATGGGCGC TTGGGCATCA GCAAAGTAGG TACAGTTACT ACCCCCAGGA TAGGGTTATT
GAGATTATTA AGACCTTTAA GGAGAAGGAA CTGGATAACA CTGTTGTATA CCTTGATATA
CATTACATGG ATGGCTACAG AATATTCACC TGGAGTAAGG ATAGGTTCCC TAATCCCACT
GAATTAGCTA AGGCGGCTCA TGAACTTGGT GTTAAATTAG TAACCATAGT GGATCCGTAT
GTTAAAGTTG ATCCAAATTA CTACGTGTTT AAGGAGGGTA TTAATGGTAA TCACCTGTCG
CTTGATGATG ATGGTGGATT ATCCATAGTT CAGGGTTGGC CAGGTAAATC AGCATTACCG
GACTTCTTTA ATAAGGAGGC TAGGGAGTGG TGGGCTAGTC TCATTGAGCG TTGGGTTAGG
GAGTATGGTG TTGACGGTAT TTGGCTAGAC ATGAATGAAC CAGCGGCCTT CGATTATCCC
AATCACACTG TTTCAAGTAA AGTAATAACT CATAGACTTG ATGATGATTC AAGGGTGCCT
CATGACTTCC TCCACAACGC CTATGCGCTA TATGAGGCTA TGGCAACATA TGATGGCTTA
GTTAAGGCGG GTAGAAGACC ATTCGTATTA TCCAGGGCCG GTTACGCCGG TATCCAGAGG
TATGCGGCAG TTTGGACTGG TGATAATACC AGTAATTGGG AACACTTGAG ACTGCAATTG
CAGATACTCC TGGGTTTAAG TATATCAGGT GTCACATTCA TTGGCGCTGA TGTAGGTGGC
TTTGCAAAAT ATGTTCCAGG GAGTGGTGGA AATGTTTTGT TTACTTTAAG TCCTGAACTA
CTGGTTAGGT GGTATGAGTG GGCTATTTTC TTCCCACTGC TGAGGAACCA TGCCTCAATT
GGGTCACCTG ACCAGGAACC CTGGGCCTTT GGGCCAAGAA CACTTGAATT AATTAAGAAT
CTTCTGAGGC TCAGGGCTAG GTTAACCCCA TACTTATACT CATTAATGTG GCTTAGCCAC
ATTAATGGTG AACCAATAGT TAGGCCATTG ATATACGAGT ACCCTAATGA TGAGGAGGTT
ATTAATATTG ATGATGAATT CATGCTTGGG CCATTCATGC TAATAGCACC AATGTTAACC
AGTGGTAATG CCAGGGAGGT TTACTTACCT GAGGGGGAAT GGGTTAATAT GTGGAGTGGT
GAGGTGCTTA ACAAGGGATT CCACATTGTT GATGCACCAC TTGGTAAGCC ACCAGTATTC
CTTAGGAGGG GTTCACTGAT ACCTGTACAG GAGACTCAGG GTGTTTTAGG CGTGCTGACG
GTATTGGGTG AGGGGGAATT CACTGTTTAC GATGATGATG GTGAATCATC ATCACCAACA
CCATCAACAT TAAGCCTAAG GATTAGTGGT GAATCAATTA CAGTAGGTAA TTGGATTAAT
CCAATGCCTC AATCACCATC ATCAATAATA CTTGAGGCCT ATGTTAATAA GGAACCAGGT
AAAGTAACTA TTAATGATAC TGAGGTGGCT AAGGCTAAGT TCAATATTGA ACCAGGTCCA
CCATCATGGT ACATGGATAA GCTACTCTAC ATTAGAGCGG CAACTGGGAG TAATGTTAAA
ATAATTAATT AA
 
Protein sequence
MEFSMKVSLI NDDALKVSII KSGSRYRESP AVVVKPSVEL VSGENRLGPW LVKVAEDSIN 
VSVNNMNATL RFSYSNDQII VRGNLGLNDA VYGLGEKALP LNRKRFRVTM WNTDAYGYRY
GSDPLYVSIP FFIITNKNGA IGHFADSTAK VIIDLGAEKE DEFTVIVNDY QLDYYIIRGP
RLKDVVTRFI NLTGKPTLMP KWALGHQQSR YSYYPQDRVI EIIKTFKEKE LDNTVVYLDI
HYMDGYRIFT WSKDRFPNPT ELAKAAHELG VKLVTIVDPY VKVDPNYYVF KEGINGNHLS
LDDDGGLSIV QGWPGKSALP DFFNKEAREW WASLIERWVR EYGVDGIWLD MNEPAAFDYP
NHTVSSKVIT HRLDDDSRVP HDFLHNAYAL YEAMATYDGL VKAGRRPFVL SRAGYAGIQR
YAAVWTGDNT SNWEHLRLQL QILLGLSISG VTFIGADVGG FAKYVPGSGG NVLFTLSPEL
LVRWYEWAIF FPLLRNHASI GSPDQEPWAF GPRTLELIKN LLRLRARLTP YLYSLMWLSH
INGEPIVRPL IYEYPNDEEV INIDDEFMLG PFMLIAPMLT SGNAREVYLP EGEWVNMWSG
EVLNKGFHIV DAPLGKPPVF LRRGSLIPVQ ETQGVLGVLT VLGEGEFTVY DDDGESSSPT
PSTLSLRISG ESITVGNWIN PMPQSPSSII LEAYVNKEPG KVTINDTEVA KAKFNIEPGP
PSWYMDKLLY IRAATGSNVK IIN