Gene Cmaq_0033 details

Gene Information

Locus tag: Cmaq_0033
Symbol:
ID: 5708893
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 47839
End bp: 50184
Gene Length: 2346 bp
Protein Length: 781 aa
Translation table: 11
GC content: 44%
IMG OID: 641274536
Product: extracellular solute-binding protein
Protein accession: YP_001539877
Protein GI: 159040625
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
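
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 47839
end_bp = 50184

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2346 781
```

Both values match the Gene Length and Protein Length fields reported in the record.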


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.649183
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGTATA TAAGTAAATC AACAATACTA ATATCAGTAA TGGTATTAGC CATCATTACG 
ATTATTGCAC TACCCCAACA AACTCAGAAG CCTCAAATAA TATGGGCTGA TTACGGTACA
ATAACCACAC TAACCGGCCC AATCTATAAT CCATTTTACC CAAATACATT AGCCACAGAT
ACAGTAACCA GTATAATATC CTATGCGCCA CTGGCGTTAT ATAATCCATT CAATAATGTA
TTTTACCCGG TCTTAGCCAG TAACTGGACT ATTCAAGTTC TCCCTAATGG TAGTGGTATT
TTAACTGTTT ACCTTAGGAA GGGGCTGTAT TGGTTTAATG GTTCAGCTAC AATGCTCTTC
ACTGCATGGG ATGTTTACGC ATACTTCTAC ATTGAGGATA AGGCCTTTGA GGCATATGCC
CCATTCATGC AGCCACAATA CGCTGATGAG AGTATAAGGG TACTTGATAA TTATACTATT
CAATTCCTAT TCCAAATATG GAGCCCAACC GAGTGGATAT ACTTCATTAC CTCAAGCATT
GCGACTCCTT GGCCTGTTTG GAAACCCATT GTTAATGAGT TGAAGACTAT GAATGCCTCC
CAAGCCTTAG CCTTCTCAAC CAATGTAACT AGGTACGTGG TGCCTTACTG GGGTATTTTC
CCATATTACT TAACCTACAT AAGTTCATCA AGCATAGAAC TAACCCTGGA GCCAAGCCCA
TTGCTCAACC AGTGGTATAC CGTATTTCCG CTTGCAGACT GGAATTACTA TGACCCAACC
TTCGAGGAAT TCTTTACTGG GAGCCAGTAC ATAGCATCAC TGGTGTCGCA TAAGGCCACT
TGGGCTGGCG GCGCGGCAGG CATTAAGCAG GTAGCCTTAC TTAATAGCAG TGGCTTCAGT
GCATACTTCG CCCCAGACTT ATCAGGCTGG GGAATCACGT TCAATCCACA TGTGTATCCA
TTCAACATAA CGCTGATTAG GGAGGCCCTA TGCCTCATAT TTAATAGAAC AGCCGTTGTA
GCTGCCTGGG GACTTAACTA CCCTAACTAC TACTCTCAGC CAATAGCCCC TGAAACTATT
AGTTCTTATC CGCCAAGTGT TAGGCAGTTC ATTATACCGT GCTCCTATGA TCCAGCTAAG
GCGGCTCAAA TGCTGCAGAG CCTAGGATTC AAGAAGATTA ACGGATACTG GTATTTGCCT
AATGGCTCCA TGTTCTCAAT ATACGTACTG GCACCCAGTG GTTGGATTGA TTGGGATACC
ATGGCTTCTG AGGCTATTGA GGAGATGCAG GCATTCGGCA TAAACGCTAA GTTAATCACA
ATGGATGCTG GAGCCTACTG GGGTACCATG ATACCTGATG GTGATTACGT CGCTGCATTA
ACATGGACCA CCGCATTCAC TCCAGCCTAC TATAGTGCGT GGGAGGCGTT GAGTAATCCA
TGGTGGGCCT TCGGTAGTGC AATATCTGCT TATACTCCTG GTAGTGAGGT TTGGCCATTC
CAGTGGCCTA ATGGGACATG CACACCAGTC ACTGCACCAG CATCATTGAA ACTACCCAAT
GGCACAATAG TATGGTGCAT CAACTCAACA TACGGCTACA TAAACCTAAG CAACTGGCAA
ACATTCTTCA ATGATGCAAC TCCAGGAACA GCGGACTACA ATCTAGCACT AGACACAATA
TTCGCATGGT ATAGTTACTT CACTCCAATA GTGCCGTTAG CAGCCAAGAT AGACCCATTC
ACGTACTTAA CACCAATAGC TGATCCGAAT TGGCTATACC TCTGTCTACC TAATGAGACA
ACGTGGTTCC TTGTATCTGC AAACTGGTAC ACCTACGGCT CATTAATAAT GCTAATGTTT
GGTGCTGTTG CACCAAGAGG CGTAGTACCA CCATTAGCCC AGGTAATTGC TAACGGTAGC
CTTTGGGTAA AATACCCACA AATAGCAAAT CTACTTGCCT TACCTAGTCC TGATCCTTCG
TTGCAGGCTT GTGTGGCATC GTACTTCCAT ATACCGTATA CTCCGGTGAC TACTTCAACC
TCAACAACCA CGACTACATC AACAACTACT ACATCAACAA CCACAACCAC TACTACACCG
GTTACTACTA CGGCTACAAG CACTACTACG ACTACTTCAA CAGTAACCAC TACTGCCGTT
AGCACTGTTA CAAGTACTGT TACAACCACT GCTGTAAGCA CGGTTACAAG TACAGCAACC
ACTACGGCAG TAAGCACAGT AACAGTAACC AAACCAGTAG TATCAACAGC ATTAATAGCA
GGAATAATTA TTATTGTAAT CGTAATAGCA GCAGTAGCAG CAATAATAAC ATTAAGAAGA
AGATAA
 
Protein sequence
MRYISKSTIL ISVMVLAIIT IIALPQQTQK PQIIWADYGT ITTLTGPIYN PFYPNTLATD 
TVTSIISYAP LALYNPFNNV FYPVLASNWT IQVLPNGSGI LTVYLRKGLY WFNGSATMLF
TAWDVYAYFY IEDKAFEAYA PFMQPQYADE SIRVLDNYTI QFLFQIWSPT EWIYFITSSI
ATPWPVWKPI VNELKTMNAS QALAFSTNVT RYVVPYWGIF PYYLTYISSS SIELTLEPSP
LLNQWYTVFP LADWNYYDPT FEEFFTGSQY IASLVSHKAT WAGGAAGIKQ VALLNSSGFS
AYFAPDLSGW GITFNPHVYP FNITLIREAL CLIFNRTAVV AAWGLNYPNY YSQPIAPETI
SSYPPSVRQF IIPCSYDPAK AAQMLQSLGF KKINGYWYLP NGSMFSIYVL APSGWIDWDT
MASEAIEEMQ AFGINAKLIT MDAGAYWGTM IPDGDYVAAL TWTTAFTPAY YSAWEALSNP
WWAFGSAISA YTPGSEVWPF QWPNGTCTPV TAPASLKLPN GTIVWCINST YGYINLSNWQ
TFFNDATPGT ADYNLALDTI FAWYSYFTPI VPLAAKIDPF TYLTPIADPN WLYLCLPNET
TWFLVSANWY TYGSLIMLMF GAVAPRGVVP PLAQVIANGS LWVKYPQIAN LLALPSPDPS
LQACVASYFH IPYTPVTTST STTTTTSTTT TSTTTTTTTP VTTTATSTTT TTSTVTTTAV
STVTSTVTTT AVSTVTSTAT TTAVSTVTVT KPVVSTALIA GIIIIVIVIA AVAAIITLRR
R
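
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before they can be compared. The following is a minimal sketch of how the gene sequence could be translated and checked against the protein sequence, using only the Python standard library. The helper names (`clean`, `translate`) are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code; table 11 differs in which start codons are permitted, and this sketch does not handle alternative start codons (this gene starts with ATG, so that does not matter here).

```python
# Standard codon-to-amino-acid assignments, in TCAG order.
# Assumption: table 11 uses these same assignments for internal codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to print sequences in blocks."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate a cleaned DNA string codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGAGGTATA TAAGTAAATC AACAATACTA ATATCAGTAA TGGTATTAGC CATCATTACG"
print(translate(clean(first_line)))  # MRYISKSTILISVMVLAIIT
```

The output agrees with the first twenty residues of the protein sequence. Passing the full gene sequence through the same two functions should reproduce the whole protein sequence under the same assumptions.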