Gene Cmaq_1189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1189 
Symbol 
ID5710423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1248105 
End bp1250486 
Gene Length2382 bp 
Protein Length793 aa 
Translation table11 
GC content43% 
IMG OID641275691 
Productextracellular solute-binding protein 
Protein accessionYP_001541006 
Protein GI159041754 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.109434 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.000120302 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTGGCA GGAGTAACTC AACCCTAGTT GTGAAAGCAA TACTAGTGGC GGTGGTTATC 
CTGGCGGCAG CATATAGCGC AGTCACAGTG CAGGCTCAAC AGAAGCCTCA AATAATAGGA
GGCAACCCTG TCACAATAAT ATTAACACCA GGTTCACCCA TGTGGAATCC CTATGCGCCA
AGTAACATGA TTGGCAACAC ATGGGATTAC CTACCATTAG CGGCGTTTAA TCCATTGACT
GGCCAATTCT GGCCTATTCT AGCTGAGAAT TGGACTGTTC AAGTTCTCCC CAATGGTAGT
GGCATATTAA CAATCTACCT TAGGCCTGGC TTATACTGGT ATAATGGTTC AGTAGCGATT
CCCTTCACTG CTTGGGATGT TTACGCTGAG TTCTACATTG GCATGAAGGT GCTCGCTTGG
TATGTGCCTT GGATTAACCA ATCCCTTGTT GATGAGGATA TTAGAGTATT GAATAATTAC
ACTATTCAAT TCCTATTCCA AAGATGGACA CCGTACATAC CATATTGGCT ATTAACAAGT
TGGATTGATG TACCATACGC CGCATGGAAA CCAATAGTGG ATAAGTTAAG GACAATGAAC
GCAACGCAGG CAGCGTCATT CACATCTAAT GTAACAGAGT ATGTTGTACC GTATTATGGT
TTATACCCAT ATTACTTAAG TTACATTAGC ACAACATACC TTCACTTCAC CCTTGAGCCG
CCTAACTTAT TATCATCATG GTATAAGGTC TTCCCATTCG CATCATGGGG GTACTATGAT
CCAACGGCAA TAGTATGGGA GACTGGCGGC AATACGCAGG CCTTAAGCGG TATGTTAGCT
GGGAAGATAA CGTATGATTG GATTGGTTTA TCCGAATCCC AGCTTAAGGT CATTAATAGT
ACACCAGGTT GGTCTTCATT CGCCTTACCA GTATTCTCAG TAATGGGTAT TGCCATTAAT
CCTAATTATT ATCCTTGGAA TATTCCTCAG GTTAGGGAGG CTTTATGCGA TGTTATTAAT
AGGACTGAGG TTGCTGCAGC ATGGGGCTTA GCCATTAGTA AACCAGACTA CTACCCCACC
CCAGTAATAC CTGGGACTGA AGACACTTAC CCGCCTGATG TTAGGCAATT CATCATACCA
TGCTCCTACA ACTGGACCAA GGCCGAACAA CTCCTTGAAA GCCTAGGATT CACAAAGAAG
GGAGAATACT GGTACACACC AAACGGAACA GAATTAACAC TCTATGTTTA TGGTCCAGGT
GGATTCACTG ATTGGATGAC AATGGCTAGT GATGCTGTTG AGCAAATGCA AGCATTCGGT
ATCAATGCCA AGTTAATTGG GCAGGATGTT GGAGTATTCT GGAGTAGTAC ATTACCTAAT
AGTGAGTACG AGGGTGCTAC CACGTGGCTT AATTCAGCTA ATGGACCAGC CTACAGTAGT
ATGTGGGGGT TATTAGATTG GCCTTGGTGG ACTACTGGAG TAGCCATTCA GGCATGGCAT
AAGGGTAGTG AGGTTTGGCC ATTCCAGTGG CCTAATGGTA CATGCACACC AGTTATCTTA
CCCACTCAGC CACCTGTCTT CACTAATGGT ACCATAGTGT GGTGTGTCAA CTCAACATAC
GGCTACATTA ACATAACCAA CTGGCAGATG CTTGAGAATA TCGCTGCTCC AGGAACACCT
CAATACGACT TGATGATGAA GATAATATTC GCATGGTATG ATTACTTCGT TCCAATAGTG
CCGCTTTATA ATAAGCTTGA GCCGTATGAA TACATGACAT CTGTAATGGA TCCAAACTGG
TTATTCCAAC CATGCATAAT CAATAAGTAC CCAATATTAA CGTATGAACT AGAATTCATG
CCATGGGGCT ATGGTAACTC CTTATACAGT GACCTACATA TAATAATCGC ATGGGGTCTT
GTTGCACCAA AAGGTGTTGT TCCTCCTGTT GCTCAAGCTA TAGCTAATGG TTCATTGTGG
ACTAAGTATC CTCAGTATGC TGCATTCCTA GGTATTCCTA ATCCTGATCC TTCACTGCAG
CAGTGTGTTG CATCATACTT CCATATACCG TATACTCCAG TATCAACCAC TACTTCAACC
TCAACCACAA CCACTACGAC TACTACTCCA GTGACTACTA CTTCAACGAC TACTAGTACT
ACCACTTCAA CTGTTACCTC AACCACTACT GCCGTTTCAA CAGTCACTAG CACAGTCACA
ACCACTGCTG TAAGCACTGT AACATCAACA GCAACCACTA CGGCAGTATC AACAGTAACA
GTAACCAAAC CAGTGGTATC AACAGCATTA ATAGCAGGAA TAGTAATCAT AGTAATAGTC
ATAGCAGCAG TAGCAGCAAT AATAGCGTTG AGGAGAAGAT AA
 
Protein sequence
MSGRSNSTLV VKAILVAVVI LAAAYSAVTV QAQQKPQIIG GNPVTIILTP GSPMWNPYAP 
SNMIGNTWDY LPLAAFNPLT GQFWPILAEN WTVQVLPNGS GILTIYLRPG LYWYNGSVAI
PFTAWDVYAE FYIGMKVLAW YVPWINQSLV DEDIRVLNNY TIQFLFQRWT PYIPYWLLTS
WIDVPYAAWK PIVDKLRTMN ATQAASFTSN VTEYVVPYYG LYPYYLSYIS TTYLHFTLEP
PNLLSSWYKV FPFASWGYYD PTAIVWETGG NTQALSGMLA GKITYDWIGL SESQLKVINS
TPGWSSFALP VFSVMGIAIN PNYYPWNIPQ VREALCDVIN RTEVAAAWGL AISKPDYYPT
PVIPGTEDTY PPDVRQFIIP CSYNWTKAEQ LLESLGFTKK GEYWYTPNGT ELTLYVYGPG
GFTDWMTMAS DAVEQMQAFG INAKLIGQDV GVFWSSTLPN SEYEGATTWL NSANGPAYSS
MWGLLDWPWW TTGVAIQAWH KGSEVWPFQW PNGTCTPVIL PTQPPVFTNG TIVWCVNSTY
GYINITNWQM LENIAAPGTP QYDLMMKIIF AWYDYFVPIV PLYNKLEPYE YMTSVMDPNW
LFQPCIINKY PILTYELEFM PWGYGNSLYS DLHIIIAWGL VAPKGVVPPV AQAIANGSLW
TKYPQYAAFL GIPNPDPSLQ QCVASYFHIP YTPVSTTTST STTTTTTTTP VTTTSTTTST
TTSTVTSTTT AVSTVTSTVT TTAVSTVTST ATTTAVSTVT VTKPVVSTAL IAGIVIIVIV
IAAVAAIIAL RRR