Gene Cmaq_1161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1161 
Symbol 
ID5709642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1216552 
End bp1218879 
Gene Length2328 bp 
Protein Length775 aa 
Translation table11 
GC content44% 
IMG OID641275661 
Productextracellular solute-binding protein 
Protein accessionYP_001540978 
Protein GI159041726 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATGA CTGGTATGTC ACTAAAAGTA ATAGGTATAG TGTTAAGCAT AATAGCCATG 
CTTCTAGTCA TATACGGTAT AAATGTGACC TATGCACAAC AGAAGCCTCA GATAATTTTC
CCTGTCGCTG GCGCCACCTT TGAGATTGCT CCAGGTACCC CCATTTACAA TCCGTATAAT
CCAATAGGCT TAGCTACATG CAGTATATCT AGCTTCGGCA CATATTTCCC ACTGGCATTC
TATAGCTATA TAACTGGTCA ATTCTGGCCT ATTCTTGCTG AGAATTGGAC TGTTCAGGTT
TTGCCTAATG GTAGTGGTAT TTTAACCATC TACCTTAGGA AGGGCTTATA CTGGTTTAAT
GGCTCAGCCA CAATGCCCTT CACCGCATGG GACCTTTACG CGGAGTATTA CATTGGTATT
AAGAGTTTCG GCTGGTGGAC ACCCTTCATT AACCAATCCC TACTCAACGA GGACCTAAGG
GTATTAAACA ACTACACCCT ACAAATACTA GTCAACAAAT GGTCACCCCT AATACCAGTC
TGGGTATTAA CCCAGTGTGT AAATGCACCA TGGTTTGTAT GGGAACCTGT GGTTAATGAA
TTGAAGTCAA TGAGTGTTTC CCAGGCAATG AAGTTCAGTA CTAATGTAAC CGAAATTAAT
GTACCCTACT GGGGTATTTA CCCATGGTAC CTAACTTACT TCAGTTCAAA CTATATGACA
CTAACTCTCG AGCCAAGTAA CCTACTTAAC GAGTGGCTTA ACATATTCCC ACTGGCGGAT
TGGTACTACT ATGACCCAAC CTATGAGGCA ATATGGGGGC CGAATTCAGT AGCCTACGAG
TACTGGTATG CTCTTAAGGC AACATGGGGT AGTGCTGGTT TTTCACTGCA GCAGGTGCAG
TTACTTAAGT CACGCGGCGT GGGTATATAC TTTGCACCCA CTTGGAACAC CATGGGTATT
TACATTAATC CTCATGATTA TCCTTGGAAT ATTCCTCAGG TTAGGGAGGC TTTGTGTTAT
GTTATTAATA GGACTGAGGT TGGAGCCTCA TGGGGCTTAG CCATTAGTAA ACCAGACTAC
TACCCTGAGC CAATAGTCCC CGAGACTATT GATACTTATC CGCCAAGTGT TAGGCAGTTC
ATTATACCGT GCTCCTATGA TCCAGCCAAG GCGGCTCAAA TACTGCAGAG CCTAGGATTC
AAGAAGATTA ACGGATACTG GTACACACCA AACGGCACTG AACTAAGCCT ACTGGTGTAT
AGTGTTAGTG GGTTTAGTGA TTGGATGACA ATGACCAGTG ATGCAGTATC TCAATTACAG
AAATTCGGTA TTAATGCCAA GTTAATTGGG CTTGATGTTG GAGTATTCTA TGGAACAATA
ATACCCAGTG GTCAATATGA GGCAGAAACT ACTTGGCTTG CAGCCACGCC ATCATATGGT
AGTGCTTGGT CCTTCCTCCA GGATCCATGG GTATACTCAG GTGCGGCAAT ATCTGCTTAT
ACTCCTGGTA GTGAGGTTTG GCCATTCCAG TGGCCTAATG GTACATGCAC ACCAGTCACT
GCACCAGCAT CATTGAATCT ACCCAATAGC ACAATAGTAT GGTGCATCAA CTCAACATAC
GGCTACATAA ACCTAAGCAA CTGGCAAGTA TTCTTTGATG CTGTGGCTCA ACCAAACACA
CCGAACTACA ACCTAGCAAT AGACGCAATA TTCGCATGGT ACTACTACTT CGTACCCTCA
GTACCATTAT ATGATAAGAT TACGCCCCTT TACTACCTAC CATCCGTTGC CGATATAAAC
TGGACCTATG ATTGCTTACC ACCATCAGTA AACTACGCCA TAGTTGGTAC AGCGGATTAC
TGGTGGGTTT ACGGACCCAT GTATTACGTG CTACTAGGCG CCTATGCACC CACTGGTATT
ATTCCACCAT TAGCTGAGGC TATTATTAAT GGTTCATTGT GGACTAATCC AAGGTTTAAG
GTTGTGGCTG ATTTCATAGG CTTACCTAAC CCAGATTCGT CGATACAGGC TTGTGTAGCA
TCATACTTCC ACACAACGTA CACTCCAGTT ACTTCAACTA CAACGACTTC AACGACTACG
GCCACAACAA CCTCAACAAC CACTGTTACT TCAACTGCTG TTGCTACTGT GACTAGTACT
GCCACTGTAA CATCAACAAG CACAGTAGTA AGCACAGCAA CCACTACGGC AGTAAGCACA
GTAACAGTCA CTAAACCAGT AGTATCAACA ACACTAATAG CAGGAATAGT AATCATTGTA
GTGATTATTG CCATTGTTGC AGCAATAATA GCATTAAGAA GGAGGTAA
 
Protein sequence
MRMTGMSLKV IGIVLSIIAM LLVIYGINVT YAQQKPQIIF PVAGATFEIA PGTPIYNPYN 
PIGLATCSIS SFGTYFPLAF YSYITGQFWP ILAENWTVQV LPNGSGILTI YLRKGLYWFN
GSATMPFTAW DLYAEYYIGI KSFGWWTPFI NQSLLNEDLR VLNNYTLQIL VNKWSPLIPV
WVLTQCVNAP WFVWEPVVNE LKSMSVSQAM KFSTNVTEIN VPYWGIYPWY LTYFSSNYMT
LTLEPSNLLN EWLNIFPLAD WYYYDPTYEA IWGPNSVAYE YWYALKATWG SAGFSLQQVQ
LLKSRGVGIY FAPTWNTMGI YINPHDYPWN IPQVREALCY VINRTEVGAS WGLAISKPDY
YPEPIVPETI DTYPPSVRQF IIPCSYDPAK AAQILQSLGF KKINGYWYTP NGTELSLLVY
SVSGFSDWMT MTSDAVSQLQ KFGINAKLIG LDVGVFYGTI IPSGQYEAET TWLAATPSYG
SAWSFLQDPW VYSGAAISAY TPGSEVWPFQ WPNGTCTPVT APASLNLPNS TIVWCINSTY
GYINLSNWQV FFDAVAQPNT PNYNLAIDAI FAWYYYFVPS VPLYDKITPL YYLPSVADIN
WTYDCLPPSV NYAIVGTADY WWVYGPMYYV LLGAYAPTGI IPPLAEAIIN GSLWTNPRFK
VVADFIGLPN PDSSIQACVA SYFHTTYTPV TSTTTTSTTT ATTTSTTTVT STAVATVTST
ATVTSTSTVV STATTTAVST VTVTKPVVST TLIAGIVIIV VIIAIVAAII ALRRR