Gene Cmaq_0685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_0685 
Symbol 
ID5709192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp721314 
End bp723362 
Gene Length2049 bp 
Protein Length682 aa 
Translation table11 
GC content43% 
IMG OID641275187 
Productextracellular solute-binding protein 
Protein accessionYP_001540515 
Protein GI159041263 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.025918 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTATAA AATACACCTG GTTCAAGTGG ATCCTAGTAG TCGCTACAGT TCTAGTAGTG 
AGTTTAGTGG TGGGTCAGGT TAATTTTGGG CGTACCGCAG TAAACCCAAT ACTTAATGCC
CAACAGTATA ATATTACACC CTACAATACA ATATATGTAT TCACCATAAG TAGCCCACCC
TTAACCAGTA TGTCGCTTTA TAACCCGAAC ATATTCCATG GTGGTATTGC ATGGTATGGT
GTTGTTCAAT CACATGTAGC GGCGATAAAT TACACCACTG GTGAAGAGAT TCCAATACTG
GCTAAGAACT GGACCCTTCA AGTGCTTCCA AATGGTACAC TGGCTATTTA CGTAACCTTA
AGGCAAAGTG GGATGAGTAA TGGGCAACCA GTCACGTGTT GGGATCTATT GGCTAATAAT
ATTGCTAATG GTTTAATACA CAACATGTGG GGTAACGTCA GCATTGTGAA TAACTACACA
TGCATATTCA AGATGCCTCA AGGCTACTAT GCACCACCCA CTAGCCCAGC TGCTGAGGCT
TATGACGTAT TCTGGGCGCT TAACTGGGGT GGGGTGGCAT TGGTCTGGAC ATTTAATGCA
TGGTACCCAT TAATGGAGGC AACATTGGCT AACTATAGTT GGCTCTGGCT CTTCAACTTT
GGTAACGCCA CCCAGCAGGC TCAGGCTAGA AAAGTATTAA CGCCATTAAT TAATGAGTTA
TTCACTGCAA AATTACCACC TAACACACCT ACAACAGGCC CATTCTATGT TTGCGATATT
ACACCTGAGT ACATTCTCCT TTGTAAGAAT CCGTATTGGT ATGATGCTAA GGATATTAAG
GTTGATTACA TTGTTGAATG GCAGTACTCA TCAATGACCC AGGTTTACGC AGCATTGGCT
TCCGGTAAGA TCAGTATCTG GCGGACCGGG GGCTCGTCAA TTTCATCCAC GTTGCTTAGT
CAAATACTTA GGAATCCATA CATAGAGATG AATATTTACC CAGGCTTCGG TGGTGATGCC
TTGTATTTCA ATTTCCTTAA TCCTTGGTTG GCTATGCCTC AGGTTAGGCA GGCTATTTAC
TATGCTGTTA ATTGGACTCA ATTAGCCCAA GCAGCATACG GGCCTCAATT CATATTACCT
TCACCAACTC CTCAAGTAGG TATAATGAAT GATATGTATC CAAGCTTAGT GCAGCAGGTT
ATAGGCTATT GGGCTAGTCA AGGATCACCA TTAATAAACT ACACTTATAA TCCATCCATG
GCTACTCAAT TGCTTGAGAG TGCTGGTTTC ACTGAGAAGA ATGGTGTATG GTATACACCC
AACGGCTCAG AATTCACGTT AACACTATAC ATTAGTTCAG GAGCATCACC GCCGCAGTTG
GCTTTAGCTA ATAGTATTGC CAATGCGTTA ACAAGCTTCG GAATACCAAC AACAGTAACT
GTTTACCCAA GCTCAGAGTT CAGTACCATA GTTCAACAAG GTAAGTATGA TTTAATATTC
TTGTATTATG ATGGTGCACC TGAACCTGGT TTTCCATCAT TCTTCCCAGA GGGTCCAATA
CCAGCAGCCT ACTTCCAAGG TTACCCATTC AATGCAACGC ATTGGAATAT GGTTGTTACC
CTACCTAATG GTACTCAAGT TACCCCACTT CAAGCATGTA CTGATTACTT GCTGAGTCCA
GCTAGGATAT TCTCATGTTA CGCAATATCA ATGTGGGCTT GGAACCACTA TACACCATTC
ATACAGATTG ATAGGAATAC TTGGATATTC TTCCTAAACA CCCAGTATAT TAACTGGCCG
CTTAACGACA CGTCAATATG GGAGAACCTA CTAACCATAG AAACAGAGGC TTGGACTGCA
TTACTCATGC ACGTATCATT CAAGTCACCT TCAGTCGCAA CTACAACTAC CTCCCCAGTA
ACCACAACTA CCCCGAGCAC TGTAACTAAG GTTGTTACAC CGTCATATAT TATATATGCT
GTAGTGGCTG TAGTTGTAGT GGTTATTGTT GCGGCTGTTG TTGCGATAAT ACTGGTGAGG
AGAAGGTGA
 
Protein sequence
MAIKYTWFKW ILVVATVLVV SLVVGQVNFG RTAVNPILNA QQYNITPYNT IYVFTISSPP 
LTSMSLYNPN IFHGGIAWYG VVQSHVAAIN YTTGEEIPIL AKNWTLQVLP NGTLAIYVTL
RQSGMSNGQP VTCWDLLANN IANGLIHNMW GNVSIVNNYT CIFKMPQGYY APPTSPAAEA
YDVFWALNWG GVALVWTFNA WYPLMEATLA NYSWLWLFNF GNATQQAQAR KVLTPLINEL
FTAKLPPNTP TTGPFYVCDI TPEYILLCKN PYWYDAKDIK VDYIVEWQYS SMTQVYAALA
SGKISIWRTG GSSISSTLLS QILRNPYIEM NIYPGFGGDA LYFNFLNPWL AMPQVRQAIY
YAVNWTQLAQ AAYGPQFILP SPTPQVGIMN DMYPSLVQQV IGYWASQGSP LINYTYNPSM
ATQLLESAGF TEKNGVWYTP NGSEFTLTLY ISSGASPPQL ALANSIANAL TSFGIPTTVT
VYPSSEFSTI VQQGKYDLIF LYYDGAPEPG FPSFFPEGPI PAAYFQGYPF NATHWNMVVT
LPNGTQVTPL QACTDYLLSP ARIFSCYAIS MWAWNHYTPF IQIDRNTWIF FLNTQYINWP
LNDTSIWENL LTIETEAWTA LLMHVSFKSP SVATTTTSPV TTTTPSTVTK VVTPSYIIYA
VVAVVVVVIV AAVVAIILVR RR