Gene Information |
Locus tag | Cmaq_0654 |
Symbol | |
ID | 5709756 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | - |
Start bp | 689921 |
End bp | 692353 |
Gene Length | 2433 bp |
Protein Length | 810 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641275155 |
Product | extracellular solute-binding protein |
Protein accession | YP_001540484 |
Protein GI | 159041232 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
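The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. Neither convention is stated on the page, so both are assumptions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 689921
end_bp = 692353
gene_length_bp = 2433
protein_length_aa = 810

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the table: 2433 bp is 811 codons, that is, 810 amino acids plus a stop codon.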
Plasmid Coverage Information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0507861 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage Information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGAATAC ATAAGACATA TGTACTATTA ATAGTAGCAT CAGTAATGGT TCTAATAGCG TTAAACATCG CATACGCCTC AAGCGGGTAC ACGATAATTT ACTGGAACTC CTACGGTACG TTAACAGTAC CCCCAGGTAC ACCCATATGC AATATATGGA ATCCTAAGGC TCTTTGGTCA TTTATTAATG CATGGGAGTC ATTAGCCTAC TACAATACTT ATAATGGTCA ATGGTGGCCT GTCCTAGCCA GTAATTGGAC ATTATTCCCC CAGAATGATA CAATTATAAT TCATTTACGT AGGGGCTTAG TTTGGTTTAA TGGCTCAGCC ACAATGCCTT TCACAGCTTG GGATGTTTAC GCTGAATTCT ACATTGGGGT TAAGGCATTT GACTGGTGGT ATCCATATGT TAATGCTGAT GGTATAAGGG TTATTAATAA CTACACATTA TCCATTCAAT TAACAGCATG GAGTCCAACA ACCATATTAT TCATGCTGAC TGAACCCATA ACTACACCAT GGCCTTACTG GGAACCTGTT GTTAAGGCCT TGAAGACCAT GAACTCCACG TACGCCTTAA CAGTTTATGG CCCTAATAAT GTAACCACGT GGAATCCACC ATGCTGGTCA CTTGCACCAT ACTACTTCGT GGCCTTCTAC CCAGGAACAG CCACATTCGT TACACAGCTT GAGCCACCTA ACATATTGAG GCAATGGTAT AATATATTCC CATACGAGGA TTGGCAGTAT TACCCAGTAA TTGATTATGT TCAAGTTCTC GGTAATACAC AGGCATTAAC AGGCTTATTA TCAGGTAAGG CCACGTGGTC GTCCGTGGCT CTTTCACTGG CGCAGATTGG GGTAGTTAAT AAGTCTGGGT TACTGGCCTA CATGGTTGAG GAATTTGGGG AATTAGGCAT GGCAATTAAT CCACTTGGCG GTTACCCATT CAACACAACC CAGTTCAGAA AGGCACTATG CTATGCTGTT AACTTAACAG CCGCAATTGC AGTATGGGGT ATTGGTACAT ATTATCCATC TTACTATCCC GCGCCAGTAT TCCCAACCAC TATTGATACA TACCCACCCA GCGTTAAGCA GTTCATAATA CCATGCAGTT ACAATACCAC TAAAGCCGCT GAGTTACTTG AAAGCATTGG CATGTATAAG AAGGGTAACC AATGGTACCT GCCTAATGGT ACACCATTAA CATTAACCGT AATTACACCT TCAGGCTTCA CTGATTGGGC CACCATAACA GAAGGCTGGG CAACCCAATT AAGCTTATTT GGCATACCCA CTAAGGTATT GGCTTTAGAT ACTGGCACAT ACTGGAGCAG CATATTCCCA AGTGCTCAAT TTGAGGTCGC CAATACGTTG TCGAGTTACG GTAGAGGGTA CTATGATCCA ACCGGCCTAG CATTCCAGGG ACCATTAGAG TGGATACCAT GGGTTACTGA ATTGGGGATT TATACTTGGC CATTCCAGTG GCCTAATGGA ACATGCACAC CAGTGGTAAT ACATCTTCCA CCCTCAGCCA ATAGTAGCCT AGTACCTGCC AATGGTACTG TAGTCTGGTG CATTAACTCA ACTCTTGGGT ACATTAACTT AACCAACTGG TTCACGCTAT ATGATGCCGC AACACCAGGT ACTCACCAGT ATAATGAATT GCTTAAGGTT CTATTCGCCT GGTATGATTA CTATGTTCCA ATAGTGCCCG TTGGTTGGAA GAGGGCTGAG GTTAACATTG GTCCTAAGAA TTACTTAATC ACCTGGGCTT ACAATGTGCC TAACCCCATG 
TGCGAGAAAT ACATACCACC GTGGCTTAGG ATTGAGTTAA TGCCATCAGA TAACTCATAC ATAATCGGTG CTGAGGCTTA TTACTATCAG GTCATTAGCT CCCCAGTATG GGAGGCCTTC TGGGGTAGTG GCGCCCCAGC TGGTGCAGTA CCCCCATTGA TTGAGGCCAT GGTTAACGGT AGCCTATGGA TCAAGCACCC TGATTACGCG GAATTCCTAG GCTTAACACC AAGTTACTTA ACAGACCTAA ACCAGTTAAG GTATTGTCTA GCCCAGTACT TCAACATAAC CTCAGAGTAT GTTCCATCAA TAATAACCAC AAGTACTACT ACAACAACCA CCACGACGAC TACTTCAACG ACCACGACAT CAACTACAAC AACTACCCCA GTGACTACAT CAGCAACTAC TTCAACTACT ACAAGCACTG TGACTACTAC GGCAGTTAGT ACAGTGGTGA GTACTGTTAC AACCACTGCA GTATCAACAG TGACTAGCAC AGCAACAACC ACAGCAGTAA GCACCGTAAC AGTCACAAAA CCAGTGGTAT CAACAGCATT AATAGCAGGA ATAGTAATCA TAGTAATCGT AATAGCAGCA GTAGCAGCAA TAATAGCGTT GAGGAGAAGA TGA
Protein sequence | MRIHKTYVLL IVASVMVLIA LNIAYASSGY TIIYWNSYGT LTVPPGTPIC NIWNPKALWS FINAWESLAY YNTYNGQWWP VLASNWTLFP QNDTIIIHLR RGLVWFNGSA TMPFTAWDVY AEFYIGVKAF DWWYPYVNAD GIRVINNYTL SIQLTAWSPT TILFMLTEPI TTPWPYWEPV VKALKTMNST YALTVYGPNN VTTWNPPCWS LAPYYFVAFY PGTATFVTQL EPPNILRQWY NIFPYEDWQY YPVIDYVQVL GNTQALTGLL SGKATWSSVA LSLAQIGVVN KSGLLAYMVE EFGELGMAIN PLGGYPFNTT QFRKALCYAV NLTAAIAVWG IGTYYPSYYP APVFPTTIDT YPPSVKQFII PCSYNTTKAA ELLESIGMYK KGNQWYLPNG TPLTLTVITP SGFTDWATIT EGWATQLSLF GIPTKVLALD TGTYWSSIFP SAQFEVANTL SSYGRGYYDP TGLAFQGPLE WIPWVTELGI YTWPFQWPNG TCTPVVIHLP PSANSSLVPA NGTVVWCINS TLGYINLTNW FTLYDAATPG THQYNELLKV LFAWYDYYVP IVPVGWKRAE VNIGPKNYLI TWAYNVPNPM CEKYIPPWLR IELMPSDNSY IIGAEAYYYQ VISSPVWEAF WGSGAPAGAV PPLIEAMVNG SLWIKHPDYA EFLGLTPSYL TDLNQLRYCL AQYFNITSEY VPSIITTSTT TTTTTTTTST TTTSTTTTTP VTTSATTSTT TSTVTTTAVS TVVSTVTTTA VSTVTSTATT TAVSTVTVTK PVVSTALIAG IVIIVIVIAA VAAIIALRRR
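The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a minimal sketch, not the database's own pipeline. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and that the space-separated 10-base blocks are purely cosmetic. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which start codons it permits, so a standard codon table is used here. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGAGAATAC ATAAGACATA TGTACTATTA"
print(translate(first_blocks))  # MRIHKTYVLL
```

The ten residues produced match the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would be the natural way to compare against the listed 810 aa protein and the 44% GC content, on the assumption that the GC figure refers to this gene's sequence.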