Gene Information |
Locus tag | Cmaq_1082 |
Symbol | |
ID | 5709357 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | + |
Start bp | 1134658 |
End bp | 1136478 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641275582 |
Product | thiamine pyrophosphate binding domain-containing protein |
Protein accession | YP_001540901 |
Protein GI | 159041649 |
COG category | [C] Energy production and conversion |
COG ID | [COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits |
TIGRFAM ID | [TIGR03336] indolepyruvate ferredoxin oxidoreductase, alpha subunit |
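The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Expected protein length for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and does not encode an amino acid.
    return gene_len_bp // 3 - 1


length = gene_length(1134658, 1136478)
print(length, protein_length(length))  # 1821 606
```

Both values agree with the Gene Length (1821 bp) and Protein Length (606 aa) rows of this record.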
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0746897 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGGGTA ATCATGCGGT TGCTCATGCG GCATTAGAAG CGGGTTTAGC CGTTGCAGCA GGTTACCCTG GTACGCCTAG TAGTGAGATT ATTGAGTACA TTATTGACCA CTCCAGGGAG ACTGGGGTTT ATGTTGAATG GTCAAGCAAC GAGAAGGTAG CCTACGAGGT GGCTTACGGT GCAGCATTAG CCGGTGCTAA GGCATTGGTT AGTATGAAGC ATGTTGGCTT AAATGTAGCG ATGGATCCAC TAATGTCAAG CGCGTACACT GGTGTTAGGA ATAGTTTACT GGTTATTACC GCTGATGACC CTGGAATGTG GTCAAGCCAG AATGAACAGG ATAATAGGTG GGTTGGTTTA CACGCCCATA TTCCAGTAAT TGAACCATAC AGTCCACAGA ATGCAGCTGA CCTAGTTAAG TTATCAATGA GTATGAGTCA ACGCCTCAAT CACCCAGTGT TAATGAGACT GGTTACTAGG GTTTCCCACG TTAGGGAACC TGTTAAGGTC TGTGAATTCA GTAAACCTGA CTATGCCCAA GGCTACCTCA AGGACCCATC ACACCACGCC TTAGTTCCAT CCAACGCTAG GCAACTTAAA GGTGAGTTAA TTAAACGCTG GGAGAGTATT CAGTATGCCG TTGAGGATTT GCCTCACGAG TACGTTAACG ATGGTAGAGT ACTGGTTATT ACGAGTGGTG TAGCGTATAA TTACGTTAAG GAGGCTTTAA GGGACCTTGA AATCCCCGTT AGTACACTCA ACGTAATCAC CCCAGTACCC CTACCTAGGA GGCTTATAGC TAATGCAGTG TCTAACTCAG ATAAGGTCGT GGTTGTTGAG GAAGGTGACC CAGTGGTTGA GTTTCAGGTT AAGGAGGTGC TTTACGATGA GGGTATTAGG GTTCCAGTGT ACGGTAAGGC TGAGGGTTTC TTCACCAGGG TTGGTGAATT AACACTAATG AATGTTGAGG AGGGGTTGGC TAAGGCACTT GGTGTTGAAT TAAGGGGAAT GCGAAGTAAT GCGCAACGCA TTGATGTACC ACCAAGGCCC CCTGTCTTCT GCCCAGGTTG CCCCCATGCG GCATCATTCT ATGAGTTAAA GATAACCACG GCTAAGGCCA TGGTTAAGCC AGTCTTCAGC GGTGACATAG GCTGCTACTC CCTAGGCATA AACCCACCCT TTAATGAGCA AGACGTGTTA ACGAATATGG GTAGTTCAAT AGGCTTAGGC ATGGGTATCC TTAGAGGCAC TGGTGGTAGG CAATTCATCA TAGCCATTAT AGGTGACTCC ACGTTCTTCC ACGCCGGTTT ACCAGCCCTG GTTAATGCAG TCTACAATAA GGCGCCAATG CTTGTTATTG TTATGGATAA TAGGTTCACT GCGATGACGG GTGGTCAACC AAGCCCAACG CAGGTCATTG ATATTGCCGC TGTAGCTAAG GCAATTGGGG TTAAGTACGT GTACACAATT GACCCCTTCA ACGTTAAGGA GGCTGAGGCT ACGTTAAGCG ATGCGTTAAG GAAGGTTAAG GATGGGGAAT TAGCCCTAGT GGTTATGAAG AGGGCATGCG CCCTAGAGGC ATCAAGGGGG CGTAGTTCAC TCATTGTTAA ATTCACAGTG GACCCCGATG CATGTAAGGC ATGCGGCATC TGCTACAATC TAATAGCCTG CCCAGCCATA GCACCCCTTG AGAACAGGAA GGCTTGGATC GACCCCAATA TGTGCGTGGG ATGCTCAGTA TGCGCTCAAG TCTGCCCTTA CAATGCAATA AAGCCCAGCG GTAATGCAAA GGAGTGGCTT 
GATAAGTGGG CTGAAATGTG A
|
Protein sequence | MLGNHAVAHA ALEAGLAVAA GYPGTPSSEI IEYIIDHSRE TGVYVEWSSN EKVAYEVAYG AALAGAKALV SMKHVGLNVA MDPLMSSAYT GVRNSLLVIT ADDPGMWSSQ NEQDNRWVGL HAHIPVIEPY SPQNAADLVK LSMSMSQRLN HPVLMRLVTR VSHVREPVKV CEFSKPDYAQ GYLKDPSHHA LVPSNARQLK GELIKRWESI QYAVEDLPHE YVNDGRVLVI TSGVAYNYVK EALRDLEIPV STLNVITPVP LPRRLIANAV SNSDKVVVVE EGDPVVEFQV KEVLYDEGIR VPVYGKAEGF FTRVGELTLM NVEEGLAKAL GVELRGMRSN AQRIDVPPRP PVFCPGCPHA ASFYELKITT AKAMVKPVFS GDIGCYSLGI NPPFNEQDVL TNMGSSIGLG MGILRGTGGR QFIIAIIGDS TFFHAGLPAL VNAVYNKAPM LVIVMDNRFT AMTGGQPSPT QVIDIAAVAK AIGVKYVYTI DPFNVKEAEA TLSDALRKVK DGELALVVMK RACALEASRG RSSLIVKFTV DPDACKACGI CYNLIACPAI APLENRKAWI DPNMCVGCSV CAQVCPYNAI KPSGNAKEWL DKWAEM
|
| |
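The sequences above are shown in 10-character blocks, so the whitespace has to be stripped before any analysis. The sketch below is one possible way to do that in plain Python and to spot-check the translation and GC content. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 until the first stop codon; unambiguous A/C/G/T only."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = "ATGTTGGGTA ATCATGCGGT TGCTCATGCG"
print(translate(prefix))  # MLGNHAVAHA
```

The printed peptide matches the first block of the protein sequence in this record. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein Length fields.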