Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0682 |
Symbol | |
ID | 4068772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 839766 |
End bp | 841655 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637982688 |
Product | tryptophan 2,3-dioxygenase |
Protein accession | YP_589761 |
Protein GI | 94967713 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold [COG3483] Tryptophan 2,3-dioxygenase (vermilion) |
TIGRFAM ID | [TIGR03036] tryptophan 2,3-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.431731 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGACGA TCGACATTCA CAATCATTTC TTTCCGGAGA GCTGGCCGGA CCTTGCCGCG AAATTCGGCA CGCCGGACTG GCCGTGGATC AAGCATACGG AACCGGGCAA GGCGACGATC ATGCTCGGCG ACCGTGAGTT TCGGAAAATC TACTCGGCAT GTTGGGACAT GGACGTCCGC TTGGCGGAGA TGGATCGCGA TGGCGTAGAC CTGCAGATCA TCTCGGCGAC GCCGGTGCTG TTTGCGTACG ACCGTCCGGC GGACCAGGCT CTGGAGTGCG CGAAGATTTT CAACGACGCT GCGTTGGAGT TGTGTGCACG CGCGCCGGGA CGACTGAAGG CACTTTGCCA GGTGCCGCTA CAAGACATTG ATCTCGCATG CGCGGAACTC GATCGGGCGA TTAAGGACGG GCACCTTGGC GTGCAGATTG GCAATCATGT TGGCGAGAAG AACCTCGACG ACGCCGGGAT CGAGACGTTC CTACATCATT GCGCGAGCGT CGGCGCAGCG GTGTTGGTGC ATCCGTGGGA CATGATGGCG CGCGATCGCA TGCCGAATTA CATGGCGCCG TGGACGGTCG CGATGCCGGC GGAGACTCAG CTTGGCATCG TGACGATGAT CTTGAGCGGG GCGTTCGATC GATTGCCGGA GAACCTGCGC ATCTGCTTCG CGCATGGGGG CGGCAGCTTC GCGTTCCTGC TTGGACGCAT GGAGAACGCG TGGTCGCATC ATCCGGTGGC ACGCGGAAAG TCGGAACTTC CGCCGCGCAG ATACGTGAAC CGGTTTTATG TGGATTCCGC GGTGTACGAC GAGGAGGCTC TGTGCTTCCT GATCAGCACG ATGGGAGAAG ACCAAGTCGT GCTGGGATCC GATTATCCGT TTCCGTTGGG AGAAGAGCGA ATCGGTAAAT TGATTCGGAC TTGCAAGATC GACGGTTCGG CAAAGAAGAA GCTGCTGAAT GCAAATGCGG CGAGATTCCT GGGGCTGAAG GTTGAAGCGG AGAGCACCGC ACCGGTTGAA GCTCGCAGTT CCAGCGGATG TCCGGTGGCG CACGGGAACG GTGCGGGTGA CGAGCATAAG CTGACATACG GGTCGTATTT GAAGATCCCT GAGTTACTCT CGCTTCAGCA ACTGCAGTCC GAACCACCTC GGCATGATGA ATTGCTTTTC ATCGTGATTC ATCAGACCTA CGAGTTGTGG TTCAAGGAAC TGCTGCACGA TCTCGAGGCG GTGGTCCGTT GTTTGCAGGC GGTGGCACGT GATCCGCGGG CGCGAGATGA AGTTTACGAG GCGGCGCGAT TGTTGCGGCG ATGCACTGAA GTACTGCGGG TGCTGGTGAG CCAGTTCACG ATCCTCGAAA CGATGCTTCC GACCCATTTT CTCGCCTTCC GTGACAAATT AGAACCGGCG AGTGGTTTCC AGTCGCAGCA GTTCCGGCAG ATTGAATTTC TCTGCGGGCT GCGCGACGAG AAGCTGATGC GGGTTCATGA GCCCGAGCCG AAGGAACACG CGGAATTGGT GAAGCGCCTT CACGAACCAT CGCTGCACGA TGTGCTCTTC GATGCGCTGC GGGCGCTCGG TAAATTGCCG GCGTATGCCC CGGACGCAAC GGATCGCGAT CGTTTCGAGG CGCGGGCATT CGCGATTCGC GATGTTTATG AAGACGAGAA GCACTTCCGC GATTGGATCG ACGTATGCGA GCGGCTGACG GAATTCGACG AACTGGTCGT GAGTTGGCGG CTGCGGCACA TCCAGATGGT GGAGCGCACC ATTGGTCTGA AGATGGGAAC CGGCGGAAGC ACGGGCGCAT CCTATTTGCG GCTGACGCTC GACAAGACGT TTTTTCCCGA GCTGTGGGAA GCGCGAACGA TGCTGCGCAA AGCCGAATAG
|
Protein sequence | MQTIDIHNHF FPESWPDLAA KFGTPDWPWI KHTEPGKATI MLGDREFRKI YSACWDMDVR LAEMDRDGVD LQIISATPVL FAYDRPADQA LECAKIFNDA ALELCARAPG RLKALCQVPL QDIDLACAEL DRAIKDGHLG VQIGNHVGEK NLDDAGIETF LHHCASVGAA VLVHPWDMMA RDRMPNYMAP WTVAMPAETQ LGIVTMILSG AFDRLPENLR ICFAHGGGSF AFLLGRMENA WSHHPVARGK SELPPRRYVN RFYVDSAVYD EEALCFLIST MGEDQVVLGS DYPFPLGEER IGKLIRTCKI DGSAKKKLLN ANAARFLGLK VEAESTAPVE ARSSSGCPVA HGNGAGDEHK LTYGSYLKIP ELLSLQQLQS EPPRHDELLF IVIHQTYELW FKELLHDLEA VVRCLQAVAR DPRARDEVYE AARLLRRCTE VLRVLVSQFT ILETMLPTHF LAFRDKLEPA SGFQSQQFRQ IEFLCGLRDE KLMRVHEPEP KEHAELVKRL HEPSLHDVLF DALRALGKLP AYAPDATDRD RFEARAFAIR DVYEDEKHFR DWIDVCERLT EFDELVVSWR LRHIQMVERT IGLKMGTGGS TGASYLRLTL DKTFFPELWE ARTMLRKAE
|
| |