Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_1569 |
Symbol | |
ID | 5694406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | - |
Start bp | 1867098 |
End bp | 1868585 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641264164 |
Product | anthranilate synthase |
Protein accession | YP_001529450 |
Protein GI | 158521580 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000157437 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCCTTC AACAGTTTCC GGACAAAACG GCATTTCGTG AGATGGCGCA GACCGCCAAC ACCATTCCCG TGTGCGCCGA GATTCTGGCG GATACCGAGA CTCCGGTCTC TATCTTGAAA AAGCTCTACA CCGGAAAAGG GCCGATTTTC CTGTTTGAAA GCGCGGAAGG CGGCGAGCGG TGGGGCCGGT ACAGCTTTCT GGGGGCATCG GCAAAGGCCC ATGTAAAACT GTTCCGGGAC CACGTGGAGA TTTTGGACAA CGGCACGGTG CAGCGGATTC GACACAACGG CTTGCCCCTG CAAATCCTGC GAAACATCAT GGCCGACTAC ACGCCGGCCG AGATGCCTGG CCTGCCCCGG TTCTGGGGCG GGCTGGTGGG GTATTTTACC TACGAAATGG TCTCATTTTT TGAAAAGATT CCCAACCGGC TCCCCGAAGA CCGGCCCATT GCCGCTTTCA TGATTCCGGA CGAGCTGATC ATCTTTGACA ACATCCGCCA CACGCTGGTA GCCCTGGCCA TTGCCTTTAC CCGATCAGCA AAAAGCATTG ACGCGGCTTA CGAGGCGGCG GAAAAGCGGG TTGAAAAACT GCTTGCCGTG GTGCAGGCGC CCCTGCCCGT GGAGGCCGGC AACTCAGCCG CAGCCCCCTG TGTGCTGGCC CCTGAAAGAA CAGATGAGGA TTACAGAACA ATGGTCGGGG TCACCAAGGA CTATATCCGC CAGGGTGAAA TCATTCAGGC GGTGCTGTCC CAGCCCTTTT CCTGCTGCCC GGCCCCGGAT CTCTGGACCC TTTACCGGGC CCAGCGGTAC ATCAACCCGT CGCCATACCT TTTCTTTCTG CACATCGGGG AAACGGCCCT GGTGGGGTCA TCGCCGGAAA CCATGGTGCG GCTTGAAAAC CGCATCGCTA CGGTGCGGCC CATTGCCGGC ACCCGGCCCC GGGGAAGGAC CGAACAGGAG GACCGGGCAC TGGCCGATGA AATGCTGAAG GACGAAAAAG AAAGGGCCGA GCACCTGATG CTGGTGGACC TGGGCAGAAA CGACCTGGGC CGGGTGGCGG TCACCGGCAC GGTACAGGTC ACCGACCTGA TGGTGGTGGA GCGCTACTCT CATGTGATGC ACCTGGTCTC AAACGTACGC TGCGACCTTG AACGGGACCT TGACGCGTGG GACCTGCTGG CCGCCACCTT TCCGGCCGGC ACCCTTTCCG GCGCGCCCAA AATCCGGGCC ATGCAGATCA TCGATGAGCT TGAAAAGGGT CCCCGGGGAC CGTATGGCGG AGCCGTGGGA TATATCTCCT TCAGCGGCAA CATGGATCTG GCCATCACCA TCCGCACGGC CTGCATTGAA GACGATTGTC TCACGGTGCG GGCCGGAGCC GGCATCGTGG CCGACTCGGA CCCGGAAAAG GAGCGCGTTG AAACCGTGAA CAAGGCCAGG GCCATTCAGA AAGCCCTGGA ACTGGCCGGC AGACAACACA CCCATTAA
|
Protein sequence | MILQQFPDKT AFREMAQTAN TIPVCAEILA DTETPVSILK KLYTGKGPIF LFESAEGGER WGRYSFLGAS AKAHVKLFRD HVEILDNGTV QRIRHNGLPL QILRNIMADY TPAEMPGLPR FWGGLVGYFT YEMVSFFEKI PNRLPEDRPI AAFMIPDELI IFDNIRHTLV ALAIAFTRSA KSIDAAYEAA EKRVEKLLAV VQAPLPVEAG NSAAAPCVLA PERTDEDYRT MVGVTKDYIR QGEIIQAVLS QPFSCCPAPD LWTLYRAQRY INPSPYLFFL HIGETALVGS SPETMVRLEN RIATVRPIAG TRPRGRTEQE DRALADEMLK DEKERAEHLM LVDLGRNDLG RVAVTGTVQV TDLMVVERYS HVMHLVSNVR CDLERDLDAW DLLAATFPAG TLSGAPKIRA MQIIDELEKG PRGPYGGAVG YISFSGNMDL AITIRTACIE DDCLTVRAGA GIVADSDPEK ERVETVNKAR AIQKALELAG RQHTH
|
| |