Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1342 |
Symbol | |
ID | 3831899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1387653 |
End bp | 1389152 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637829278 |
Product | anthranilate synthase, component I |
Protein accession | YP_430198 |
Protein GI | 83590189 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00981315 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTTCACC CTGACCGGGC AACCTACCTG GACCTGGCCC GGAAGGCCCC TTATATTCCT GTCTGGACAG AGATTCTTGC CGATACAGAA ACTCCCATTT CCCTGTACTT AAAATTCGCC GGCGGTCCGG ACAGCTTCTT ACTGGAGAGT GTCGAAGGCG GGGAAAACCT GGGCCGCTAT TCCCTCATTG GCTTCGATCC CCTGCTGACC TTCACCGCCA GGAGTGGTAA GGCTTATTTA AGCACGAATA ACGCCACTGC CAGGTTGCTG GAAGAAAAAC CCTTCAAGGC TTTAAACAAT TTAATGGCCA GCCTGTCCCT ACCACCTAGT GAGGGACCAC GCTTTCAGGG CGGCCTGGTA GGCTACCTGG GATATGATAT GGTCCGGGAA CTGGAGGCTT TGCCCCCGGG GCCCGGTAAT GACCTGCAAA TACCAGATAC CCACCTGACC CTCCACCGTT GTTACCTGGT TTACGACCAT ATCCTGCGGA CAGTGAGGAT AACCTGCCTG GGGCGAGGCG GGGAGAATGC CCTTGCCGGG TATGAAGAAG CTGTAGCCGG GGTCAAGGGA ATTCTGGAAA AACTCGGGCG ATCTTCTGGC GGGTACCGGA ATGGGTATCC GCCGGCAGCA GGGCAACTGG TCCCGGAAGG GGTCTCCTGG CAGGCCAGCG TGACCCGACA GGAATTTACC GGGATGGTTA CAAAAGCTAA GGAGTATATC GCCGCCGGCG ATATCTTCCA GGTGGTCCTC TCCCAGAGAT TGAGCCTGCC CTTCAGGGAG GACGCCCTGG TCGTTTACCG GCACTTGCGA GCCCTTAACC CTTCGCCGTA TATGTTTTAC CTTAACTTCC CGGAGGTGCA ACTGGTAGGC GCATCGCCGG AAATGCTGGT GCGGGTGGAA AGGGGAACAA TCGATTATCG CCCCATCGCC GGTACCAGGC GCCGGGGACG GACTGCTGCC GAGGACAGGG CCCTGGCCGC GGAACTCCTG GCCAGCGAGA AGGAGCGTGC CGAACACCTG ATGCTCCTGG ACCTGGGGCG GAACGACGTG GGAAGGATAG CCGTCCCGGG CAGCCTGCAG GTGACCCGGC AGATGGTGGT GGAGTATTAT TCCCACGTCA TGCACCTGGT TTCCAGCATT ACCGCCCGCC TGGCTCCTGG CAGGAGTGCC CTGGACGCCC TTCTGGCCTG TTTCCCCGCC GGTACGGTGA CCGGGGCTCC CAAGGTACGG GCCATGGAGA TCATCACCGA ACTGGAGCCG GTCAACCGGG GACCCTATGC CGGCGCCGTA GGTTACCTGG GTCTCCATGG CAACCTGGAT ACCTGCATTG CCATCCGCAC CATAGTTTTT GCCCGGGGTC GGGCCTTCAT CCAGGCCGGG GCGGGCATTG TCGCCGACTC CGACCCGGAA GCCGAATATG AAGAGACCCT GAACAAAGCC CGGGCGCTGT TGCAGGTATT AAAAAAGCCG GAGGTGGGCA CCCGTGCTGC TAATGATTGA
|
Protein sequence | MFHPDRATYL DLARKAPYIP VWTEILADTE TPISLYLKFA GGPDSFLLES VEGGENLGRY SLIGFDPLLT FTARSGKAYL STNNATARLL EEKPFKALNN LMASLSLPPS EGPRFQGGLV GYLGYDMVRE LEALPPGPGN DLQIPDTHLT LHRCYLVYDH ILRTVRITCL GRGGENALAG YEEAVAGVKG ILEKLGRSSG GYRNGYPPAA GQLVPEGVSW QASVTRQEFT GMVTKAKEYI AAGDIFQVVL SQRLSLPFRE DALVVYRHLR ALNPSPYMFY LNFPEVQLVG ASPEMLVRVE RGTIDYRPIA GTRRRGRTAA EDRALAAELL ASEKERAEHL MLLDLGRNDV GRIAVPGSLQ VTRQMVVEYY SHVMHLVSSI TARLAPGRSA LDALLACFPA GTVTGAPKVR AMEIITELEP VNRGPYAGAV GYLGLHGNLD TCIAIRTIVF ARGRAFIQAG AGIVADSDPE AEYEETLNKA RALLQVLKKP EVGTRAAND
|
| |