Gene Information |
Locus tag | Mpal_0219 |
Symbol | |
ID | 7270604 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 251200 |
End bp | 252753 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643568871 |
Product | Anthranilate synthase |
Protein accession | YP_002465328 |
Protein GI | 219850896 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | |
|
|
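The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the function names `span_length` and `coding_protein_length` are illustrative and do not come from the database.

```python
# Values copied from the Gene Information table above.
START_BP = 251200
END_BP = 252753
GENE_LENGTH_BP = 1554
PROTEIN_LENGTH_AA = 517


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


print(span_length(START_BP, END_BP))          # 1554
print(coding_protein_length(GENE_LENGTH_BP))  # 517
```

Because the record lists the gene as unspliced and not a pseudogene, the simple "one stop codon, no introns" assumption is reasonable here; it would not hold for spliced genes.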
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCA CAAATGTACA TCAAAACTTT GAGGATGAAT TGAAGATGAA CCTCTCTAAG GAGGAGTACA CCACCCTGGC AAGAGACCTG GCGAAGCCGC TGCTGATCCC TCTTACCTGC ACGATCCCGA TCGATGACCT CTCGCCGGCC GTGGGGTACC GTGCGCTGGC AAAAGGGGTG GGAGCACTGC TCGAATCGGT CGAAGGACCG ACCAGACTGG CACGCTACTC GTTCATCGCC ATCGACCCGC CTCTGAAAAT CCAGTTCAGA GGCAATGGAG GAGTGGAGCT TGACGGAGAT TCTCGATTCA TCGCGATCGC CACGGCACCA AAAGGAAGGA ATCCAGTAGA ACAGCTCGAA TCGGTGATGA GCCGGTTCAC CTATGCAGGG ATACGGGTTC CCCCATTTGC CGGCGGGATG ATCGGTTCAT TCTCATATGA ACTGGCACCA CAGATCCATC CAGGACTCAG GCCCTCATTA CGACAGATCA GGGAGGAACC GTTCCTCGGC ACCTTCATGC TGGTCACCGG GGGAGCCGTC TTCGAACACC TTGCAGGGAC CATCACCCTC TTCACGACCC CCATGCTCGG TCAGGAACAG GACGCTGGTG CAGCCTACGA ACAGGCCCGT GAGCATCTAC GCCTCCTCTG CAAAACCCTG GACCAACTTC GGAAAGAGAC ACCCCGCCAG ATCTTTCCAG AGGAGAGGAG GAGAAACGCG GAGACCTACA TCTCCTCCCT CTCACCCGCA GCCTACCAAG ACGCGGTCCT CAAAGCCAGA GAGCATATCC ATGCAGGCGA CATCCTGCAG GCCGTCATCT CGCGGCAGAT CACCTGCCCG TATGCCGGGG ATCCATTCCT CCTCTACCGT GCCCAACGGG CGATCAACCC GGGACCCTAC CTCTACTACC TGGACTTTCA GGACCACCAG ATCGCAGGTT CGAGCCCTGA GATGCTGGTT CGGGTGGAGG GGAGAACAGT CACCACCGTT CCGATAGCCG GGACCAGACG ACGGGGAAAG AACGAGGAGG AGGACCTGGC ACTGGCCACG GACCTGCTCA ACGATCCCAA AGAACGAGCT GAACATCTGA TGCTCGTAGA CCTGGCCAGA AATGACATCG GCAGGGTCAG CACCTACGGC TCGGTCAGGG TCAGGGACTT CATGACGATC GAGCGCTTCT CCCATGTCCA GCACATCGTC TCGACCGTCC AGGGCACCCT CGCCGATCAC CTGACCTGCT TCGATGCGTT CACCTCCTGC TTCCCTGCAG GGACCGTTTC AGGGGCTCCA AAAGTCAGAG CAATGGAGAT CATCAACGAT CTTGAACCGC AGGATCGCGG CCTCTATGCT GGGGCAGTCG GGTACATCGG CTTCGATCGG ACCCTGGACT TTGCCATTGC CATACGGACG GTCGTGATCA GAGACGGGAT CGCTGCGATA CAGGTCGGTG CAGGGATCGT CGCCGACTCG GTGCCGGAGC ACGAGTGGAA GGAGACCGAA GCAAAGGCAG CAGCGATGAT GCAGGCACTC GACCTGGCAG GAGGGAGTGT ATGA
|
Protein sequence | MKITNVHQNF EDELKMNLSK EEYTTLARDL AKPLLIPLTC TIPIDDLSPA VGYRALAKGV GALLESVEGP TRLARYSFIA IDPPLKIQFR GNGGVELDGD SRFIAIATAP KGRNPVEQLE SVMSRFTYAG IRVPPFAGGM IGSFSYELAP QIHPGLRPSL RQIREEPFLG TFMLVTGGAV FEHLAGTITL FTTPMLGQEQ DAGAAYEQAR EHLRLLCKTL DQLRKETPRQ IFPEERRRNA ETYISSLSPA AYQDAVLKAR EHIHAGDILQ AVISRQITCP YAGDPFLLYR AQRAINPGPY LYYLDFQDHQ IAGSSPEMLV RVEGRTVTTV PIAGTRRRGK NEEEDLALAT DLLNDPKERA EHLMLVDLAR NDIGRVSTYG SVRVRDFMTI ERFSHVQHIV STVQGTLADH LTCFDAFTSC FPAGTVSGAP KVRAMEIIND LEPQDRGLYA GAVGYIGFDR TLDFAIAIRT VVIRDGIAAI QVGAGIVADS VPEHEWKETE AKAAAMMQAL DLAGGSV
|
| |
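The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares with the standard code for internal codons; table 11 differs mainly in the start codons it permits, which does not matter for an ATG start. The helper names `clean`, `translate`, and `gc_fraction` are illustrative only.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First 60 bases of the gene sequence, copied from the record above.
prefix = "ATGAAGATCA CAAATGTACA TCAAAACTTT GAGGATGAAT TGAAGATGAA CCTCTCTAAG"
print(translate(prefix))  # MKITNVHQNFEDELKMNLSK
```

The printed residues match the first two 10-residue blocks of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content field.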