Gene Mpal_0219 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0219 
Symbol 
ID7270604 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp251200 
End bp252753 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content59% 
IMG OID643568871 
ProductAnthranilate synthase 
Protein accessionYP_002465328 
Protein GI219850896 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCA CAAATGTACA TCAAAACTTT GAGGATGAAT TGAAGATGAA CCTCTCTAAG 
GAGGAGTACA CCACCCTGGC AAGAGACCTG GCGAAGCCGC TGCTGATCCC TCTTACCTGC
ACGATCCCGA TCGATGACCT CTCGCCGGCC GTGGGGTACC GTGCGCTGGC AAAAGGGGTG
GGAGCACTGC TCGAATCGGT CGAAGGACCG ACCAGACTGG CACGCTACTC GTTCATCGCC
ATCGACCCGC CTCTGAAAAT CCAGTTCAGA GGCAATGGAG GAGTGGAGCT TGACGGAGAT
TCTCGATTCA TCGCGATCGC CACGGCACCA AAAGGAAGGA ATCCAGTAGA ACAGCTCGAA
TCGGTGATGA GCCGGTTCAC CTATGCAGGG ATACGGGTTC CCCCATTTGC CGGCGGGATG
ATCGGTTCAT TCTCATATGA ACTGGCACCA CAGATCCATC CAGGACTCAG GCCCTCATTA
CGACAGATCA GGGAGGAACC GTTCCTCGGC ACCTTCATGC TGGTCACCGG GGGAGCCGTC
TTCGAACACC TTGCAGGGAC CATCACCCTC TTCACGACCC CCATGCTCGG TCAGGAACAG
GACGCTGGTG CAGCCTACGA ACAGGCCCGT GAGCATCTAC GCCTCCTCTG CAAAACCCTG
GACCAACTTC GGAAAGAGAC ACCCCGCCAG ATCTTTCCAG AGGAGAGGAG GAGAAACGCG
GAGACCTACA TCTCCTCCCT CTCACCCGCA GCCTACCAAG ACGCGGTCCT CAAAGCCAGA
GAGCATATCC ATGCAGGCGA CATCCTGCAG GCCGTCATCT CGCGGCAGAT CACCTGCCCG
TATGCCGGGG ATCCATTCCT CCTCTACCGT GCCCAACGGG CGATCAACCC GGGACCCTAC
CTCTACTACC TGGACTTTCA GGACCACCAG ATCGCAGGTT CGAGCCCTGA GATGCTGGTT
CGGGTGGAGG GGAGAACAGT CACCACCGTT CCGATAGCCG GGACCAGACG ACGGGGAAAG
AACGAGGAGG AGGACCTGGC ACTGGCCACG GACCTGCTCA ACGATCCCAA AGAACGAGCT
GAACATCTGA TGCTCGTAGA CCTGGCCAGA AATGACATCG GCAGGGTCAG CACCTACGGC
TCGGTCAGGG TCAGGGACTT CATGACGATC GAGCGCTTCT CCCATGTCCA GCACATCGTC
TCGACCGTCC AGGGCACCCT CGCCGATCAC CTGACCTGCT TCGATGCGTT CACCTCCTGC
TTCCCTGCAG GGACCGTTTC AGGGGCTCCA AAAGTCAGAG CAATGGAGAT CATCAACGAT
CTTGAACCGC AGGATCGCGG CCTCTATGCT GGGGCAGTCG GGTACATCGG CTTCGATCGG
ACCCTGGACT TTGCCATTGC CATACGGACG GTCGTGATCA GAGACGGGAT CGCTGCGATA
CAGGTCGGTG CAGGGATCGT CGCCGACTCG GTGCCGGAGC ACGAGTGGAA GGAGACCGAA
GCAAAGGCAG CAGCGATGAT GCAGGCACTC GACCTGGCAG GAGGGAGTGT ATGA
 
Protein sequence
MKITNVHQNF EDELKMNLSK EEYTTLARDL AKPLLIPLTC TIPIDDLSPA VGYRALAKGV 
GALLESVEGP TRLARYSFIA IDPPLKIQFR GNGGVELDGD SRFIAIATAP KGRNPVEQLE
SVMSRFTYAG IRVPPFAGGM IGSFSYELAP QIHPGLRPSL RQIREEPFLG TFMLVTGGAV
FEHLAGTITL FTTPMLGQEQ DAGAAYEQAR EHLRLLCKTL DQLRKETPRQ IFPEERRRNA
ETYISSLSPA AYQDAVLKAR EHIHAGDILQ AVISRQITCP YAGDPFLLYR AQRAINPGPY
LYYLDFQDHQ IAGSSPEMLV RVEGRTVTTV PIAGTRRRGK NEEEDLALAT DLLNDPKERA
EHLMLVDLAR NDIGRVSTYG SVRVRDFMTI ERFSHVQHIV STVQGTLADH LTCFDAFTSC
FPAGTVSGAP KVRAMEIIND LEPQDRGLYA GAVGYIGFDR TLDFAIAIRT VVIRDGIAAI
QVGAGIVADS VPEHEWKETE AKAAAMMQAL DLAGGSV