Gene Mboo_0226 details

Gene Information

Locus tag: Mboo_0226
Symbol: trpD
ID: 5410469
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 213312
End bp: 214331
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 62%
IMG OID: 640867440
Product: anthranilate phosphoribosyltransferase
Protein accession: YP_001403391
Protein GI: 154149773
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0547] Anthranilate phosphoribosyltransferase
TIGRFAM ID: [TIGR01245] anthranilate phosphoribosyltransferase


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATGA AAGAGGCGAT TGCCACCGTA GTGGAGGGAC GGGATCTTGC TCCCGCCCAG 
GCCGCGGCAG TCATGAACAT GATGATGGAT GGGCAGGCAA CTCCGGCCCA GATCGGGGGA
TTTTTGACCG CACTACGCGC CAAGGGTGAG ACCCCGGAAG AGATCGCGGC GTTTGCCCGG
GTCATGAGGG AGCATACGGT CCACGTAAAG CCACGGGTGT CCGGAACATT GGTCGATACG
TGCGGCACCG GGGGAGACGG GGCGCAGACA TTTAACATCA GCACGGCTGC CGCCTTTGTT
GCCGCGGGAG CCGGCATCAC CGTGGTCAAG CACGGCAACC GGAGCGTGAG CAGCCGGTGC
GGATCGGCAG ATGTTCTTAC GGCTCTCGGT GTGGATATCA GCGTGGACCC GGGCCGGCAG
GCAGGGATCG TACAGGAGAC CGGGATCATC TTTCTCTTTG CACCCAGCCA CCACCCGGCC
ATGAAGCATG TGATGGCTAC ACGACAGGAT CTTGGCTGCC GGACGGTCTT TAACCTGCTC
GGGCCGCTGG CAAATCCTGC GGGAGCGGCA GCCCAGGTGC TTGGGGTCTA CGATCAAAAA
CTCACCGGTC CTATGGCAGA AGTCCTGAGT CTGCTAGGAG TGTCCCGGGC GATGGTCGTC
TTTGGCTCAG GCCTAGACGA GATCACAGTC ACGGGCGAGA CAAGCGTGAC CGAACTTGCC
AATAGCAGGA TCACGAACTA TATAGTTACA CCGGAACAGT TCGGATTTAC CCGGGCTGCA
CCGGGCGATC TTCTTGGCGG TGACCCGGAG AAGAACGCAC GCATCATCCG CGCCATTCTT
GACGGGGCGC CGGGCCCGGC CCGCGATATC GTGCTCATGA ACGCGGGCGC TGCCATTTAC
GTGGGAGGCC GGGCTGCAAC CCTTGCAGAG GGAATCCGGC ACGCAGCGGA GTCCATTGAC
TCAGGGAAAG CGGCGGGCAA ACTCGCCGCC CTCGTTACAG CAACCCGGGG TGCATCATGA
 
Protein sequence
MTMKEAIATV VEGRDLAPAQ AAAVMNMMMD GQATPAQIGG FLTALRAKGE TPEEIAAFAR 
VMREHTVHVK PRVSGTLVDT CGTGGDGAQT FNISTAAAFV AAGAGITVVK HGNRSVSSRC
GSADVLTALG VDISVDPGRQ AGIVQETGII FLFAPSHHPA MKHVMATRQD LGCRTVFNLL
GPLANPAGAA AQVLGVYDQK LTGPMAEVLS LLGVSRAMVV FGSGLDEITV TGETSVTELA
NSRITNYIVT PEQFGFTRAA PGDLLGGDPE KNARIIRAIL DGAPGPARDI VLMNAGAAIY
VGGRAATLAE GIRHAAESID SGKAAGKLAA LVTATRGAS
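
The coordinates and lengths in this record can be cross-checked with a few lines of Python. The sketch below is illustrative only and is not part of the database record. The helper names (`clean_sequence`, `expected_protein_length`, `gc_fraction`) are hypothetical, and it assumes that the start and end positions are inclusive and that the coding sequence ends in a single stop codon.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(text.split()).upper()


def expected_protein_length(start_bp: int, end_bp: int) -> int:
    """Protein length implied by inclusive gene coordinates.

    Assumes an unspliced CDS whose last codon is a stop codon.
    """
    gene_length = end_bp - start_bp + 1  # inclusive coordinates
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence as printed in the record.
    first_line = "ATGACCATGA AAGAGGCGAT TGCCACCGTA GTGGAGGGAC GGGATCTTGC TCCCGCCCAG"
    print(len(clean_sequence(first_line)))           # 60
    print(expected_protein_length(213312, 214331))   # 339
```

With the values listed above, 214331 - 213312 + 1 gives 1020 bp, which is 340 codons, or 339 amino acids once the stop codon is excluded. This agrees with the stated gene and protein lengths. Passing the full cleaned gene sequence to `gc_fraction` would allow the same kind of check against the listed GC content.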