Gene Mboo_0228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_0228 
Symbol 
ID5410471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp214918 
End bp216465 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content59% 
IMG OID640867442 
Productanthranilate synthase 
Protein accessionYP_001403393 
Protein GI154149775 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
[TIGR01820] anthranilate synthase component I, archaeal clade 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACCG ATACCCTTCA CATACCACTG ACCTTAAATC TCGGATCCGA GGAATACCTC 
ACCGCGGTAG CCGACCAGCC AAAACCGCTG GTGATCCCGC TCTGTGCCGA GCTAACACTC
CCTCAGTGTT CGCCGCTGGA GGTATTCCTT GCTACCCGGG CCGGGCCCGG GTTCCTCTTA
GAATCCATGG AAGGCAGCGA GAAGATCGCC CGTTATTCGT ACATGGGGAT CAATCCTGCA
TGTGTGATCA CACTTGGCCG GGAGGTATCC GTGGAAGGCA GCGACGCGTT TGTCGCCATT
GCCCACGAGC CGGAGGGGAA AAACCCGGTG GACAGGATCC GCTCCATCCT CTCGCGGTTC
CATTACATCA ACCTCAGGGC CCCGCGGTTC TTTGGCGGGA TGGTCGGCTA CTTCTCGTAC
GATATTGTGC ATGATCTCTT CGAACAGGTG CCCGACCACC GGACAAAGCG GGGCCGGGAA
TGCCCGGATG CCCGGTTCAT GCTCACAAAG GACTGCATCG TTTTTGACCA CCGGGACCGG
AGGCTGTTTG TCTTTTCCAG CCCGTTTCTG ACCTACGATA CCGACCCGGC GGCAGAATAC
CGGAGGTGCG CCGCTCACAT CGCAGAACTC ACCGACCGGA TCGCATCGGT TGCGGGGACT
CCCAAAAAAC CCGATGATGT GATGCAAACC GTGACAGGAA AGGTTGGAAA AAAACCGGAG
ATTACGGATA ATACCGGGCG CGAAGCCTTC ATGCACGCAG TCAGCGGGAT TAAAGAGCAC
ATTGTTGCGG GAGATATCTT CCAGGCCGTG CTCTCGCGGA GGATGGAATG CCCGGTTGTA
CCGGATCCAT TCCCCATTTA CGCAGCGCTG CGGTCAATCA ACCCGAGCCC GTACATGTAT
TACCTGGATT TCGGGGATGA GCAGGTAATC GGCGCAAGCC CTGAGATGCT TGTGCGGGTA
GAGAAACGGC GCGTGACAAC GGTGCCCATT GCCGGCACCC GGCCCCGGGG CGCAGACAAG
AGCGAGGACA AGAAGCTCGC GCAGGAGCTG CTGCTTGACA AAAAGGAGCG GGCCGAGCAT
ACGATGCTCG TGGATCTCGC CAGAAACGAC CTCGGCAGAG TCTGCAAGTT CGGATCGGTG
GAAGTCAGCG AGTTCATGGG CATTGAAAAG TTCTCGCATG TCCAGCACAT GGTCTCAACC
GTCCAGGGCA CGCTCATGGA CCACCTTGAC GGCTGCGATG CGCTCAAGTC CTGCTTCCCG
GCCGGGACCG TGTCCGGCGC TCCAAAGATC CGGGCCATGC AGATCATCGG CGAAAGCGAG
CCGGACGCAC GCGGGATTTA CGCGGGCGCC GTGGGCTATA TCGGGTTTGA CCGGAACCTT
GAATTTGCCA TCGCGATCCG TACTGTGACG GTGAAAGATG GTCGGGCCTC TGTCCAGGCC
GGCGCCGGGA TCGTTGCAGA CTCGGTACCG GAGAACGAGT GGACCGAGAC CGAGAACAAG
GCCGCGGCAA TGATGAAGGC AATCGAGCAG GCAGGTGTCG TGCCATGA
 
Protein sequence
MATDTLHIPL TLNLGSEEYL TAVADQPKPL VIPLCAELTL PQCSPLEVFL ATRAGPGFLL 
ESMEGSEKIA RYSYMGINPA CVITLGREVS VEGSDAFVAI AHEPEGKNPV DRIRSILSRF
HYINLRAPRF FGGMVGYFSY DIVHDLFEQV PDHRTKRGRE CPDARFMLTK DCIVFDHRDR
RLFVFSSPFL TYDTDPAAEY RRCAAHIAEL TDRIASVAGT PKKPDDVMQT VTGKVGKKPE
ITDNTGREAF MHAVSGIKEH IVAGDIFQAV LSRRMECPVV PDPFPIYAAL RSINPSPYMY
YLDFGDEQVI GASPEMLVRV EKRRVTTVPI AGTRPRGADK SEDKKLAQEL LLDKKERAEH
TMLVDLARND LGRVCKFGSV EVSEFMGIEK FSHVQHMVST VQGTLMDHLD GCDALKSCFP
AGTVSGAPKI RAMQIIGESE PDARGIYAGA VGYIGFDRNL EFAIAIRTVT VKDGRASVQA
GAGIVADSVP ENEWTETENK AAAMMKAIEQ AGVVP