Gene Mboo_1472 details


Gene Information

Locus tag: Mboo_1472
Symbol:
ID: 5409854
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 1511757
End bp: 1513493
Gene Length: 1737 bp
Protein Length: 578 aa
Translation table: 11
GC content: 56%
IMG OID: 640868707
Product: amidohydrolase 3
Protein accession: YP_001404633
Protein GI: 154151015
COG category: [C] Energy production and conversion
COG ID: [COG1229] Formylmethanofuran dehydrogenase subunit A
TIGRFAM ID: [TIGR03121] formylmethanofuran dehydrogenase subunit A
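
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes the terminal stop codon; the function name cds_lengths is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a gene length that
    includes one terminal stop codon (assumptions, not database facts).
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and contributes no amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(1511757, 1513493)
print(gene_len, protein_len)  # 1737 578
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields of this record.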


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.883375
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGGA TTTCCGGTGA AATTATAATC AGGGGAGGAT TTGTTGTCGA TCCGTCCCAG 
AAGATCGACG GGGATGTTGC CGATATCGCA ATAAAGGACG GCAAGATCGT TGATAAGGTC
AGCAGTGCAG CAAAGGTCAT CGACGCCAAA GGCAAGGTTG TCATGGCCGG CGGGGTGGAT
GTTCACTCTC ACGTGGCAGG TCCCAAGGTT AACGTAGGCC GCCTGATGCG CCCCGAGGAC
AAGCTCCTCT CTGGCGTATC CCGAAGCGCA ATGGCGCAGG CAAACGGTTT CCGTATGGAG
TCGGGTTTCT CCATCCCCAG CGTCTTAAAG ACCGGGTACG ACTACGCCCG CATGGGCTAC
GGCTTTGTCA TGGAAGCGGC AATGCCCCCG ATTCACGCCC CTCACGTACA CGAGGAGATC
CACGACACTC CCATTATTGA TGAAGCGGCA CTGCCGGTCT TTGGGAACAA CTGGTTTGTC
ATGGAATACC TCAAGAAAGG CGAGATCGAG AACACGGCAG CGTATATTGC CTGGCTGATC
CGTGCAACAA AGGGATTCGG CATCAAGGTT GTCAACCCCG GTGGCACGGC GGCATGGGCA
TGGGGGCTCA ACTGTCTCTC GCTCAGCGAC AATGTTCCCT ACTTTGACAT TACGCCCCAC
GAGATCATCA CCGGCCTCAT ACAGGCAAAC GAGTACCTTG GCCTGCCCCA TTCGGTCCAC
CTTCACCAGA ACGATCTGGG TAACCCCGGG AACTACAAGG TCACCCTTGA CTCCCTGCGT
CTTGCCGAGG GCGTTAAGGC AAAGAACAAG TTCGGCCGCG AACAGGTCAT TCACTCGACC
CACATCCAGT TCCACTCGTA CGGCGGGGAT TCCTGGGCAA ACTTCGAGAC CAGGGCAAAG
GATGTCATGG ACTACGTCAA CAGGCAAAAG AACATCACCG TAGATCTTGG CTGCGTAACG
CTGGATGAAA CCACGACCAT GACCGCGGAC GGTCCGTTCG AGCACCACTT AACCGGCCTC
AACCACCTCA AGTGGGCAAA CACCGATGTC GAGATGGAGA CCGCAGCAGG TGTTGTTCCC
TATGTGTATG ACCCTAACAT CAAGGTCTGT GATATCCAGT GGGCGATCGG CCTTGAACTG
GGACTCTATG CAAAGGATCC AATGCGCTGC TTTGTCACCA CCGACCACCC GAACGCCGGG
CCATTTACCC GCTACCCCCG CATCATCAAG TGGCTCATGA GCAAAAAGGC ACGGGAGGCC
ACGCTTGATT CCTTTAAGCA CAAGGACAAG GTTATCGAGG CAACCGATCT GCACTCCCTT
GACCGTGAGC TCACCCTGTA CGAGATCGCT GCAATGACCC GGGCTGGCCC GGCAAAGTGC
CTGGGCCTCT CCAGCATCTA CGGGGGTCTT GCCCCGGGCA TGAACGCCGA TGTTGCAGTC
TTTGACCTCA ATTACAAGAG CATGCCAAGC GATCCCGAAA AGATCGAGTC CGCATTCCTG
CGGGCGGCCT GCTTTGTCAA GTCAGGCGAG ATCGTAGTAA AAGACGGCGA GGTGCTCAGC
CACGGCCACA AGAAGACGGT CTGGGTCAAC CCGAAGATGA AAGAAAACCC CCAGGTTAAG
CGCGATATCG CAGAGAGCTT CAACAAGGGA TATTACACGG TCGGGCTCAC CAATTACCCG
GTCCGCGAAT ACCTTGCACC ACACCCGTTC GTGATCGATG TCGATGTGGA GGCCTAG
 
Protein sequence
MSGISGEIII RGGFVVDPSQ KIDGDVADIA IKDGKIVDKV SSAAKVIDAK GKVVMAGGVD 
VHSHVAGPKV NVGRLMRPED KLLSGVSRSA MAQANGFRME SGFSIPSVLK TGYDYARMGY
GFVMEAAMPP IHAPHVHEEI HDTPIIDEAA LPVFGNNWFV MEYLKKGEIE NTAAYIAWLI
RATKGFGIKV VNPGGTAAWA WGLNCLSLSD NVPYFDITPH EIITGLIQAN EYLGLPHSVH
LHQNDLGNPG NYKVTLDSLR LAEGVKAKNK FGREQVIHST HIQFHSYGGD SWANFETRAK
DVMDYVNRQK NITVDLGCVT LDETTTMTAD GPFEHHLTGL NHLKWANTDV EMETAAGVVP
YVYDPNIKVC DIQWAIGLEL GLYAKDPMRC FVTTDHPNAG PFTRYPRIIK WLMSKKAREA
TLDSFKHKDK VIEATDLHSL DRELTLYEIA AMTRAGPAKC LGLSSIYGGL APGMNADVAV
FDLNYKSMPS DPEKIESAFL RAACFVKSGE IVVKDGEVLS HGHKKTVWVN PKMKENPQVK
RDIAESFNKG YYTVGLTNYP VREYLAPHPF VIDVDVEA
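
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. The following is a minimal sketch, not the method used to produce this record: it builds the codon table from the standard amino-acid assignments (which translation table 11 shares with the standard code), ignores the alternative start codons that table 11 permits, and stops at the first stop codon. The names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is removed.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGCGGGA TTTCCGGTGA AATTATAATC"))  # MSGISGEIII
# Last line of the gene sequence above (in frame, ends with the TAG stop).
print(translate("GTCCGCGAAT ACCTTGCACC ACACCCGTTC GTGATCGATG TCGATGTGGA GGCCTAG"))
# VREYLAPHPFVIDVDVEA
```

Both fragments reproduce the corresponding start and end of the protein sequence listed above.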