Gene Mboo_1047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1047 
Symbol 
ID5410181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1030043 
End bp1031485 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content56% 
IMG OID640868273 
Producthomospermidine synthase 
Protein accessionYP_001404208 
Protein GI154150590 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG5310] Homospermidine synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.129007 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTTTA AAAACAAAGT ACTGATCATC GGGTACGGCG CAGTTTCCAA GTGTACGCTC 
CCGATCCTGC TGCGTCACGT CAATATTCCG CTCGAGAATA TCACGATTGT TGATTTTAAA
GATAAATCCC AGGACCTCAA GGCCTATACG ACCCGGGGTC TCCGTTTCCT CAAACGGAAG
ATCACCCCGG AAAACCTTAC ATCGATTCTT ACAGAAACCC TTGGCGAGGG CGGTCTCCTT
GTCGATCTTG CCTGGAACAT CGATGCCGGC GAGATCGTCC AGTGGTGCCA TGACCACAAC
GTGCTGTACG TGAACACGTC CGTGGAGGCA TGGGACCCGC TGGGCGAGCG CTATACGGCA
AGCCCCGTAG AAAAATCCCT GTACTACCGG CAGATGAAAC TGCGGGAGCT GACCCGGGAA
TGGGCAGACT CCACAACCTG CGTGGTAGAT CATGGGGCAA ACCCGGGGCT TATCTCGCAC
ATGACCAAGC AGGGCCTTGT TGATATCGCC CATAAGATGA TCGAGGACGG GCTTGCAGAG
GATCCTGCGC GCTTCAAACG CCTCATCGCT GAGCAGAAAT TCAACGAGCT TGCCATGGAA
GTCGGAATCA AGGTGATCCA CTGCTCCGAG CGTGACACCC AGATCTCCCG GTTTCCAAAA
ATTGTGGACG AATTTGTCGG CACCTGGTGC ATCGAGGGGC TGCTCGAAGA GGGTACAGCC
CCTGCCGAGA TCGGGTGGGG TACCCACGAG AAGGAGCTGC CACCCGGTGC ACATGTGCCA
ACTGACGGCC CGAAAAACGC GATCATGATC CATCACATGG GGATCAACAC CTGGGTCCGA
TCCTGGGTGC CCGAGCAGGA GATCGTGGGC ATGGTGATCC GGCACGGCGA GGCATTCGGG
ATCTCCGACC GCTGGACGGT CTGGAAGGAT GGCAAAGCGA TCTACCGTCC GACCGTGAAC
TACGCGTACA TGCCCTGCGA TGCAACGATC GCCTCGCTCC ATGAACTGCG GGGGAGGAAC
TACGAGCTCC AGTCCCGGGT CCGTATCATG AACGACCGGG AGATCTCGGA AGGGTCCGAC
ATTCTTGGCG CACTTCTGAT GGGCCACCCG TATAACTCAT GGTGGACAGG GAGCATTCTC
TCGATTGAAG AGGCTCGAAA ACTTGCACCG GGCCAGAACG CAACCACGAT CCAGGTCGCG
CTAGGAGTGG TCTCCGCAGT GATGTGGATG ATCGAGAACC CGAAAAAAGG CTTCTGCCTG
CCTGACGACC TGCCCCACCA GTTTGTCCTG GATATTGCAA AACCCTACCT CGGGGAATTC
TGGTCCGGGC CCTCAGACTG GACTCCCTTA AAAAACCGGA CGGTCTATTT CCGCGAGAAC
CCGGATAACG ATTTCGACAG GGATGATGTC TGGCAATTCA AAAACTTCTT ATTTGTAAGA
TGA
 
Protein sequence
MDFKNKVLII GYGAVSKCTL PILLRHVNIP LENITIVDFK DKSQDLKAYT TRGLRFLKRK 
ITPENLTSIL TETLGEGGLL VDLAWNIDAG EIVQWCHDHN VLYVNTSVEA WDPLGERYTA
SPVEKSLYYR QMKLRELTRE WADSTTCVVD HGANPGLISH MTKQGLVDIA HKMIEDGLAE
DPARFKRLIA EQKFNELAME VGIKVIHCSE RDTQISRFPK IVDEFVGTWC IEGLLEEGTA
PAEIGWGTHE KELPPGAHVP TDGPKNAIMI HHMGINTWVR SWVPEQEIVG MVIRHGEAFG
ISDRWTVWKD GKAIYRPTVN YAYMPCDATI ASLHELRGRN YELQSRVRIM NDREISEGSD
ILGALLMGHP YNSWWTGSIL SIEEARKLAP GQNATTIQVA LGVVSAVMWM IENPKKGFCL
PDDLPHQFVL DIAKPYLGEF WSGPSDWTPL KNRTVYFREN PDNDFDRDDV WQFKNFLFVR