Gene Mboo_1402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1402 
Symbol 
ID5410804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1432106 
End bp1433239 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content52% 
IMG OID640868635 
Producthypothetical protein 
Protein accessionYP_001404563 
Protein GI154150945 
COG category[S] Function unknown 
COG ID[COG2237] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00889164 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.436536 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAAG AGCGGACGCT CGTCCTGAGC GTGGATCGGG ACGATGATAT CGGGTGGAAA 
GCCAAGATTG AAAGCCCGGT TGTTGGCAGG CAGGAATGCC TTAAAGCGGC AGACGGGCTT
GCGCTTGCCG ATCCCGAGGA CTCTGATGTC AATGCGATCT TTTCTGCGGT AAAAATTTAT
GATGAACTGA TCGCAAAAGG GGAAGATGCA GTAGTGGCGG TGATTGCCGG CAATCACCTC
CATATGATCG AGGGTGACCG GAAGATCGCA GCAACCCTCG ATGATGTGAT AAAAGCAACG
CAGCCGACCT CCTGCATTCT TGTTTCCGAC GGGGCCGAAG ATGAATTCGT CGTTCCCATT
ATCCAGTCCC GGATCCCGGT GGCAAGTATC CGGCGCGTGA TTGTCAACCA GATGCCCAAC
CTTGAGGGGA CCTATTATAT CCTCAAAAAG CTTCTGGACG ATCCCAAGGT ATCCCGGGTG
GTATTTGTCC CCCTCGGGCT TGCCATGTTG CTGTACGCAA GTGCGTACAT CCTTGGGTAC
CCCGGGACTG CAACCATCAT TGTCGTTGGC GTCATCGGGA TGTACCTGCT GTACAAAGGA
TTTGGCCTGG ATGAAGTGAT CCACGGGGTG ATCAATGCAC TTCGGGCATC GATGTCCCGG
GGTAAATTCT CCTTTGTCAC GTATTCCACT ACCATCGTGC TGGTGGTTAT CGGGTTTACC
ATCGGTTTTT TCACGGTACT GAACTATTAT GCAGCGGACA ACAGCCTCGG AGTCCTCCTG
TATGTCATGA GTTTTATATA TGGCGCTATC CTCTGGCTGA TCGTTGCCGG CCTTGTCACC
TCGCTTGGAG TTATCACCGA TGTGTACATC AATGAGCGGG AGAACCTGGT CAAAGTCGTT
GTTTTCCCGT TCTTTGTAAC GGCAATCGGG TTGATCCTGT ACGGGGCAAG TACCTATATC
CTTGCGGTCA GCAGTGTTTC CGGTTTCCCC ATTGGCCCGT CATCCGCCGG GATATATATT
GTGTACTCTA CCCTCTTCGG GCTCGCGTGC GCGATCGCCG GCGTTGTTGT TCAGTATGTC
CTTGCAAAAA GGGCGGCTGA AGCGCAGAAG GAAGCTATTA TCGAAACAAT CTGA
 
Protein sequence
MTKERTLVLS VDRDDDIGWK AKIESPVVGR QECLKAADGL ALADPEDSDV NAIFSAVKIY 
DELIAKGEDA VVAVIAGNHL HMIEGDRKIA ATLDDVIKAT QPTSCILVSD GAEDEFVVPI
IQSRIPVASI RRVIVNQMPN LEGTYYILKK LLDDPKVSRV VFVPLGLAML LYASAYILGY
PGTATIIVVG VIGMYLLYKG FGLDEVIHGV INALRASMSR GKFSFVTYST TIVLVVIGFT
IGFFTVLNYY AADNSLGVLL YVMSFIYGAI LWLIVAGLVT SLGVITDVYI NERENLVKVV
VFPFFVTAIG LILYGASTYI LAVSSVSGFP IGPSSAGIYI VYSTLFGLAC AIAGVVVQYV
LAKRAAEAQK EAIIETI