Gene Mboo_2125 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_2125 
Symbol 
ID5410212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp2197644 
End bp2198840 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content53% 
IMG OID640869370 
Productaminotransferase, class V 
Protein accessionYP_001405282 
Protein GI154151664 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.573036 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAGAAC AGCGTATAAT CTACATGGAT CATTCCGCGA CAACGTATAC AAAAGAAAAT 
GTTGTTGAGG CAATGCTTCC CTACTTTACC CGGCACTTTG GAAATCCTTC GTCCATCTAT
GGTATCGCCC GCTATACAAA GCAGGCCATA GACACGGCCC GGGCCCAGGT GGCAAAGGCA
ATCGGGGCAG AGCCCGATGA GATCTACTTT ACTTCGGGAG GCAGCGAATC CGACAACTGG
GCAATAAAAG GTGTTGCATA CGCCAACCGG AAACGGGGTA ACCATATTAT TACCACAAAG
ATCGAGCACC ATGCGGTCAT TCATACCTGC CAGTTCCTTG AAAAAGAGGG ATTTGCGGTC
ACCTACCTCC CGGTGGACAA GTATGGGCGT GTCGATCCGG CAGAACTTGA AAAAGCGATC
ACCGACAAAA CGATCCTTGT CTCGATCATG TATGCGAACA ACGAGATCGG TACGATCGAA
CCCATCCGGG AGCTCGCGGC AATAGCACAG AAGCATAAGA TATACTTCCA TACCGATGCA
GTGCAGGCAA TCGGGAATGT CCCTATCAAT GTCAGGAACG AAAAGATCGA TCTGCTCTCC
CTTTCCGCCC ATAAATTCTA CGGGCCCAAG GGCGTCGGCG CCCTCTATAT CCGGAATGGT
GTCCGGCTCG ATAACCTGAT CCATGGCGGC GGGCAGGAAA AGAAGAGACG GGCCGGCACG
GAGAATATTG CGGGAATTGT CGGATGCGGG AAGGCAATAG AGCTTGCCAC TGCCGATATT
GAGGGGCATA ATGTACGGAT TCGTGCGCTG CGCGATCGGC TCCTCAAGGG GATCCTTGAG
AGGATTCCCC ATGCATACCT CAACGGCCAC CCCACAGAGC GGCTGCCGGG GAACATCAAT
ATCAGTTTTG AGTTCATCGA AGGGGAATCC ATGCTCCTGT GGCTGGACGA CGAGGGGATC
TGTGCCTCGA CCGGGAGCGC CTGTACCTCC GGCTCACTCG AACCCTCGCA TGTGCTTCTT
GCCACAGGTC TTCCCGTTGA GATCTCGCAC GGCTCTCTCC GGCTGACCCT TGGTGATGTT
AATACAGAGC AGGATGTGGA TACTGTGCTT GAGGTACTGC CAAAAGTGGT ATCCCGTCTG
AGGGAGATGT CTCCCTTGTA CCAGTCTGCA GGAAAGAAAG GAGGGTGTAA TGTATAG
 
Protein sequence
MGEQRIIYMD HSATTYTKEN VVEAMLPYFT RHFGNPSSIY GIARYTKQAI DTARAQVAKA 
IGAEPDEIYF TSGGSESDNW AIKGVAYANR KRGNHIITTK IEHHAVIHTC QFLEKEGFAV
TYLPVDKYGR VDPAELEKAI TDKTILVSIM YANNEIGTIE PIRELAAIAQ KHKIYFHTDA
VQAIGNVPIN VRNEKIDLLS LSAHKFYGPK GVGALYIRNG VRLDNLIHGG GQEKKRRAGT
ENIAGIVGCG KAIELATADI EGHNVRIRAL RDRLLKGILE RIPHAYLNGH PTERLPGNIN
ISFEFIEGES MLLWLDDEGI CASTGSACTS GSLEPSHVLL ATGLPVEISH GSLRLTLGDV
NTEQDVDTVL EVLPKVVSRL REMSPLYQSA GKKGGCNV