Gene Mboo_1991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1991 
Symbol 
ID5410245 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp2054429 
End bp2055589 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content56% 
IMG OID640869232 
Productcystathionine gamma-lyase 
Protein accessionYP_001405148 
Protein GI154151530 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTTG AGACCGCCGC AGTGCATGCC GGAGAAGAAC CGGCCTTTTC CGAGGGGGCA 
TCAGGAGACG TTGTAATTCC CATTCACCTC TCGTCCACAT TTGCACGCGT AGACGTGGCA
AAACCGACCG GAGGATATGA ATATTCCCGG AGCAGTAACC CGACGCGTCT GGCACTCGAA
CAGAAACTGG CGGCAATCGA GGGTGCACGC TTTGGTTTGG CGTTCTCCTC AGGACTTGCC
GCGGAAACCA CCGTAGCCCT CTCCCTGCTC AAAAAGGACG AACACGTAGT TGCATTCGAT
GATCTCTATG GAGGCACACG AAGACTCTTT ACCCGGGTGT TCCAGGAAAA CTACGGGATT
GACTTCTCTT ACGTGGATGC ACGGGATGCG GAGAATGTGA AATCCGCACT CCGGAAAGAC
ACCCGTTTTG TCTGGCTGGA GAGCCCCACC AACCCCCTCA TCCGGCTCTG TGACATCCGG
GAGATTGCCG GGATCGCCCA CGATGCTGGA GCGCTTGTGA TCGTGGACAA TACCTTTGCA
AGCCCGTACT TCCAGCACCC GCTTGCGCTC GGTGCGGATA TCGTGGTCCA CAGCACAACC
AAGTACATCA ACGGCCATTC GGACTCAGTG GGTGGGGCCG TGATGCTCTC TGAAGAAGAC
CTGTACCAGC GGATCCGGTA CAACCAGAAC GCTGCAGGTG GCATCCTCTC ACCTTTCGAC
AGTTTCCTTG TTGCACGGGG CATAAAGACG CTCGCCCTGC GGATGGAACG GCACCAGAAA
AATGCCCTTA CTCTTGCAAA GTACTTTGAA GGGCACGAAA AAATCAGCGC CGTCTACTAT
CCGGGCCTCC GCACCCATCC GCAATATGCG CTTGCCAAAA AGCAGATGGA CGGATTCTCC
GGTATGATTT CCTTTGAGGT TAAGGGAGAA GGGAAGGCTG CGCTCAGATT CCTGCGCTCC
CTCTCTCTCT TCGCGCTTGC CGAGAGCCTC GGAGGGGTTG AATCGCTGAT TGAACACCCG
GCAAGCATGA CCCATGCCTC TATCCCGAAA CACGAGCGGG AGAAGGTAGG GGTGACCGAC
TCGCTCATCC GCGTATCGGT TGGCATTGAG AATGTAAAGG ATCTTGTCGA TGACCTGGAA
CAGGCATTTG AAGAGATCTG A
 
Protein sequence
MKFETAAVHA GEEPAFSEGA SGDVVIPIHL SSTFARVDVA KPTGGYEYSR SSNPTRLALE 
QKLAAIEGAR FGLAFSSGLA AETTVALSLL KKDEHVVAFD DLYGGTRRLF TRVFQENYGI
DFSYVDARDA ENVKSALRKD TRFVWLESPT NPLIRLCDIR EIAGIAHDAG ALVIVDNTFA
SPYFQHPLAL GADIVVHSTT KYINGHSDSV GGAVMLSEED LYQRIRYNQN AAGGILSPFD
SFLVARGIKT LALRMERHQK NALTLAKYFE GHEKISAVYY PGLRTHPQYA LAKKQMDGFS
GMISFEVKGE GKAALRFLRS LSLFALAESL GGVESLIEHP ASMTHASIPK HEREKVGVTD
SLIRVSVGIE NVKDLVDDLE QAFEEI