Gene GM21_2008 details


Gene Information

Locus tag: GM21_2008
Symbol:
ID: 8137342
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2328769
End bp: 2329902
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 63%
IMG OID: 644869621
Product: Cystathionine gamma-lyase
Protein accession: YP_003021818
Protein GI: 253700629
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases
TIGRFAM ID:
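
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon, and that the CDS is unspliced and ends in a single stop codon. The record does not state the coordinate convention, but the listed gene length is consistent with it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(2328769, 2329902)
print(length_bp)                  # 1134
print(protein_length(length_bp))  # 377
```

Both values agree with the Gene Length and Protein Length fields of the record.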


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 9.31334e-21
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAATTCG CGACCGAACT GATACACGGC GGAGCCCCAA TAGACCCGGA GCACGGCTCC 
CTGAGCCACC CCATCTACCT GAGTTCCACC TTCGCCCAGC GCTCCATGGA GCACTTCGGA
AAGTACGACT ACTCCCGCTC GGGGAACCCG ACCCGCGAGG CGCTCGAAGA GGCGGTGGCC
GGGCTTGAGA AAGGCGCCGT CGGCCTTGCC TTCGCCTCAG GGATGGCAGC CATCTCCTCC
ACCTTGCTCA TCTTCGCGCC CGGCGATCAC CTGGTCGTCT GCGAGGACGT CTACGGCGGA
ACCTTCCGCG CCTTGACCAC CGTATTGAAC GGATGGGGAC TGAACGCCAC CTTCGTGGAC
GCGACCGATA CGGCCGCCAT AGCCGCAGCC ATCAGGCCCG AGACCAAGGC GCTCTACCTG
GAGACCCCCT CCAACCCGCT CATCAAGATA ACCGATCTGA GGAAAGCCGC GGCGCTCGCC
AAGGAGAAGG GGATCCTCAC CATCGTCGAC AACACCTTCA TGACGCCGTA CCTGCAGCGT
CCGCTGGAAT TGGGTTGCGA CATCGTGCTC CACAGCGGCA CCAAGTTCCT GAACGGCCAC
TCCGACGTCC TCTGCGGCTT CGCCGTCGCC AAGGACCCGG CAGTAGGCCA AAGAATCCGC
TTCATCCAGA ACGCCTTCGG CGCAGTACTC GGGCCGCAGG ACTGCTGGTT GGTCTTGCGC
GGGCTGAAGA CCCTGAAAGT GCGCATGGAA GAGCACCAGG CAAGCGCCAA GGTGATCGTC
TCCTGGCTCA AGGAGCAAAA AGAGGTGGCG AAGATCTACT ACCCCGGGCT CCCCGAGCAC
CCGGGCTACG AGATCAACAA CGCGCAGTCC GCAGGCCCCG GCGCCGTGCT CTCCTTCGAG
CTGTCCAGCT ACGAAGTGAC CAGGAAGCTC CTGGAAGGGG TGAAGCTCTC CGCATTCGCC
GTCAGCCTGG GAGGGGTGGA AAGCATCCTC TCCTACCCCG CCAAGATGTC GCACGCCGCC
ATGCCCCCCG CCGAACGCGA GGCCAGGGGG ATCAAGGACA CGCTGGTGCG TCTGTCGGTA
GGCCTGGAAG ATCCCGAGGA CTTGATCGCC GACATGGAAA GCTTCCTCAG ATAA
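
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation such as a GC content check. The following is a minimal sketch of one way to do that. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the record does not say how its own GC content value was computed or rounded.

```python
def clean_sequence(text: str) -> str:
    """Drop all whitespace (block separators, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First printed row of the gene sequence, used here only as sample input.
first_row = "ATGAAATTCG CGACCGAACT GATACACGGC GGAGCCCCAA TAGACCCGGA GCACGGCTCC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing all rows of the gene sequence as one string gives the figure for the whole gene, which can then be compared with the GC content field.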
 
Protein sequence
MKFATELIHG GAPIDPEHGS LSHPIYLSST FAQRSMEHFG KYDYSRSGNP TREALEEAVA 
GLEKGAVGLA FASGMAAISS TLLIFAPGDH LVVCEDVYGG TFRALTTVLN GWGLNATFVD
ATDTAAIAAA IRPETKALYL ETPSNPLIKI TDLRKAAALA KEKGILTIVD NTFMTPYLQR
PLELGCDIVL HSGTKFLNGH SDVLCGFAVA KDPAVGQRIR FIQNAFGAVL GPQDCWLVLR
GLKTLKVRME EHQASAKVIV SWLKEQKEVA KIYYPGLPEH PGYEINNAQS AGPGAVLSFE
LSSYEVTRKL LEGVKLSAFA VSLGGVESIL SYPAKMSHAA MPPAEREARG IKDTLVRLSV
GLEDPEDLIA DMESFLR
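
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard NCBI ordering of bases and amino acids, which translation table 11 shares for internal codons. It assumes the sequence contains only A, C, G and T and that reading starts in frame; it does not handle the alternative start codons that table 11 permits, which is not an issue here because the gene begins with ATG. The names used are illustrative.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove block separators and newlines
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and the last printed row of the gene sequence.
print(translate("ATGAAATTCG CGACCGAACT GATACACGGC"))  # MKFATELIHG
print(translate("GGCCTGGAAG ATCCCGAGGA CTTGATCGCC GACATGGAAA GCTTCCTCAG ATAA"))  # GLEDPEDLIADMESFLR
```

The two outputs match the first ten and last seventeen residues of the protein sequence listed above.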