Gene GM21_2009 details

Gene Information

Locus tag: GM21_2009
Symbol:
ID: 8137343
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2329899
End bp: 2331047
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 64%
IMG OID: 644869622
Product: Cystathionine gamma-lyase
Protein accession: YP_003021819
Protein GI: 253700630
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases
TIGRFAM ID:
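
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, excluding one terminal stop codon (an assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2329899, 2331047)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1149 382
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields listed in the record.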


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 1.35589e-20
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGATCG CAACCGAAGC AGCACAGATA GGCCTGCAGC GCGACACCCG CACCGGTGCG 
GTGACCGTCC CCATCTATCA GACCGCCACC TTCCGCCACC CTGGGCTTGG GCAAAGCACC
GGCTACGACT ACAGCAGGTC GGGAAACCCG ACGCGCCAGG CGTTAGAGGA AGGGCTCGCG
GTGCTGGAGG GAGGGTGCCG CGGCTTCGCC TACGCCTCGG GGATGGCCGC CATCACGAGC
CTCATGTTCC TCTTCAAGCA GGGGGACCAC CTGATAGTGA CCGAGGACCT CTACGGCGGG
AGCTATAGGC TCTTCGAGAA GTTGTTCCAG CAGTTCGGGC TCAGCTTCAG CTACGTCGAT
ACGAGCGATA TCGAGCTGGT GCGCCAGGCC GTGAGGCCCA ACACCCGCGC GCTCTTCGTG
GAGTCGCTCA CCAACCCGCT TTTGAAGGTG GCCGACATTG AGCGACTGGC GGCCCTGTGC
AAAGAGCGCG AGATGCTCTG CATCGTGGAC AACACCTTCC TGACCCCGTA TCTCTTGCGC
TGTCTCGACC TGGGCGCCGA CATCACCGTC TATTCAGGGA GCAAGTACCT GGCCGGCCAC
AACGACACCG TCTGCGGCCT GGTGGCGGTG AAGGACCCGG CGCTCGCCGA GCAGGTCTAC
TTCCACCAAA ACGGCGCCGG CGCGGTCCTG GGCCCGCAGG ATTCCTGGCT CACCGTCAGG
GGGATCAAGA CCCTCACCAT CCGCATGGAC CGGCAGCAGG AAAACGCCCT TGCCATAGCT
AACTGGCTGA GCCTCCACCC GCAGGTGGTA AAGGTGCATT ATCCCGGCCT TCCCGATCAC
CCCGGGCACG AACTGATGAA ACAGCGCGGC AAAGGTTTCG GCGCCATGAT TGCCTTCGAG
GTGACCGAGC CGCACCTGGT CGACACGCTG CTGATGAAGA CCCGCCTGAT CTCCTTCGCC
GAGAGCCTGG GCGGGGTCGA GAGCCTGATC ACCTTCCCCG CGGTGCAGAC CCACGCGGAC
ATCGAGCCGG AGACGCTGAA GCGGCTCGGC ATCAACCATT CACTACTCCG CCTTTCGGTG
GGGATAGAAG ACAAGGACGA CCTGATCGCC GACCTGGCGC AGGCGTTCGC AGGAGGAGAA
CCGCAATGA
 
Protein sequence
MKIATEAAQI GLQRDTRTGA VTVPIYQTAT FRHPGLGQST GYDYSRSGNP TRQALEEGLA 
VLEGGCRGFA YASGMAAITS LMFLFKQGDH LIVTEDLYGG SYRLFEKLFQ QFGLSFSYVD
TSDIELVRQA VRPNTRALFV ESLTNPLLKV ADIERLAALC KEREMLCIVD NTFLTPYLLR
CLDLGADITV YSGSKYLAGH NDTVCGLVAV KDPALAEQVY FHQNGAGAVL GPQDSWLTVR
GIKTLTIRMD RQQENALAIA NWLSLHPQVV KVHYPGLPDH PGHELMKQRG KGFGAMIAFE
VTEPHLVDTL LMKTRLISFA ESLGGVESLI TFPAVQTHAD IEPETLKRLG INHSLLRLSV
GIEDKDDLIA DLAQAFAGGE PQ
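
The gene and protein sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal Python sketch of how the two could be compared: it strips the whitespace, translates the DNA codon by codon, and computes a GC fraction. It assumes that codon-to-amino-acid assignments under translation table 11 match the standard genetic code (table 11 differs in its permitted start codons, not in these assignments). The helper names `clean`, `translate`, and `gc_fraction` are illustrative only.

```python
from itertools import product

# Standard codon assignments, enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 60 bases of the gene sequence listed above.
first_line = "ATGAAGATCG CAACCGAAGC AGCACAGATA GGCCTGCAGC GCGACACCCG CACCGGTGCG"
print(translate(first_line))  # prints: MKIATEAAQIGLQRDTRTGA
```

The printed residues match the first twenty residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.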