Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2237 |
Symbol | |
ID | 8137576 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2610857 |
End bp | 2611810 |
Gene Length | 954 bp |
Protein Length | 317 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644869852 |
Product | cysteine synthase A |
Protein accession | YP_003022044 |
Protein GI | 253700855 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases [TIGR01139] cysteine synthase A |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 0.425208 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCGTA TCTACCTAGA CAACTCCCAG TCCATCGGCA ACACGCCGCT GGTGCGGCTG AACCATGTAA CCAAGGGGGC CAAGGCGACC GTGCTCGCCA AGGTCGAGGG GAGAAACCCC GCCTACTCGG TGAAGTGCCG CATCGGCGCC AACATGATCT GGGATGCCGA GGAGCGTGGC GTACTGAAGC CTGGGGTGGA GATCGTTGAG CCCACCAGCG GCAACACCGG CATAGCGCTT GCCTACGTGG CGGCGGCCCG CGGTTACAAG CTGACCCTCA CCATGCCCGA GACCATGAGC ATCGAGCGCC GCAGGGTGCT GGCGGCATTG GGGGCTAACC TGATCCTTAC CCCGGGTTCC GCGGGGATGA AAGGGGCCGT GGCCAAGGCC GAGGAGATCG CAGCTTCCGA CCCGGCGCGC TACTTCCTGC CGCAGCAGTT CAAGAACCCG GCCAATCCCG CCATCCACGA GAAGACGACT GGACCGGAAA TCTGGGCCGA CACCGACGGC GCCGTCGACG TCATCGTCGC CGGCGTAGGT ACCGGCGGCA CCATCTCCGG GATCGCCCGC TATCTTAAGC AGACCAAGGG GAAGCAGGTC GTCGCGGTTG CGGTAGAGCC AAAGGAGAGC CCGGTAATCA GCCAGAAGCT AGCCGGCCAG GAACTCAAGC CCGGCCCGCA CAAGATCCAG GGAATCGGCG CCGGTTTCAT CCCCGATACC CTCGACCTCT CCGTCATCGA CCGGGTCGAG CAGGTAGACA GCAACGAGGC CTTGGAGTTC GCCAAGCGCC TCACCAAGGA AGAGGGGTTG CTGGTCGGCA TCTCCAGCGG CGCCGCGGTT GCCGCCGCCG TCCGCCTGGC CAACCTGCAG GAATTCGCCG GCAAGACCAT CGTGGTGGTG CTCCCCGACT TGGCCGAGCG CTATCTCTCC ACCGCACTCT TCGAAGAAGC CTGA
|
Protein sequence | MSRIYLDNSQ SIGNTPLVRL NHVTKGAKAT VLAKVEGRNP AYSVKCRIGA NMIWDAEERG VLKPGVEIVE PTSGNTGIAL AYVAAARGYK LTLTMPETMS IERRRVLAAL GANLILTPGS AGMKGAVAKA EEIAASDPAR YFLPQQFKNP ANPAIHEKTT GPEIWADTDG AVDVIVAGVG TGGTISGIAR YLKQTKGKQV VAVAVEPKES PVISQKLAGQ ELKPGPHKIQ GIGAGFIPDT LDLSVIDRVE QVDSNEALEF AKRLTKEEGL LVGISSGAAV AAAVRLANLQ EFAGKTIVVV LPDLAERYLS TALFEEA
|
| |