Gene Information |
Locus tag | Avin_37080 |
Symbol | cysM |
ID | 7762602 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 3765857 |
End bp | 3766756 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643806575 |
Product | cysteine synthase B |
Protein accession | YP_002800829 |
Protein GI | 226945756 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases; [TIGR01138] cysteine synthase B |
|
|
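The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as below. This is a minimal illustration, not part of the record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon. The `GeneRecord` class and both helper functions are hypothetical names introduced only for this example.

```python
from dataclasses import dataclass


@dataclass
class GeneRecord:
    """Hypothetical container for the numeric fields of the record above."""
    start_bp: int
    end_bp: int
    gene_length_bp: int
    protein_length_aa: int


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends with a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information table for Avin_37080 (cysM).
record = GeneRecord(
    start_bp=3765857,
    end_bp=3766756,
    gene_length_bp=900,
    protein_length_aa=299,
)

coords_ok = span_length(record.start_bp, record.end_bp) == record.gene_length_bp
protein_ok = expected_protein_length(record.gene_length_bp) == record.protein_length_aa
print(coords_ok, protein_ok)
```

Under those assumptions, 3766756 - 3765857 + 1 gives the listed 900 bp, and 900 / 3 - 1 gives the listed 299 aa.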
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGTGC ATTTCCCCAC CATCGCCGAG TGCATCGGCA ATACCCCGCT GGTGCGCCTG CAGCGCCTGC CGGGAGAAAC CTCGAACACC CTGCTGGTCA AGCTGGAGGG CAACAATCCG GCCGGTTCGG TGAAGGACCG CCCGGCACTG TCGATGATCA CCCGCGCCGA GCTGCGCGGC GACATCCGCC CCGGCGATAC CCTGATCGAA GCCACTTCCG GTAACACCGG CATCGCCCTG GCCATGGCGG CGGCGATCAA GGGCTACCGG ATGATCCTGA TCATGCCGGA CAACATGAGC GCCGAGCGCA AGGCGGCGAT GACCGCCTAT GGCGCCGAGC TGCTGCTGGT GAGCAAGGCC GAAGGCATGG AGGGTGCCCG AGACCTGGCA CTGAAGATGC AAGGCGAGGG GCGCGGCAAG GTGCTCGACC AGTTCGGCAA CCAGGACAAC CCGCAGGCCC ACTACGAGGG TACCGGCCCG GAGATCTGGC GCCAGACCCA GGGCCGAATC ACCCACTTCA TCAGTTCCAT GGGCACCACC GGCACCATCA TGGGCACCTC GCGCTATCTC AAGGAGCAGA ACCCGGCGAT ACGCATCGTC GGCCTGCAGC CGCAGGAGGG CTCGGCGATT CCCGGCATCC GCCGCTGGCC GCAGGAGTAC CTGCCGAAGA TCTTCGATGC CTCCCGGGTC GATCAGGTGA TCGACATGGC GCAGAGCGAA GCCGAAGACA CCATGCGTCG CCTGGCCCGC GAGGAGGGCA TCTTCTGCGG CGTGTCCTCG GGCGGGGCGG TGGCGGCCAT GCTGCGCCTG TCGTGCGAGG TGGAGAATGC GGTGCTGGTG GCGATCATCT GCGACCGCGG CGACCGCTAT CTGTCCACGG GTGTCTATGA CCAGCCTTGA
|
Protein sequence | MTVHFPTIAE CIGNTPLVRL QRLPGETSNT LLVKLEGNNP AGSVKDRPAL SMITRAELRG DIRPGDTLIE ATSGNTGIAL AMAAAIKGYR MILIMPDNMS AERKAAMTAY GAELLLVSKA EGMEGARDLA LKMQGEGRGK VLDQFGNQDN PQAHYEGTGP EIWRQTQGRI THFISSMGTT GTIMGTSRYL KEQNPAIRIV GLQPQEGSAI PGIRRWPQEY LPKIFDASRV DQVIDMAQSE AEDTMRRLAR EEGIFCGVSS GGAVAAMLRL SCEVENAVLV AIICDRGDRY LSTGVYDQP
|
| |
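The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute GC content, and translate the gene sequence for comparison with the listed protein. It is an illustration under stated assumptions: the function names are hypothetical, the codon table uses the amino acid assignments of NCBI translation table 11 (which match the standard code), and alternative start codons are not treated specially. Only the first and last blocks of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G at each position).
# Translation table 11 shares these assignments with the standard code;
# it differs only in the set of permitted start codons, which this
# sketch does not model.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at a stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three and last two display blocks of the gene sequence above.
head = clean_sequence("ATGACAGTGC ATTTCCCCAC CATCGCCGAG")
tail = clean_sequence("GTGTCTATGA CCAGCCTTGA")

print(translate(head))          # compare with the first protein block
print(translate(tail[8:]))      # last three codons plus the stop codon
print(gc_fraction(head[:10]))   # GC fraction of the first block only
```

Applying `clean_sequence`, `gc_fraction`, and `translate` to the full gene sequence would allow the GC content and protein sequence fields to be checked against the record in the same way.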