Gene Information |
Locus tag | ECH74115_4003 |
Symbol | cysN |
ID | 6970539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3698912 |
End bp | 3700339 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643387771 |
Product | sulfate adenylyltransferase subunit 1 |
Protein accession | YP_002272214 |
Protein GI | 209400153 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2895] GTPases - Sulfate adenylate transferase subunit 1 |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain; [TIGR02034] sulfate adenylyltransferase, large subunit |
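The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes inclusive, 1-based coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one stop codon at the 3' end."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length(3698912, 3700339)
print(length)                           # 1428
print(expected_protein_length(length))  # 475
```

Both results agree with the Gene Length (1428 bp) and Protein Length (475 aa) fields listed above.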
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACCG CACTTGCACA ACAAATCGCC AATGAAGGCG GCGTCGAAGC CTGGATGATT GCGCAACAAC ATAAAAGCCT GCTGCGTTTT CTGACCTGTG GTAGCGTCGA TGACGGCAAA AGTACCCTGA TTGGTCGTCT GTTGCACGAT ACCCGCCAAA TCTATGAAGA TCAGCTCTCA TCGCTGCATA ATGACAGTAA GCGTCACGGC ACCCAGGGCG AAAAGCTGGA TCTGGCTCTG CTGGTGGACG GCCTGCAAGC TGAGCGTGAA CAGGGCATCA CCATTGACGT GGCCTACCGC TATTTCTCTA CCGAGAAGCG TAAATTTATT ATCGCCGACA CTCCAGGGCA CGAGCAGTAC ACCCGCAATA TGGCGACTGG CGCATCGACA TGTGAACTGG CGATCTTACT GATCGATGCC CGTAAAGGCG TGCTCGATCA AACCCGTCGT CACAGTTTTA TCTCCACACT GTTGGGGATC AAACATCTGG TCGTGGCGAT CAACAAAATG GATCTGGTGG ATTACAGTGA AGAGACGTTC ACCCGTATTC GTGAAGATTA TCTGACCTTT GCCGGGCAGC TGCCGGGTAA TCTGGATATC CGCTTTGTGC CGCTCTCCGC ACTGGAAGGT GACAACGTGG CTTCGCAAAG TGAAAGTATG CCGTGGTACA GCGGTCCGAC ACTGCTCGAA GTGCTGGAAA CCGTGGAGAT TCAGCGAGTG GTGGATGCCC AGCCAATGCG CTTCCCGGTG CAGTACGTTA ACCGCCCAAA TCTCGATTTT CGTGGCTACG CCGGAACGCT GGCATCCGGT CGCGTGGAAG TCGGGCAACG TGTAAAAGTG CTGCCCTCTG GTGTGGAATC AAACGTCGCG CGGATCGTGA CTTTTGATGG TGATCGCGAA GAAGCCTTTG CCGGAGAAGC TATCACCCTG GTGCTGACGG ATGAGATCGA CATCAGCCGT GGCGATCTGC TGCTGGCGGC AGACGAAGCG TTACCAGCTG TGCAGAGCGC GTCGGTGGAT GTGGTATGGA TGGCGGAACA GCCGCTTTCC CCGGGCCAGA GTTACGACAT CAAAATTGCC GGTAAGAAGA CGCGTTCTCG TGTTGATGGC ATTCGTTATC AGGTTGATAT TAATAACCTT ACCCAACGCG AAGTTGAAAA CCTGCCGCTG AACGGCATCG GCCTGGTGGA TCTCACTTTT GACGAGCCAC TGGTGTTAGA TCGTTATCAG CAAAACCCGG TTACCGGTGG GCTGATTTTT ATCGATCGCC TGAGCAATGT GACCGTAGGT GCCGGTATGG TGCATGAGCC AGTTAGCCAG GCAACTGCTG CGCCATCTGA ATTTAGTGCA TTCGAACTGG AATTGAATGC CCTGGTTCGC CGCCACTTTC CACACTGGGG CGCGCGCGAT TTGCTGGGGG AGAAATAA
Protein sequence | MNTALAQQIA NEGGVEAWMI AQQHKSLLRF LTCGSVDDGK STLIGRLLHD TRQIYEDQLS SLHNDSKRHG TQGEKLDLAL LVDGLQAERE QGITIDVAYR YFSTEKRKFI IADTPGHEQY TRNMATGAST CELAILLIDA RKGVLDQTRR HSFISTLLGI KHLVVAINKM DLVDYSEETF TRIREDYLTF AGQLPGNLDI RFVPLSALEG DNVASQSESM PWYSGPTLLE VLETVEIQRV VDAQPMRFPV QYVNRPNLDF RGYAGTLASG RVEVGQRVKV LPSGVESNVA RIVTFDGDRE EAFAGEAITL VLTDEIDISR GDLLLAADEA LPAVQSASVD VVWMAEQPLS PGQSYDIKIA GKKTRSRVDG IRYQVDINNL TQREVENLPL NGIGLVDLTF DEPLVLDRYQ QNPVTGGLIF IDRLSNVTVG AGMVHEPVSQ ATAAPSEFSA FELELNALVR RHFPHWGARD LLGEK
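The relationship between the gene and protein sequences can be illustrated with a small translation routine. This is a minimal sketch, not the pipeline that produced the record: it assumes the gene sequence is given in coding orientation (it begins with ATG even though the gene lies on the minus strand), strips the 10-base grouping spaces, and uses the amino acid assignments of translation table 11, which are the same as the standard code. Table 11 also permits alternative start codons, which this sketch does not handle since the gene here starts with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (the record groups bases in blocks of 10) and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGAACACCG CACTTGCACA ACAAATCGCC"))  # MNTALAQQIA
```

Passing the full Gene sequence value to `translate` should reproduce the Protein sequence field once the spaces in that field are removed, and `gc_fraction` on the same string can be compared with the GC content field.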