Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2877 |
Symbol | cysN |
ID | 6143154 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2947190 |
End bp | 2948617 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641617746 |
Product | sulfate adenylyltransferase subunit 1 |
Protein accession | YP_001744901 |
Protein GI | 170683324 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2895] GTPases - Sulfate adenylate transferase subunit 1 |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR02034] sulfate adenylyltransferase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACCG CACTTGCACA ACAAATCGCC AATGAAGGCG GCGTCGAAGC CTGGATGATT GCGCAACAAC ATAAAAGCCT GCTGCGTTTT CTGACCTGTG GTAGCGTCGA TGACGGCAAA AGTACCCTGA TTGGTCGCCT GCTACACGAT ACCCGCCAAA TCTACGAAGA TCAGCTCTCA TCGCTGCATA ACGACAGTAA GCGTCACGGC ACCCAGGGCG AAAAGCTGGA TCTGGCACTG CTGGTGGACG GCCTGCAAGC TGAGCGCGAA CAGGGCATCA CTATTGATGT GGCCTACCGC TATTTCTCTA CCGAGAAGCG TAAATTTATT ATCGCCGACA CCCCAGGGCA CGAGCAGTAC ACCCGCAATA TGGCGACTGG CGCATCGACA TGTGAACTGG CGATCTTACT GATCGATGCC CGTAAAGGCG TGCTCGATCA AACCCGTCGT CACAGTTTTA TCTCCACACT GTTGGGGATC AAACATCTGG TCGTGGCGAT CAACAAAATG GATCTGGTGG ATTACAGTGA AGAGACGTTC ACCCGTATTC GTGAAGATTA TCTGACTTTT GCCGGGCAGC TGCCGGGTAA TCTGGATATC CGCTTTGTGC CGCTCTCCGC ACTGGAAGGC GACAACGTGG CTTCGCAAAG TGAAAGTATG CCGTGGTACA GCGGTCCGAC ACTGCTCGAA GTGCTGGAAA CCGTGGAGAT CCAGCGAGTG GTGGATGCAC AGCCTATGCG CTTCCCGGTG CAGTACGTTA ACCGCCCGAA TCTCGATTTT CGTGGCTACG CCGGAACGCT GGCATCCGGT CGCGTGGAAG TCGGGCAACG TGTAAAAGTG TTGCCCTCTG GTGTGGAATC AAACGTCGCG CGGATCGTGA CTTTTGATGG CGATCGCGAA GAAGCCTTTG CCGGAGAAGC GATCACCCTG GTGCTGACGG ATGAGATCGA CATCAGCCGT GGCGATCTGC TGCTGGCGGC AGACGAAGCG TTACCGGCTG TGCAGAGCGC GTCGGTGGAT GTGGTATGGA TGGCGGAACA GCCGCTTTCC CCAGGCCAGA GTTACGACAT CAAAATTGCC GGTAAGAAGA CGCGTGCTCG TGTTGATGGC ATTCGTTATC AGGTTGATAT CAATAACCTT ACCCAACGCG AAGTTGAAAA CCTGCCGCTG AACGGCATCG GCCTGGTAGA TCTCACCTTT GACGAGCCGC TGGTGTTAGA TCGTTATCAG CAAAACCCGG TTACCGGTGG GCTGATTTTT ATCGATCGCC TGAGTAATGT GACCGTAGGT GCCGGTATGG TGCACGAACC AGTTAGCCAG GCAACTGCTG CGCCATCTGA ATTCAGTGCA TTCGAACTGG AATTGAATGC CCTGGTTCGC CGCCATTTCC CGCATTGGGG TGCGCGCGAT TTACTGGGAG ATAAATAA
|
Protein sequence | MNTALAQQIA NEGGVEAWMI AQQHKSLLRF LTCGSVDDGK STLIGRLLHD TRQIYEDQLS SLHNDSKRHG TQGEKLDLAL LVDGLQAERE QGITIDVAYR YFSTEKRKFI IADTPGHEQY TRNMATGAST CELAILLIDA RKGVLDQTRR HSFISTLLGI KHLVVAINKM DLVDYSEETF TRIREDYLTF AGQLPGNLDI RFVPLSALEG DNVASQSESM PWYSGPTLLE VLETVEIQRV VDAQPMRFPV QYVNRPNLDF RGYAGTLASG RVEVGQRVKV LPSGVESNVA RIVTFDGDRE EAFAGEAITL VLTDEIDISR GDLLLAADEA LPAVQSASVD VVWMAEQPLS PGQSYDIKIA GKKTRARVDG IRYQVDINNL TQREVENLPL NGIGLVDLTF DEPLVLDRYQ QNPVTGGLIF IDRLSNVTVG AGMVHEPVSQ ATAAPSEFSA FELELNALVR RHFPHWGARD LLGDK
|
| |