Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2889 |
Symbol | cysN |
ID | 5591918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2891411 |
End bp | 2892838 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640922006 |
Product | sulfate adenylyltransferase subunit 1 |
Protein accession | YP_001459517 |
Protein GI | 157162199 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2895] GTPases - Sulfate adenylate transferase subunit 1 |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR02034] sulfate adenylyltransferase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 52 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACCG CACTTGCACA ACAAATCGCC AATGAAGGCG GCGTCGAAGC CTGGATGATT GCGCAACAAC ATAAAAGCCT GCTGCGTTTT CTGACCTGTG GTAGCGTCGA TGACGGCAAA AGTACCCTGA TTGGTCGTCT GCTGCACGAT ACCCGCCAAA TCTATGAAGA TCAGCTCTCA TCGCTGCATA ACGACAGTAA GCGTCACGGC ACCCAGGGCG AAAAGCTGGA TCTGGCTCTG CTGGTGGACG GCCTGCAAGC TGAGCGCGAA CAGGGCATCA CCATTGACGT GGCCTACCGC TATTTCTCTA CCGAGAAGCG TAAATTTATT ATCGCCGACA CCCCAGGGCA CGAGCAGTAC ACCCGCAATA TGGCGACTGG CGCATCGACA TGTGAACTGG CGATCTTACT GATCGATGCC CGTAAAGGCG TGCTCGATCA AACCCGTCGT CACAGTTTTA TCTCCACACT GTTGGGGATC AAACATCTGG TCGTGGCGAT CAACAAAATG GATCTGGTGG ATTACAGTGA AGAGACGTTC ACCCGTATTC GTGAAGATTA TTTGACCTTT GCCGGGCAGC TGCCGGGTAA TCTGGATATC CGCTTTGTGC CGCTCTCTGC ACTGGAAGGC GACAACGTGG CATCGCAAAG TGAAAGTATG CCGTGGTACA GCGGTCCGAC ACTGCTCGAA GTGCTGGAAA CCGTGGAGAT CCAGCGAGTG GTGGATGCTC AGCCAATGCG CTTCCCGGTG CAGTACGTTA ACCGCCCAAA TCTCGATTTT CGTGGCTACG CCGGAACGCT GGCATCCGGT CGCGTGGAAG TCGGGCAACG TGTAAAAGTG CTGCCCTCTG GTGTGGAATC AAACGTCGCG CGGATCGTGA CTTTTGATGG TGATCGCGAA GAAGCCTTTG CCGGAGAAGC GATCACCCTG GTGCTGACGG ATGAGATCGA CATCAGCCGT GGCGATCTGC TGCTGGCGGC AGACGAAGCG TTACCAGCTG TGCAGAGCGC GTCGGTGGAT GTGGTATGGA TGGCGGAACA GCCGCTTTCC CCGGGCCAGA GTTATGACAT CAAAATTGCC GGTAAGAAGA CGCGTGCTCG TGTTGATGGC ATTCGTTATC AGGTTGATAT TAATAACCTT ACCCAACGCG AAGTTGAAAA CCTGCCGCTG AACGGCATCG GCCTGGTGGA TCTCACTTTT GACGAGCCAC TGGTGTTAGA TCGTTATCAG CAAAATCCGG TTACCGGTGG GCTGATTTTT ATCGATCGCC TGAGCAATGT GACCGTGGGT GCCGGTATGG TGCACGAGCC AGTTAGCCAG GCAACTGCTG CGCCATCTGA ATTCAGTGTA TTCGAACTGG AATTGAATGC TCTGGTTCGT CGCCACTTTC CGCACTGGGG CGCGCGCGAT TTGCTGGGGG ATAAATAA
|
Protein sequence | MNTALAQQIA NEGGVEAWMI AQQHKSLLRF LTCGSVDDGK STLIGRLLHD TRQIYEDQLS SLHNDSKRHG TQGEKLDLAL LVDGLQAERE QGITIDVAYR YFSTEKRKFI IADTPGHEQY TRNMATGAST CELAILLIDA RKGVLDQTRR HSFISTLLGI KHLVVAINKM DLVDYSEETF TRIREDYLTF AGQLPGNLDI RFVPLSALEG DNVASQSESM PWYSGPTLLE VLETVEIQRV VDAQPMRFPV QYVNRPNLDF RGYAGTLASG RVEVGQRVKV LPSGVESNVA RIVTFDGDRE EAFAGEAITL VLTDEIDISR GDLLLAADEA LPAVQSASVD VVWMAEQPLS PGQSYDIKIA GKKTRARVDG IRYQVDINNL TQREVENLPL NGIGLVDLTF DEPLVLDRYQ QNPVTGGLIF IDRLSNVTVG AGMVHEPVSQ ATAAPSEFSV FELELNALVR RHFPHWGARD LLGDK
|
| |