Gene Information |
Locus tag | B21_02566 |
Symbol | cysN |
ID | 8113391 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 2713170 |
End bp | 2714597 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644848764 |
Product | hypothetical protein |
Protein accession | YP_003000337 |
Protein GI | 251786033 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2895] GTPases - Sulfate adenylate transferase subunit 1 |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR02034] sulfate adenylyltransferase, large subunit |
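
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption about this record's convention, not something the record states) and that the CDS ends in a single stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2713170
end_bp = 2714597

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is assumed to be a stop codon
# that does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1428
print(gene_length % 3)  # 0, i.e. a whole number of codons
print(protein_length)   # 475
```

Under these assumptions the result agrees with the listed Gene Length (1428 bp) and Protein Length (475 aa).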
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACCG CACTTGCACA ACAAATCGCC AATGAAGGCG GCGTCGAAGC CTGGATGATT GCGCAACAAC ATAAAAGCCT GCTGCGTTTT CTGACCTGTG GTAGCGTCGA TGACGGCAAA AGTACTCTGA TTGGTCGCCT GCTGCACGAT ACCCGCCAAA TCTACGAAGA TCAGCTCTCA TCGCTGCATA ACGACAGTAA ACGTCACGGC ACCCAGGGCG AAAAGCTGGA TCTGGCACTG CTGGTGGACG GCCTGCAAGC TGAGCGCGAA CAGGGCATCA CCATTGACGT GGCTTACCGC TATTTCTCTA CCGAGAAGCG TAAATTTATT ATCGCCGACA CCCCAGGGCA CGAGCAGTAC ACCCGCAATA TGGCGACTGG CGCATCGACA TGTGAACTGG CGATCTTACT GATCGATGCC CGTAAAGGCG TGCTCGATCA AACCCGTCGT CACAGTTTTA TCTCCACACT GTTGGGGATC AAACATCTGG TCGTGGCGAT CAACAAAATG GATCTGGTGG ATTACAGTGA AAAGACGTTC ACCCGTATTC GTGAAGATTA TCTGACCTTT GCCGGGCAGC TGCCGGGTAA TCTGGATATC CGCTTTGTGC CGCTCTCTGC ACTGGAAGGC GACAACGTGG CTTCGCAAAG TGAAAGTATG GCGTGGTACA GCGGTCCAAC ACTGCTCGAA GTGCTGGAAA CCGTGGAGAT CCAGCGAGTG GTGGATGCTC AGCCAATGCG CTTCCCGGTG CAGTACGTTA ATCGCCCGAA TCTCGATTTT CGTGGTTACG CCGGAACGCT GGCATCCGGT CGCGTAGAAG TCGGGCAACG AGTCAAAGTG CTGCCCTCTG GTGTGGAATC AAACGTCGCC CGGATCGTGA CTTTTGATGG CGATCGCGAA GAAGCCTTTG CCGGAGAAGC GATCACCCTG GTGCTGACGG ATGAGATCGA CATCAGCCGT GGCGATCTGC TGCTGGCGGC AGACGAAGCG TTACCAGCTG TGCAGAGCGC GTCGGTGGAT GTGGTATGGA TGGCGGAACA GCCGCTTTCC CCGGGCCAGA GTTATGACAT CAAAATTGCC GGTAAGAAGA CGCGTGCTCG TGTTGATGGC ATTCGTTATC AGGTTGATAT TAATAACCTT ACCCAACGCG AAGTTGAAAA CCTGCCATTG AATGGGATCG GCCTCGTGGA TCTCACTTTT GACGAGCCGC TGGTGTTAGA TCGTTATCAA CAAAATCCGG TGACGGGTGG GCTGATTTTT ATCGATCGCC TGAGCAATGT GACCGTGGGT GCCGGTATGG TGCACGAGCC AGTTAGCCAG GCAACTGCTG CGCCATCTGA ATTCAGTGCA TTCGAACTGG AATTGAATGC TCTGGTTCGT CGCCACTTTC CGCACTGGGG CGCGCGCGAT TTGCTGGGGG ATAAATAA
|
Protein sequence | MNTALAQQIA NEGGVEAWMI AQQHKSLLRF LTCGSVDDGK STLIGRLLHD TRQIYEDQLS SLHNDSKRHG TQGEKLDLAL LVDGLQAERE QGITIDVAYR YFSTEKRKFI IADTPGHEQY TRNMATGAST CELAILLIDA RKGVLDQTRR HSFISTLLGI KHLVVAINKM DLVDYSEKTF TRIREDYLTF AGQLPGNLDI RFVPLSALEG DNVASQSESM AWYSGPTLLE VLETVEIQRV VDAQPMRFPV QYVNRPNLDF RGYAGTLASG RVEVGQRVKV LPSGVESNVA RIVTFDGDRE EAFAGEAITL VLTDEIDISR GDLLLAADEA LPAVQSASVD VVWMAEQPLS PGQSYDIKIA GKKTRARVDG IRYQVDINNL TQREVENLPL NGIGLVDLTF DEPLVLDRYQ QNPVTGGLIF IDRLSNVTVG AGMVHEPVSQ ATAAPSEFSA FELELNALVR RHFPHWGARD LLGDK
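
The gene and protein sequences above are related through translation table 11, and the GC content field is a simple base count over the gene sequence. The sketch below illustrates both calculations on short fragments of the gene sequence. It is not the database's own method. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the sketch relies on table 11 sharing its codon-to-amino-acid assignments with the standard genetic code (the two differ in permitted start codons, which is not handled here).

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard table; alternative
# start codons are NOT treated specially in this sketch.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spacing between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two 10-base blocks of the gene sequence; the trailing partial codon is ignored.
print(translate("ATGAACACCG CACTTGCACA"))  # MNTALA
# GC percentage of the first 10-base block only, not of the whole gene.
print(gc_content("ATGAACACCG"))            # 50.0
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the listed protein sequence and the GC content field. The fragment results shown here make no claim about the whole-gene values.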