Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0297 |
Symbol | proA |
ID | 6146166 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 305271 |
End bp | 306524 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615194 |
Product | gamma-glutamyl phosphate reductase |
Protein accession | YP_001742403 |
Protein GI | 170682591 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0014] Gamma-glutamyl phosphate reductase |
TIGRFAM ID | [TIGR00407] gamma-glutamyl phosphate reductase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.000952781 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTGGAAC AAATGGGCAT TGCCGCGAAG CAAGCCTCTT ATAAATTAGC GCAACTCTCC AGCCGCGAAA AAAATCGCGT GCTGGAAAAA ATCGCCGATG AACTGGAAGC ACAAAGCGAA ATCATCCTCA ACGCTAACGC CCAGGATGTT GCTGACGCGC GTGCCAATGG CCTTAGCGAA GCGATGCTTG ACCGTCTGGC ACTGACGCCC GCACGGCTGA AAGGCATTGC CGATGATGTG CGCCAGGTGT GCAACCTCGC CGATCCGGTG GGGCAGGTGA TTGATGGTGG TGTGCTGGAC AGCGGTCTGC GTCTGGAGCG TCGTCGCGTA CCGCTGGGGG TGATTGGCGT GATTTATGAA GCTCGCCCGA ACGTGACAGT AGATGTCGCT TCGCTGTGCC TGAAAACCGG TAACGCAGTG ATCCTGCGTG GTGGCAAAGA AACCTGTCGC ACTAACGCTG CAACGGTGGC GGTGATTCAG GACGCCCTGA AATCCTGCGA TTTACCGGCA GGCGCAGTAC AGGCGATTGA TAATCCTGAC CGCGCGCTGG TCAGTGAAAT GCTGCGTATG GATAAATACA TCGACATGCT GATTCCACGC GGCGGGGCTG GGTTGCATAA ACTGTGCCGC GAGCAGTCGA CGATTCCGGT AATCACAGGT GGTATAGGCG TATGCCATAT TTACGTTGAT GAAAGTGCAG AGATCGCTGA AGCATTGAAA GTGATTGTCA ACGCGAAAAC TCAGCGCCCG AGCACGTGTA ATACGGTAGA AACGTTGCTG GTAAATAAAA ACATCGCAGA TAGCTTCCTG CCCGCATTAA GCAAGCAAAT GGCGGAAAGT GGCGTGACGT TACACGCAGA TGCTGCTGCG CTGGCGCAAT TGCAGGCAGG CCCCGCGAAG GTGGTGGCTG TTAAAGCCGA AGAGTATGAC GATGAGTTCC TGTCATTAGA CTTGAACGTC AAAATCGTCA GCGATCTTGA CGATGCTATC GCCCATATTC GTGAACACGG CACGCAACAC TCCGATGCGA TCCTGACCCG CGATATGCGC AACGCCCAGC GTTTTGTTAA TGAAGTAGAT TCATCCGCTG TTTACGTTAA CGCCTCTACG CGTTTTACCG ATGGCGGCCA GTTTGGTCTG GGAGCGGAAG TGGCGGTAAG CACACAAAAA CTCCACGCGC GTGGCCCGAT GGGCCTGGAA GCACTGACCA CTTACAAGTG GATCGGCATT GGTGATTACA CCATTCGTGC GTAA
|
Protein sequence | MLEQMGIAAK QASYKLAQLS SREKNRVLEK IADELEAQSE IILNANAQDV ADARANGLSE AMLDRLALTP ARLKGIADDV RQVCNLADPV GQVIDGGVLD SGLRLERRRV PLGVIGVIYE ARPNVTVDVA SLCLKTGNAV ILRGGKETCR TNAATVAVIQ DALKSCDLPA GAVQAIDNPD RALVSEMLRM DKYIDMLIPR GGAGLHKLCR EQSTIPVITG GIGVCHIYVD ESAEIAEALK VIVNAKTQRP STCNTVETLL VNKNIADSFL PALSKQMAES GVTLHADAAA LAQLQAGPAK VVAVKAEEYD DEFLSLDLNV KIVSDLDDAI AHIREHGTQH SDAILTRDMR NAQRFVNEVD SSAVYVNAST RFTDGGQFGL GAEVAVSTQK LHARGPMGLE ALTTYKWIGI GDYTIRA
|
| |