Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpAngola_A3484 |
Symbol | aroF |
ID | 5801960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | - |
Start bp | 3699561 |
End bp | 3700631 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641341301 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001607814 |
Protein GI | 162420963 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.242874 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAAAG ACTCGCTCAA TAACGTCCAT ATCAGTGCCG AACAAATCCT GATAACCCCG GAAGAACTGA AAAATCAGTT TCCACTGAGC GAAAATGATC AGTATTCGAT AGAGCGCGCA CGTAAAACCA TTGCTGACAT TATTCAGGGG CGAGATCCGC GTCTGTTGGT CGTTTGTGGG CCCTGTTCAA TTCATGATGT GGATGCGGCA CTGGATTACG CGCGTCGTTT GAAAAAACTC TCTGTGGAAT TGGATGACAG CTTATATATC GTTATGCGTG TCTATTTTGA GAAGCCAAGA ACTACCGTGG GTTGGAAAGG CCTGATCAAT GACCCTGCAA TGGATGGTTC ATTTGATGTA GAGGCAGGTT TACACATTGC CCGTCGTTTA TTGCTGGATT TAGTGGGCAT GGGGTTGCCG TTAGCGACTG AAGCTCTGGA TCCTAATAGC CCACAATATT TAGGTGACCT GTTCAGTTGG TCGGCCATTG GTGCCCGTAC AACGGAGTCA CAGACCCACC GTGAAATGGC ATCAGGCTTG TCTATGCCGG TTGGATTTAA AAATGGCACT GACGGTAGCC TAGGCACGGC AATCAATGCA ATGCGCGCCG CTGCCATGCC ACATCGCTTT ATGGGGATCA ATCAGTCGGG CCAGGTCTGC CTGTTACAAA CTCAGGGTAA CCCACACGGC CATGTCATTC TACGGGGAGG TAAAACACCA AACTACAGTG CACAAGATGT CGCTCAGTGT GAAAAACAGA TGCAGGATGC GGGACTCATC CCATCCTTAA TGATAGATTG CAGTCACGGT AATTCAAATA AAGACTACCG CCGTCAGGTT GCGGTGGCTG AATCTGTGGT TGAACAGATC AAGGCGGGCA ATCGTTCAAT TACAGGTGTG ATGCTGGAAA GCCACATCCA CGAAGGAAAT CAGTCATCTG AACAGCCACG TGCTGATATG CGCTACGGTG TTTCTGTGAC TGACGCCTGT ATTAACTGGG AAAGCACTGA AACCCTGTTA CGTGGTATGC GCCAAGAATT GCTTGCAGCA CTGACGGCAC GGACTGCATG A
|
Protein sequence | MQKDSLNNVH ISAEQILITP EELKNQFPLS ENDQYSIERA RKTIADIIQG RDPRLLVVCG PCSIHDVDAA LDYARRLKKL SVELDDSLYI VMRVYFEKPR TTVGWKGLIN DPAMDGSFDV EAGLHIARRL LLDLVGMGLP LATEALDPNS PQYLGDLFSW SAIGARTTES QTHREMASGL SMPVGFKNGT DGSLGTAINA MRAAAMPHRF MGINQSGQVC LLQTQGNPHG HVILRGGKTP NYSAQDVAQC EKQMQDAGLI PSLMIDCSHG NSNKDYRRQV AVAESVVEQI KAGNRSITGV MLESHIHEGN QSSEQPRADM RYGVSVTDAC INWESTETLL RGMRQELLAA LTARTA
|
| |