Gene Information |
Locus tag | SNSL254_A0109 |
Symbol | araA |
ID | 6483191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 119861 |
End bp | 121363 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642735551 |
Product | L-arabinose isomerase |
Protein accession | YP_002039333 |
Protein GI | 194446528 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2160] L-arabinose isomerase |
TIGRFAM ID | |
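The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is the only reading that reproduces the listed gene length) and that the CDS ends with a single stop codon. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the record above.
START_BP = 119861
END_BP = 121363


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1503, matches "Gene Length"
print(protein_length(length_bp))  # 500, matches "Protein Length"
```

The strand field does not enter this calculation, because the length is the same whichever strand the gene lies on.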
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 0.309204 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACGATTT TTGATAATTA TGAAGTATGG TTTGTGATTG GCAGCCAGCA TTTGTATGGC GCAGAAACCC TGCGTCAGGT CACCCAACAT GCCGAGCATG TGGTCAACGC GCTGAATACC GAAGCCAAAC TGCCATGTAA ACTGGTATTA AAACCGCTGG GCACCTCGCC GGATGAGATT ACCGCCATTT GTCGTGACGC CAATTATGAC GATCGCTGCG CAGGGCTGGT GGTCTGGCTG CACACCTTCT CCCCGGCCAA AATGTGGATC AACGGGCTGA GTATCCTTAA CAAACCACTA CTGCAATTCC ATACCCAATT TAACGCCGCC CTGCCGTGGG ACAGCATTGA TATGGACTTT ATGAACCTGA ACCAGACTGC GCACGGCGGC CGCGAGTTCG GTTTTATCGG CGCGCGGATG CGTCAGCAGC ACGCGGTCGT CACTGGCCAC TGGCAGGATA AAGAGGCCCA TACGCGTATC GGCGCCTGGA TGCGCCAGGC GGTCTCTAAA CAGGATACCC GCCAGCTAAA AGTCTGCCGC TTCGGCGACA ATATGCGTGA AGTCGCAGTG ACTGACGGTG ATAAAGTGGC CGCGCAAATC AAATTTGGCT TTTCGGTCAA TACCTGGGCG GTCGGCGATC TGGTGCAGGT GGTGAATTCT ATCGGCGACG GCGATATCAA CGCTCTGATT GACGAGTATG AAAGCAGCTA TACCCTGACG CCCGCCACCC AAATCCACGG CGATAAACGC CAGAACGTGC GGGAGGCGGC GCGTATTGAA CTCGGTATGA AGCGTTTCCT GGAACAGGGC GGCTTCCACG CATTCACTAC TACCTTTGAA GATTTACACG GTCTGAAACA GCTTCCGGGT CTGGCCGTAC AGCGTCTGAT GCAGCAAGGC TACGGCTTTG CGGGCGAAGG CGACTGGAAA ACCGCCGCTC TGCTTCGCAT TATGAAAGTG ATGTCAACCG GTCTGCAGGG CGGCACCTCA TTTATGGAGG ATTACACCTA CCACTTCGAG AAAGGCAACG ATCTGGTGCT CGGCTCGCAC ATGCTGGAAG TGTGTCCGTC CATCGCGGTG GAAGAGAAAC CGATCCTCGA CGTCCAGCAC CTCGGCATTG GCGGCAAGGA AGATCCGGCG CGTTTGATTT TCAATACCCA AACCGGCCCG GCGATCGTCG CCAGCCTGAT CGACCTCGGC GATCGTTATC GCCTGCTGGT CAACTGCATT GACACCGTAA AAACGCCGCA CTCCCTGCCG AAACTGCCGG TGGCTAACGC GCTGTGGAAG GCGCAGCCGG ATCTGCCGAC CGCCTCCGAA GCGTGGATTC TGGCTGGCGG CGCGCACCAT ACCGTCTTCA GCCACGCGCT GGATCTGAAC GATATGCGCC AGTTTGCAGA AATACACGAT ATCGAAATCG CGGTGATTGA TAACGATACC CATCTGCCGG CCTTTAAGGA CGCGCTGCGC TGGAACGAGG TGTATTACGG GTTCAAACGT TAA
Protein sequence | MTIFDNYEVW FVIGSQHLYG AETLRQVTQH AEHVVNALNT EAKLPCKLVL KPLGTSPDEI TAICRDANYD DRCAGLVVWL HTFSPAKMWI NGLSILNKPL LQFHTQFNAA LPWDSIDMDF MNLNQTAHGG REFGFIGARM RQQHAVVTGH WQDKEAHTRI GAWMRQAVSK QDTRQLKVCR FGDNMREVAV TDGDKVAAQI KFGFSVNTWA VGDLVQVVNS IGDGDINALI DEYESSYTLT PATQIHGDKR QNVREAARIE LGMKRFLEQG GFHAFTTTFE DLHGLKQLPG LAVQRLMQQG YGFAGEGDWK TAALLRIMKV MSTGLQGGTS FMEDYTYHFE KGNDLVLGSH MLEVCPSIAV EEKPILDVQH LGIGGKEDPA RLIFNTQTGP AIVASLIDLG DRYRLLVNCI DTVKTPHSLP KLPVANALWK AQPDLPTASE AWILAGGAHH TVFSHALDLN DMRQFAEIHD IEIAVIDNDT HLPAFKDALR WNEVYYGFKR
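The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a sketch under stated assumptions, not the tool that generated the record: it uses only the Python standard library, builds the codon table from the conventional `TCAG` ordering (translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons), and assumes the gene sequence is listed in coding orientation, as its leading `ATG` and trailing `TAA` suggest. The helpers `clean`, `translate`, and `gc_percent` are hypothetical names.

```python
from itertools import product

# Codon table in the conventional TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGACGATTT TTGATAATTA TGAAGTATGG"))  # MTIFDNYEVW
```

Passing the full gene sequence to `translate` and `gc_percent` allows the result to be compared against the protein sequence and GC content fields of the record.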