Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SO_4374 |
Symbol | |
ID | 1171976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella oneidensis MR-1 |
Kingdom | Bacteria |
Replicon accession | NC_004347 |
Strand | + |
Start bp | 4567930 |
End bp | 4569495 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637346100 |
Product | histidine ammonia-lyase, putative |
Protein accession | NP_719898 |
Protein GI | 24375855 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase [TIGR01226] phenylalanine ammonia-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCACG TAGTTACTCA AACGACAACC GTTGAGTCGC CAATCGAATT TGGCCGTCAG TTACTCACAT TAGAGCAGGT CGTCGCCGTG GCTAAGGGCG CAAAGGTCAA ACTCTGTGAT GATGCCGATT ATCAAGAATA TATCCAAAAG GGCGCCCGCT TTATCGATAG TTTGCTGCAC GAAGAAGGCG TGGTCTACGG CGTCACCACA GGCTATGGCG ACTCTTGCAC AGTGAATGTG AGTCTTGACT TGGTCCACGA GTTGCCGCTG CACTTATCCC GTTTTCATGG TTGTGGCCTT GGTGAAGTCT TAAGCGTAAT GCAAGCGCGC GCCGTGATGG CTTGCCGTTT AAACTCCCTC GCCATTGGCA AATCCGGCGT GACATATGAG CTATTAAAGC GCATCCAAAC CTTGCTTAAT CTCAATATCG TGCCAGTGAT CCCCGAAGAA GGCTCAGTCG GTGCCAGCGG AGACTTAACG CCACTGTCTT ACCTTGCCGC CGTGCTGGTT GGTGAGCGTG AGGTGATTTA CCAAGGCGAG CGCCGAGCCA CCAAAGAGGT TTATCACGAG CTGAATATCA CGCCCCATGT GCTACGTCCC AAGGAAGGTT TAGCCCTGAT GAACGGCACG GCAGTGATGA CAGCGTTAGC CTGTTTAGCC TTTGATCGCG CACAATATTT AGCGCGTTTA GCCAGCCGCA TTACCGCCAT GGCGTCGTTA ACCCTTAAAG GCAACTCCAA CCATTTCGAC GATATTCTGT TTGCCGCCAA ACCTCATCCA GGGCAAAACC AAATCGCCAC TTGGATACGG GAAGACTTGA ACCACCATGT TCACCCGCGC AATTCCGACA GATTGCAGGA CAGATATTCC ATCCGCTGTG CGCCGCATAT TATTGGCGTG CTGCAGGATG CGCTGCCCTT TATGCGCCAA TTTATCGAAA CCGAAGTTAA CAGCGCCAAC GACAACCCGA TTGTCGATGC TGAAGGCGAG CATATTCTCC ATGGCGGCCA TTTTTACGGC GGGCATATCG CCTTTGCGAT GGACTCCTTA AAAAATATTG TGGCCAATAT CGCCGATCTG ATTGATCGCC AAATGGCATT AGTGATGGAC CCTAAATTTA ACAACGGCTT ACCCGCTAAC CTTTCGGGTT CAACCGGGCC ACGCCGCGCC ATCAACCATG GCTTTAAGGC GGTGCAAATC GGCGTTTCAG CCTGGACGGC AGAAGCACTG AAACACACTA TGCCCGCAAG CGTTTTCTCA CGCTCAACCG AATGCCACAA CCAAGATAAA GTCAGCATGG GCACCATTGC CGCCCGCGAC TGTATGCGTG TATTGCAACT AACGGAACAA GTCGCCGCCG CGGCGCTACT TGCTATGACT CAAGGCATTG ATCTGCGTAT CACACAAAAC GAGTTAGACG AAGCCTCACT GACGCCATCA CTGGCGACCA CGCTCGCCCA AGTGCGCGCT GACTTTGAGC CATTAGTCGA AGACAGACCG CTCGAAGCCG TGCTACGCCA AACCGTCGCT AAAATCCAAG CCGGTGAATG GGAAGTGTGC CGATGA
|
Protein sequence | MSHVVTQTTT VESPIEFGRQ LLTLEQVVAV AKGAKVKLCD DADYQEYIQK GARFIDSLLH EEGVVYGVTT GYGDSCTVNV SLDLVHELPL HLSRFHGCGL GEVLSVMQAR AVMACRLNSL AIGKSGVTYE LLKRIQTLLN LNIVPVIPEE GSVGASGDLT PLSYLAAVLV GEREVIYQGE RRATKEVYHE LNITPHVLRP KEGLALMNGT AVMTALACLA FDRAQYLARL ASRITAMASL TLKGNSNHFD DILFAAKPHP GQNQIATWIR EDLNHHVHPR NSDRLQDRYS IRCAPHIIGV LQDALPFMRQ FIETEVNSAN DNPIVDAEGE HILHGGHFYG GHIAFAMDSL KNIVANIADL IDRQMALVMD PKFNNGLPAN LSGSTGPRRA INHGFKAVQI GVSAWTAEAL KHTMPASVFS RSTECHNQDK VSMGTIAARD CMRVLQLTEQ VAAAALLAMT QGIDLRITQN ELDEASLTPS LATTLAQVRA DFEPLVEDRP LEAVLRQTVA KIQAGEWEVC R
|
| |