Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_0344 |
Symbol | |
ID | 4250855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | - |
Start bp | 366588 |
End bp | 368153 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 638116899 |
Product | histidine ammonia-lyase |
Protein accession | YP_732481 |
Protein GI | 113968688 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase [TIGR01226] phenylalanine ammonia-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCACG CAGTTACTCA CACGACAACC GCTGAGCCGC CAATCGAATT TGGCCGTCAG TTACTCACAT TAGAGCAAGT CGTCGCCGTT GCTAAGGGTG CTAAAGTCAA ACTCTGTGAT GATGCCGATT ACCAAGCGTA TATCCAAAAG GGCGCCCGCT TTATCGATAG CCTGCTGCAC GAAGAAGGTG TGGTCTACGG CGTTACCACC GGCTATGGCG ACTCTTGCAC TGTCAACGTG AGTCTCGACT TAGTCCATGA GTTGCCACTG CACTTATCCC GCTTCCATGG CTGTGGCCTA GGCGAAATCT TAAGCGTGAT GCAGGCGCGC GCCGTGATGG CTTGCCGTTT AAACTCCCTC GCCATTGGTA AATCCGGCGT GACCTATGAG CTGTTAAAGC GCATCGAAAC CTTGCTTAAT CTCAATATAG TGCCAGTGAT CCCAGAGGAA GGTTCAGTCG GCGCCAGCGG TGACTTAACG CCATTGTCTT ATCTCGCCGC CGTGCTAGTT GGCGAGCGCG AAGTGATTTA TAACGGCGAG CGCAGAGCCA CCCAAGAGGT TTACCGCGAG CTGAACATCA CGCCCCATGT GCTGCGCCCC AAGGAAGGTT TAGCCCTGAT GAATGGCACG GCGGTGATGA CGGCATTAGC CTGTTTAGCC TTTGATCGTG CACAATATTT AGCGCGTTTA GCCAGCCGCA TTACCGCCAT GGCGTCGTTA ACTCTCAAAG GTAACTCAAA CCATTTCGAC GATATTCTGT TTGCCGCCAA ACCCCATCCG GGACAAAACC AAATCGCGAC TTGGATCAGG GAAGATTTGA ACCACCATGT TCACCCCCGC AATTCAGATC GTCTGCAGGA CAGATATTCC ATCCGCTGCG CGCCGCACAT CATTGGCGTA TTGCAGGATG CGCTGCCCTT TATGCGCCAA TTTATCGAGA CCGAAGTTAA CAGCGCCAAC GATAACCCCA TAGTCGATGG TGAAGGCGAG CATATTCTCC ACGGCGGCCA TTTCTACGGC GGACACATTG CCTTTGCGAT GGACTCCTTA AAAAACACTG TGGCCAACAT CGCCGATCTC ATCGACCGCC AAATGGCATT AGTGATGGAC CCTAAGTTTA ACAACGGTTT GCCAGCTAAC CTTTCGGGTT CAACTGGGCC ACGCCGCGCC ATCAACCATG GCTTTAAGGC AGTGCAAATC GGCGTATCGG CTTGGACGGC TGAGGCGCTC AAGCACACTA TGCCTGCGAG CGTTTTCTCT CGTTCAACCG AATGCCATAA CCAGGACAAA GTCAGCATGG GCACTATCGC CGCCCGTGAC TGTATGCGTG TATTGCAGCT GACAGAACAA GTCGCCGCCG CAGCCCTGCT TGCCATGACC CAAGGCATTG GTCTGCGTAT CAAACAGAAT GAGTTAGACG AAGCCTCGCT GACGCCATCG CTGGCGACCA CGCTCGCCCA AGTGCGCGCC GATTTTGAGC CATTAGTCGA AGACAGACCG CTCGAAGCCG TGCTGCGCCA AACCGTTTCG AAAATCCAAG CGGGCGAATG GGAAGTATGC CGATGA
|
Protein sequence | MSHAVTHTTT AEPPIEFGRQ LLTLEQVVAV AKGAKVKLCD DADYQAYIQK GARFIDSLLH EEGVVYGVTT GYGDSCTVNV SLDLVHELPL HLSRFHGCGL GEILSVMQAR AVMACRLNSL AIGKSGVTYE LLKRIETLLN LNIVPVIPEE GSVGASGDLT PLSYLAAVLV GEREVIYNGE RRATQEVYRE LNITPHVLRP KEGLALMNGT AVMTALACLA FDRAQYLARL ASRITAMASL TLKGNSNHFD DILFAAKPHP GQNQIATWIR EDLNHHVHPR NSDRLQDRYS IRCAPHIIGV LQDALPFMRQ FIETEVNSAN DNPIVDGEGE HILHGGHFYG GHIAFAMDSL KNTVANIADL IDRQMALVMD PKFNNGLPAN LSGSTGPRRA INHGFKAVQI GVSAWTAEAL KHTMPASVFS RSTECHNQDK VSMGTIAARD CMRVLQLTEQ VAAAALLAMT QGIGLRIKQN ELDEASLTPS LATTLAQVRA DFEPLVEDRP LEAVLRQTVS KIQAGEWEVC R
|
| |