Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr7_3682 |
Symbol | |
ID | 4254892 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-7 |
Kingdom | Bacteria |
Replicon accession | NC_008322 |
Strand | + |
Start bp | 4383438 |
End bp | 4385003 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 638124370 |
Product | histidine ammonia-lyase |
Protein accession | YP_739719 |
Protein GI | 114049169 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase [TIGR01226] phenylalanine ammonia-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCACG CAGTTACTCA CACGACAACC GCTGAGTCGC CAATCGAATT TGGCCGTCAG TTACTCACAT TAGAGCAAGT CGTCGCCGTT GCTAAGGGTG CTAAAGTCAA ACTCTGTAAT GATGCCGATT ACCAAGCGTA TATCCAAAAG GGCGCCCGCT TTATCGACAG CCTGCTGCAC GAAGAAGGTG TGGTCTACGG CGTTACCACT GGCTATGGCG ACTCTTGCAC TGTCAACGTG AGTCTCGACT TAGTCCATGA GTTGCCACTG CACTTATCTC GCTTCCATGG CTGTGGCCTC GGTGAAGTCT TGAGCGTGAT GCAGGCGCGC GCCGTGATGG CTTGCCGTTT AAACTCCCTC GCCATCGGTA AATCCGGCGT GACCTATGAG CTGTTAAAGC GCATCGAAAC CTTGCTTAAT CTCAATATAG TGCCAGTGAT CCCAGAGGAA GGTTCAGTCG GCGCCAGCGG TGACTTAACG CCATTGTCTT ATCTCGCCGC CGTGCTAGTT GGCGAGCGCG AAGTGATTTA TAACGGCGAG CGCAGAGCCA CCCAAGAGGT TTACCGCGAG CTGAACATCA CGCCCCATGT GCTGCGCCCC AAGGAAGGCT TAGCCCTGAT GAATGGCACG GCGGTGATGA CGGCATTAGC CTGTTTAGCC TTTGATCGCG CACAATATTT AGCGCGTTTA GCCAGCCGCA TTACCGCTAT GGCGTCGTTA ACCCTCAAAG GTAACTCAAA CCATTTCGAC GATATTCTGT TTGCCGCCAA ACCCCATCCG GGGCAAAACC AAATCGCGGC TTGGATCAGG GAAGATTTGA ACCACCATGT TCATCCCCGC AATTCAGATC GTCTGCAGGA CAGATATTCC ATCCGCTGCG CGCCGCACAT CATTGGCGTA TTGCAGGATG CGCTGCCCTT TATGCGCCAA TTTATCGAAA CTGAAGTTAA CAGCGCCAAC GATAACCCCA TAGTCGATGG TGAAGGCGAA CATATTCTCC ACGGCGGCCA TTTCTACGGC GGACACATTG CCTTTGCGAT GGACTCCTTG AAAAACACTG TGGCCAACAT CGCCGATCTC ATCGACCGCC AAATGGCATT AGTGATGGAC CCTAAGTTTA ACAACGGTTT ACCCGCTAAC CTTTCGGGTT CAACTGGGCC ACGCCGCGCC ATCAACCATG GCTTTAAGGC GGTGCAAATC GGCGTTTCGG CTTGGACGGC AGAGGCGCTC AAGCACACTA TGCCCGCGAG TGTTTTCTCT CGCTCAACCG AATGCCATAA CCAAGATAAA GTCAGCATGG GCACTATCGC CGCCCGTGAC TGTATGCGCG TATTGCAACT AACAGAGCAA GTCGCCGCCG CAGCCCTACT CGCCATGACC CAAGGCATTG GTCTGCGTAT CAAACAAAAC GAGCTAGACG AAACCTCGCT GACCCCTTCG CTGGCGACCA CGCTCGCCCA AGTGCGCGCC GATTTTGAGC CATTAGTCGA AGACAGACCG CTCGAAGCCG TGCTGCGCCA AACCGTTGCG AAAATCCAAG CGGGCGAATG GGAAGTGTGC CGATGA
|
Protein sequence | MSHAVTHTTT AESPIEFGRQ LLTLEQVVAV AKGAKVKLCN DADYQAYIQK GARFIDSLLH EEGVVYGVTT GYGDSCTVNV SLDLVHELPL HLSRFHGCGL GEVLSVMQAR AVMACRLNSL AIGKSGVTYE LLKRIETLLN LNIVPVIPEE GSVGASGDLT PLSYLAAVLV GEREVIYNGE RRATQEVYRE LNITPHVLRP KEGLALMNGT AVMTALACLA FDRAQYLARL ASRITAMASL TLKGNSNHFD DILFAAKPHP GQNQIAAWIR EDLNHHVHPR NSDRLQDRYS IRCAPHIIGV LQDALPFMRQ FIETEVNSAN DNPIVDGEGE HILHGGHFYG GHIAFAMDSL KNTVANIADL IDRQMALVMD PKFNNGLPAN LSGSTGPRRA INHGFKAVQI GVSAWTAEAL KHTMPASVFS RSTECHNQDK VSMGTIAARD CMRVLQLTEQ VAAAALLAMT QGIGLRIKQN ELDETSLTPS LATTLAQVRA DFEPLVEDRP LEAVLRQTVA KIQAGEWEVC R
|
| |