Gene SO_4374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSO_4374 
Symbol 
ID1171976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella oneidensis MR-1 
KingdomBacteria 
Replicon accessionNC_004347 
Strand
Start bp4567930 
End bp4569495 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content53% 
IMG OID637346100 
Producthistidine ammonia-lyase, putative 
Protein accessionNP_719898 
Protein GI24375855 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase
[TIGR01226] phenylalanine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCACG TAGTTACTCA AACGACAACC GTTGAGTCGC CAATCGAATT TGGCCGTCAG 
TTACTCACAT TAGAGCAGGT CGTCGCCGTG GCTAAGGGCG CAAAGGTCAA ACTCTGTGAT
GATGCCGATT ATCAAGAATA TATCCAAAAG GGCGCCCGCT TTATCGATAG TTTGCTGCAC
GAAGAAGGCG TGGTCTACGG CGTCACCACA GGCTATGGCG ACTCTTGCAC AGTGAATGTG
AGTCTTGACT TGGTCCACGA GTTGCCGCTG CACTTATCCC GTTTTCATGG TTGTGGCCTT
GGTGAAGTCT TAAGCGTAAT GCAAGCGCGC GCCGTGATGG CTTGCCGTTT AAACTCCCTC
GCCATTGGCA AATCCGGCGT GACATATGAG CTATTAAAGC GCATCCAAAC CTTGCTTAAT
CTCAATATCG TGCCAGTGAT CCCCGAAGAA GGCTCAGTCG GTGCCAGCGG AGACTTAACG
CCACTGTCTT ACCTTGCCGC CGTGCTGGTT GGTGAGCGTG AGGTGATTTA CCAAGGCGAG
CGCCGAGCCA CCAAAGAGGT TTATCACGAG CTGAATATCA CGCCCCATGT GCTACGTCCC
AAGGAAGGTT TAGCCCTGAT GAACGGCACG GCAGTGATGA CAGCGTTAGC CTGTTTAGCC
TTTGATCGCG CACAATATTT AGCGCGTTTA GCCAGCCGCA TTACCGCCAT GGCGTCGTTA
ACCCTTAAAG GCAACTCCAA CCATTTCGAC GATATTCTGT TTGCCGCCAA ACCTCATCCA
GGGCAAAACC AAATCGCCAC TTGGATACGG GAAGACTTGA ACCACCATGT TCACCCGCGC
AATTCCGACA GATTGCAGGA CAGATATTCC ATCCGCTGTG CGCCGCATAT TATTGGCGTG
CTGCAGGATG CGCTGCCCTT TATGCGCCAA TTTATCGAAA CCGAAGTTAA CAGCGCCAAC
GACAACCCGA TTGTCGATGC TGAAGGCGAG CATATTCTCC ATGGCGGCCA TTTTTACGGC
GGGCATATCG CCTTTGCGAT GGACTCCTTA AAAAATATTG TGGCCAATAT CGCCGATCTG
ATTGATCGCC AAATGGCATT AGTGATGGAC CCTAAATTTA ACAACGGCTT ACCCGCTAAC
CTTTCGGGTT CAACCGGGCC ACGCCGCGCC ATCAACCATG GCTTTAAGGC GGTGCAAATC
GGCGTTTCAG CCTGGACGGC AGAAGCACTG AAACACACTA TGCCCGCAAG CGTTTTCTCA
CGCTCAACCG AATGCCACAA CCAAGATAAA GTCAGCATGG GCACCATTGC CGCCCGCGAC
TGTATGCGTG TATTGCAACT AACGGAACAA GTCGCCGCCG CGGCGCTACT TGCTATGACT
CAAGGCATTG ATCTGCGTAT CACACAAAAC GAGTTAGACG AAGCCTCACT GACGCCATCA
CTGGCGACCA CGCTCGCCCA AGTGCGCGCT GACTTTGAGC CATTAGTCGA AGACAGACCG
CTCGAAGCCG TGCTACGCCA AACCGTCGCT AAAATCCAAG CCGGTGAATG GGAAGTGTGC
CGATGA
 
Protein sequence
MSHVVTQTTT VESPIEFGRQ LLTLEQVVAV AKGAKVKLCD DADYQEYIQK GARFIDSLLH 
EEGVVYGVTT GYGDSCTVNV SLDLVHELPL HLSRFHGCGL GEVLSVMQAR AVMACRLNSL
AIGKSGVTYE LLKRIQTLLN LNIVPVIPEE GSVGASGDLT PLSYLAAVLV GEREVIYQGE
RRATKEVYHE LNITPHVLRP KEGLALMNGT AVMTALACLA FDRAQYLARL ASRITAMASL
TLKGNSNHFD DILFAAKPHP GQNQIATWIR EDLNHHVHPR NSDRLQDRYS IRCAPHIIGV
LQDALPFMRQ FIETEVNSAN DNPIVDAEGE HILHGGHFYG GHIAFAMDSL KNIVANIADL
IDRQMALVMD PKFNNGLPAN LSGSTGPRRA INHGFKAVQI GVSAWTAEAL KHTMPASVFS
RSTECHNQDK VSMGTIAARD CMRVLQLTEQ VAAAALLAMT QGIDLRITQN ELDEASLTPS
LATTLAQVRA DFEPLVEDRP LEAVLRQTVA KIQAGEWEVC R