Gene Sbal223_0344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_0344 
Symbol 
ID7086631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp385285 
End bp386859 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content52% 
IMG OID643459265 
ProductHistidine ammonia-lyase 
Protein accessionYP_002356302 
Protein GI217971551 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase
[TIGR01226] phenylalanine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0617249 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCACA CCAAAGCAGC CGACTCCGAG AGTCAATCAG ACCTACAGAC TGTTGAGTTC 
GGCCGTCAAT ATCTCACCTT AGAACAAGTG GTTGCCGTCG CCAAGGGCGC GCCAGTCAAG
CTCTGTGACG ATGCGGATTA TCAAGAATAT ATCCAAAAAG GTGCGCGCTT TATCGATAGC
CTGCTGCACG AAGAAGGCGT GGTCTATGGC GTAACCACAG GTTATGGCGA CTCTTGCACT
GTCAATGTGA GCTTAGATCT GGTCCACGAA TTGCCACTGC ATTTAACCCG CTTCCACGGC
TGCGGTTTAG GCGAGACCTT AAGCATAATG CAGGCCCGCG CTGTGATGGC TTGCCGTTTA
AACTCCTTAG CCATTGGCAA ATCTGGCGTG ACCTATGAGC TATTAAAGCG TATCGAAACC
TTGCTCAATC TCAACATAGT GCCAGTGATC CCAGAGGAAG GTTCGGTCGG CGCCAGCGGG
GATTTAACGC CGTTGTCGTA TTTAGCCGCC GTGTTAGTCG GTGAGCGCGA AGTGATTTAC
CAAGGTGAGC GCCAAGCGAC TCAAGATGTG TACGCTAAGC TCAACATCAT CCCGCATGTC
CTGCGTCCAA AAGAAGGCTT AGCCCTGATG AACGGCACCG CCGTGATGAC AGCTTTAGCC
TGCTTAGCCT TTGATCGCGC GCAATATCTT GCTCGCTTGA GTAGTCGCAT TACCGCCATG
GCTTCGTTAA CGCTAAAAGG CAACTCGAAC CATTTTGATG AAATCTTATT TGCCGCCAAA
CCGCATCCAG GGCAAAACCA AATCGCCACT TGGATACGGG AAGATTTGAA TCACCACGAA
CATCCACGTA ATTCTGATCG TCTGCAGGAC AGATATTCCA TCCGCTGCGC ACCACACATT
ATCGGCGTGT TGCAGGATGC CCTGCCGTTT ATGCGTCAGT TTATCGAAAC CGAAATCAAC
AGCGCCAACG ACAACCCGAT TGTCGACGGC GAAGGCGAGC ATATTCTCCA CGGTGGCCAT
TTCTACGGCG GCCATATCGC CTTTGCGATG GATTCACTGA AAAATACTGT GGCCAACTTA
GCCGATCTTA TCGACCGCCA AATGGCGCTA GTAATGGACC CTAAGTTTAA CAATGGCTTA
CCTGCGAACC TATCCGGCTC AACCGGTCCA CGCCGCGCGA TTAATCATGG CTTTAAAGCG
GTACAAATTG GCGTGTCGGC TTGGACGGCA GAAGCGCTCA AGCACACTAT GCCGGCCAGC
GTGTTCTCTC GCTCGACCGA ATGCCACAAC CAAGACAAGG TGAGCATGGG TACTATCGCC
GCCCGCGATT GTATGCGCGT GCTGCAATTG ACCGAACAAG TCGCAGCGGC TGCATTACTT
GCCATGACCC AAGGTATTGG TCTGCGTATC GCTCAAAATG AACTGAGCGA AGCCTCGCTC
ACACCCTCAT TGGCGAAAAC CCTCGCTCAA GTGCGCGCCG ATTTTGAAAC CTTAATTGAA
GATAGGCCGC TCGAAACTGT GCTGCGCCAA ACCATAGCTA AAATCCAAGC GGGCGAATGG
GAAGTGTGCC GATGA
 
Protein sequence
MSHTKAADSE SQSDLQTVEF GRQYLTLEQV VAVAKGAPVK LCDDADYQEY IQKGARFIDS 
LLHEEGVVYG VTTGYGDSCT VNVSLDLVHE LPLHLTRFHG CGLGETLSIM QARAVMACRL
NSLAIGKSGV TYELLKRIET LLNLNIVPVI PEEGSVGASG DLTPLSYLAA VLVGEREVIY
QGERQATQDV YAKLNIIPHV LRPKEGLALM NGTAVMTALA CLAFDRAQYL ARLSSRITAM
ASLTLKGNSN HFDEILFAAK PHPGQNQIAT WIREDLNHHE HPRNSDRLQD RYSIRCAPHI
IGVLQDALPF MRQFIETEIN SANDNPIVDG EGEHILHGGH FYGGHIAFAM DSLKNTVANL
ADLIDRQMAL VMDPKFNNGL PANLSGSTGP RRAINHGFKA VQIGVSAWTA EALKHTMPAS
VFSRSTECHN QDKVSMGTIA ARDCMRVLQL TEQVAAAALL AMTQGIGLRI AQNELSEASL
TPSLAKTLAQ VRADFETLIE DRPLETVLRQ TIAKIQAGEW EVCR