Gene SbBS512_E4214 details


Gene Information

Locus tag: SbBS512_E4214
Symbol: tnaA
ID: 6268783
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 3939016
End bp: 3940431
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 50%
IMG OID: 641728034
Product: tryptophanase
Protein accession: YP_001882455
Protein GI: 187731211
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3033] Tryptophanase
TIGRFAM ID: [TIGR02617] tryptophanase, leader peptide-associated
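
The coordinates and lengths listed above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the database record: the variable names are illustrative, and it assumes that the start and end positions are 1-based and inclusive and that the CDS ends with a single untranslated stop codon.

```python
# Values copied from the Gene Information table.
start_bp = 3939016
end_bp = 3940431
reported_gene_length = 1416      # bp
reported_protein_length = 471    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length == reported_gene_length)        # True
print(protein_length == reported_protein_length)  # True
```

Under these assumptions the 1416 bp gene corresponds to 472 codons, that is 471 amino acids plus the stop codon.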


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAACT TTAAACATCT CCCTGAACCG TTCCGCATTC GTGTTATTGA GCCAGTAAAA 
CGTACTACTC GCGCTTATCG TGAAGAAGCA ATTATTAAAT CCGGTATGAA CCCGTTCCTG
CTGGATCGCG AAGATGTGTT TATCGATTTA CTGACCGACA GCGGCACCGG GGCGGTAACC
CAAAGTATGC AGGCAGCGAT GATGCGCGGC GACGAAGCCT ACAGCGGCAG CCGCAGCTAC
TATGCGTTAG CCGAGTCAGT GAAAAATATC TTTGGTTATC AATATACTAT TCCGACTCAC
CAGGGCCGTG GCGCAGAGCA AATCTATATT CCGGTACTGA TTAAAAAACG CGAGCAGGAA
AAAGGCCTGG ATCGCAGCAA AATGGTGGCA TTCTCTAACT ATTTCTTTGA TACCACGCAG
GGCCATAGCC AGATTAACGG CTGTACCGTG CGTAACGTCT ATATCAAAGA AGCCTTCGAT
ACGGGCGTGC GTTACGACTT TAAAGGCAAC TTTGACCTCG AAGGATTAGA ACGCGGTATT
GAAGAAGTTG GCCCGAATAA CGTGCCGTAT ATCGTTGCAA CCATCACCAG TAACTCTGCA
GGTGGTCAGC CGGTTTCACT GGCAAACTTA AAAGCGATGT ACAGCATCGC GAAGAAATAC
GATATTCCGG TGGTAATGGA CTCCGCACGC TTTGCTGAAA ACGCCTATTT CATCAAGCAG
CGTGAAGCAG AATACAAAGA CTGGACCATC GAGCAGATCA CCCGCGAAAC CTACAAATAT
GCCGATATGC TGGCGATGTC CGCCAAGAAA GATGCGATGG TGCCGATGGG CGGCTTGCTG
TGCATGAAAG ACGACAGCTT CTTTGATGTG TACACCGAGT GCAGAACCCT TTGCGTGGTG
CAGGAAGGCT TCCCGACATA TGGCGGCCTG GAAGGCGGCG CGATGGAGCG TCTGGCGGTA
GGTCTGTATG ACGGCATGAA TCTCGACTGG CTGGCTTATC GTATCGCGCA GGTACAGTAT
CTGGTCGATG GTCTGGAAGA GATTGGCGTT GTCTGCCAGC AGGCGGGCGG TCACGCGGCA
TTCGTTGATG CCGGTAAACT GCTGCCGCAT ATCCCGGCAG ACCAGTTCCC GGCACAGGCG
CTGGCGTGCG AGCTGTATAA AGTCGCCGGT ATCCGTGCGG TAGAAATTGG CTCTTTCCTG
TTAGGCCGCG ATCCGAAAAC CGGTAAACAA CTGCCATGCC CGGCTGAACT GCTGCGTTTA
ACCATTCCGC GCGCAACATA TACTCAAACA CATATGGACT TCATTATTGA AGCCTTTAAA
CATGTGAAAG AGAACGCGGC GAATATTAAA GGATTAACCT TTACCTACGA ACCAAAAGTA
TTGCGTCACT TCACCGCAAA ACTGAAAGAA GTTTAA
 
Protein sequence
MENFKHLPEP FRIRVIEPVK RTTRAYREEA IIKSGMNPFL LDREDVFIDL LTDSGTGAVT 
QSMQAAMMRG DEAYSGSRSY YALAESVKNI FGYQYTIPTH QGRGAEQIYI PVLIKKREQE
KGLDRSKMVA FSNYFFDTTQ GHSQINGCTV RNVYIKEAFD TGVRYDFKGN FDLEGLERGI
EEVGPNNVPY IVATITSNSA GGQPVSLANL KAMYSIAKKY DIPVVMDSAR FAENAYFIKQ
REAEYKDWTI EQITRETYKY ADMLAMSAKK DAMVPMGGLL CMKDDSFFDV YTECRTLCVV
QEGFPTYGGL EGGAMERLAV GLYDGMNLDW LAYRIAQVQY LVDGLEEIGV VCQQAGGHAA
FVDAGKLLPH IPADQFPAQA LACELYKVAG IRAVEIGSFL LGRDPKTGKQ LPCPAELLRL
TIPRATYTQT HMDFIIEAFK HVKENAANIK GLTFTYEPKV LRHFTAKLKE V
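
The gene and protein sequences above are printed in blocks of 10 characters, so the whitespace has to be removed before they can be analysed. The sketch below shows one way to do that and to translate the cleaned DNA. The function names `clean_sequence`, `gc_fraction` and `translate` are hypothetical helpers, not part of any database tool. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and it stops at the first stop codon.

```python
# Codon table built from the standard base order T, C, A, G.
# Assumption: table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate a cleaned DNA string codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGGAAAACT TTAAACATCT")
print(translate(first_blocks))  # MENFKH
```

The translated prefix matches the first six residues of the protein sequence. Passing the full gene sequence to `clean_sequence` and then to `gc_fraction` or `translate` allows the reported GC content and protein sequence to be compared against the record.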