Gene SbBS512_E1211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1211 
SymbolhisC 
ID6269071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1118005 
End bp1119075 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content55% 
IMG OID641725342 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_001879856 
Protein GI187733922 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACCG TGACTATTAC CGATTTAGCG CGTGAAAACG TCCGCAACCT GACGCCGTAT 
CAGTCGGCGC GTCGTCTGGG CGGTAATGGC GACGTCTGGC TGAACGCCAA CGAATATCCC
ACAGCCGTGG AGTTTCAGCT TACTCAGCAA ACGCTCAACC GCTACCCGGA ATGTCAGCCG
AAAGCGGTGA TTGAAAATTA CGCGCAATAT GCAGGCGTAA AACCGGAACA GGTGCTGGTC
AGCCGTGGCG CGGACGAAGG TATTGAACTG CTGATCCGCG CTTTTTGCGA ACCGGGTAAA
GACTCCATCC TCTACTGCCC ACCAACGTAC GGCATGTACA GCGTCAGCGC CGAAACGATT
GGCGTCGAGT GCCGCACAGT GCCGACGCTG GACAACTGGC AACTGGACTT ACAGGGCATT
TCCGACAAGC TGGACGGCGT AAAAGTGGTC TATGTTTGCA GCCCCAACAA CCCGACCGGG
CAACTGATCA ATCCGCAGGA TTTTCGCACC CTGCTGGAGT TAACGCGCGG TAAGGCGATT
GTGGTTGCCG ATGAAGCCTA TATAGAGTTT TGCCCGCAGG CATCGCTGGC TGGCTGGCTG
ACGGAATATC CGCACCTGGC TATTTTGCGC ACACTGTCGA AAGCTTTTGC TCTGGCGGGC
CTTCGTTGCG GATTTACGCT GGCAAACGAA GAAGTCATCA ACCTGCTGAT GAAAGTGATT
GCCCCCTACC CGCTCTCGAC GCCGGTTGCC GACATTGCGG CCCAGGCGTT AAGCCCGCAG
GGGATCGTCG CCATGCGTGA GCGGGTGGCG CAAATTATTG CAGAACGCGA ATACCTGATT
GCCGCACTGA AAGAGATCCC CTGCGTGGAG CAGGTTTTCG ACTCTGAAAC CAACTACATT
CTGGCGCGCT TTAAAGCCTC CAGTGCAGTG TTTAAATCTT TGTGGGATCA GGGCATTATC
TTACGTGATC AGAATAAACA ACCCTCTTTA AGCGGCTGCC TGCGAATTAC CGTCGGAACC
CGTGAAGAAA GCCAGCGCGT CATTGACGCC TTACGTGCGG AGCAAGTTTG A
 
Protein sequence
MSTVTITDLA RENVRNLTPY QSARRLGGNG DVWLNANEYP TAVEFQLTQQ TLNRYPECQP 
KAVIENYAQY AGVKPEQVLV SRGADEGIEL LIRAFCEPGK DSILYCPPTY GMYSVSAETI
GVECRTVPTL DNWQLDLQGI SDKLDGVKVV YVCSPNNPTG QLINPQDFRT LLELTRGKAI
VVADEAYIEF CPQASLAGWL TEYPHLAILR TLSKAFALAG LRCGFTLANE EVINLLMKVI
APYPLSTPVA DIAAQALSPQ GIVAMRERVA QIIAEREYLI AALKEIPCVE QVFDSETNYI
LARFKASSAV FKSLWDQGII LRDQNKQPSL SGCLRITVGT REESQRVIDA LRAEQV