Gene Sbal223_1931 details


Gene Information

Locus tag: Sbal223_1931
Symbol:
ID: 7090098
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 2281582
End bp: 2282781
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 52%
IMG OID: 643460835
Product: histidinol-phosphate aminotransferase
Protein accession: YP_002357859
Protein GI: 217973108
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
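
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record; variable names are illustrative.
start_bp = 2281582
end_bp = 2282781
gene_length_bp = 1200
protein_length_aa = 399

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not part of the protein.
codons = span_bp // 3
residues = codons - 1

print(span_bp, residues)  # 1200 399
```

Under those assumptions, both derived values agree with the Gene Length and Protein Length fields.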


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.5021
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.393324
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAAG TGAGCAGCAT GGAAAAGACA GCAGAACCCG CAGCATTAGC ATCGATGTTA 
GCACCATCAA CGTTAGCGCA ATCGGCGTCG CTAAATTTAG CCGAGCGCCT AGCGCGCCCT
GAATTACTGG ATTTAACGCC CTACCAAAGT GCCCGCAGAC TGGGCGGTCG TGGTGATATT
TGGATCAATG CCAACGAATC GCCGTTCAAT AATGTCGATA CGGCAGCGCT CGATTTATCG
AAATTGAATC GTTACCCAGA ATGCCAACCG CCGCAGCTTA TCAATGCCTA CAGCGAATAC
AGCGGCGTGA GCGCCAGCAA GATAGTCGCC AGCCGCGGCG CCGATGAAGC GATTGAACTG
TTAATTCGCG CCTTTTGTAT TCCAGGTGTC GACAGCATCG CCTGCTTTGG CCCGACGTAC
GGCATGTATG CCATCAGCGC GAACACCTTT AATGTGGGCG TTAAAGCGTT AAATCTCAGC
GCGGAATATG GCTTGCCGAC AAGCTATGCC GAGGACGTTC GCGGCGCAAA ACTGGTGTTT
ATCTGCAATC CCAATAATCC GACCGGGACT GTGATCGACA AAGCCATTAT TGAGCAAGCG
ATCAAAGCCC TGCCCGATTC GCTTGTGGTG ATCGATGAGG CCTATATTGA GTTTTGTCCT
GAATACAGTG TGGCGGATTT ACTGGAAAGC TATCCGAATC TAGTGGTGCT ACGAACCCTG
TCAAAAGCCT TTGCACTCGC CGGCGCACGC TGTGGTTTTA TGCTGGCAAA CGAGGCAGTG
GTCGAAATCA TTATGCGCGT AATTGCACCC TATCCTGTGC CACTGCCCGT GAGTGGGGTC
GCCACACAGG CGCTATCAAG CGCTGGCGTT GCACGGATGA AAGTGCAAGT TGCACAATTA
AATGAGCAAG GCGCCAGACT CACAGCGGCG ATCAGCGCTT ATTGTTCAAA ATCGAATAGT
TCAGATTCGC GCGCCCGCGT GCTTAAGCCT AACGGCAATT ATGTGCTGGC TGAATTTGAT
GATGTCGCCA AGGTCGCAGC GTTACTGCAA GGCAGCGGCG TTGTCGCCCG CGCCTACAAA
GACCCAAGGC TTGCCAAGGC TATCCGCTTT AGCTTTAGTT CAAAGGCGGA TACCGATGTG
TTAGTGAATT TATTTGAATC GCAACACACT GAGCAAGCAC CAGAAACGAA TAATAAGTAA
 
Protein sequence
MSQVSSMEKT AEPAALASML APSTLAQSAS LNLAERLARP ELLDLTPYQS ARRLGGRGDI 
WINANESPFN NVDTAALDLS KLNRYPECQP PQLINAYSEY SGVSASKIVA SRGADEAIEL
LIRAFCIPGV DSIACFGPTY GMYAISANTF NVGVKALNLS AEYGLPTSYA EDVRGAKLVF
ICNPNNPTGT VIDKAIIEQA IKALPDSLVV IDEAYIEFCP EYSVADLLES YPNLVVLRTL
SKAFALAGAR CGFMLANEAV VEIIMRVIAP YPVPLPVSGV ATQALSSAGV ARMKVQVAQL
NEQGARLTAA ISAYCSKSNS SDSRARVLKP NGNYVLAEFD DVAKVAALLQ GSGVVARAYK
DPRLAKAIRF SFSSKADTDV LVNLFESQHT EQAPETNNK
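
The gene and protein sequences are printed in blocks of 10 characters, so the spaces have to be removed before the two can be compared. The following is a minimal sketch, not the database's own method: it builds a codon table from the 64-letter amino acid string that NCBI publishes for translation table 11 and translates the first line of the gene sequence. The helper name `translate` and the constant names are hypothetical. The sketch does not handle the alternative start codons that table 11 permits, which does not matter here because this gene starts with ATG.

```python
# Amino acid assignments for NCBI translation table 11, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-character block spacing used in the
    record) is removed first. Alternative start codons are not handled.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGAGCCAAG TGAGCAGCAT GGAAAAGACA GCAGAACCCG CAGCATTAGC ATCGATGTTA"
print(translate(first_line))  # MSQVSSMEKTAEPAALASML
```

The printed residues match the first 20 residues of the protein sequence listed in the record.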