Gene SNSL254_A0223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0223 
SymbolhemL 
ID6484245 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp239370 
End bp240650 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content58% 
IMG OID642735660 
Productglutamate-1-semialdehyde aminotransferase 
Protein accessionYP_002039442 
Protein GI194444494 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0001] Glutamate-1-semialdehyde aminotransferase 
TIGRFAM ID[TIGR00713] glutamate-1-semialdehyde-2,1-aminomutase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.01485 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value0.360359 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAGT CTGAAAATCT CTATAGCGCG GCCCGCGAGC TGATCCCCGG CGGCGTGAAC 
TCCCCTGTTC GCGCCTTCAC TGGCGTAGGC GGTACTCCGC TGTTTATCGA AAAAGCGGAC
GGCGCTTATC TTTATGATGT CGATGGCAAA GCGTATATCG ACTATGTCGG TTCCTGGGGG
CCAATGGTAC TGGGGCATAA CCATCCGGCT ATCCGCAATG CGGTGATCGA AGCTGCGGAG
CGCGGTTTAA GCTTCGGCGC GCCAACCGAA ATGGAAGTAA AAATGGCGGA ACTGGTGACC
AACCTGGTGC CGACCATGGA CATGGTGCGC ATGGTGAACT CCGGCACCGA AGCGACGATG
AGCGCTATTC GCCTGGCGCG TGGTTTTACT GGCCGCGATA AGATTATCAA ATTCGAAGGC
TGCTACCACG GCCACGCAGA CTGTCTGCTG GTCAAAGCCG GTTCTGGCGC GCTGACGCTC
GGTCAGCCGA ACTCGCCGGG CGTGCCGGCA GATTTCGCGA AACATACGCT GACCTGCACT
TATAACGATC TGACGTCAGT GCGCGCGGCG TTTGAACAAT ATCCGCAGGA AATCGCCTGT
ATCATCGTCG AACCCGTAGC GGGCAATATG AACTGCGTCC CGCCGCTGCC GGAATTTCTG
CCAGGTCTGC GCGCCTTGTG CGATGAGTTC GGCGCGCTGC TGATTATCGA CGAAGTGATG
ACCGGTTTTC GCGTAGCGCT GGCCGGAGCC CAGGATTACT ACGGCGTCGT GCCTGACCTG
ACCTGTCTGG GTAAAATCAT CGGCGGCGGG ATGCCGGTAG GCGCGTTTGG CGGTCGTCGC
GATGTAATGG ATGCGCTGGC GCCGACGGGC CCGGTTTACC AGGCGGGCAC CCTTTCCGGC
AACCCGATTG CGATGGCGGC CGGTTTCGCC TGCCTGAATG AAGTCGCCCA GCCCGGCATT
CATGAAACGC TGGATGAGCT CACCACCCGT CTGGCGGAAG GTTTGCTGGA AGCTGCCGAA
GAAGCGAATA TTCCGCTGGT GGTTAACCAT GTCGGCGGCA TGTTCGGGAT TTTCTTCACC
GACGCTGAGA GCGTAACCTG CTATCAGGAC GTGATGGCGT GCGACGTGGA ACGCTTTAAG
CGTTTCTTCC ACCTGATGCT GGAGGAAGGC GTATATCTGG CGCCGTCCGC GTTCGAAGCA
GGCTTTATGT CCGTGGCGCA CAGTGAAGAA GATATCAATA ACACCATCGA CGCCGCGCGT
CGGGTGTTTG CGAAACTGTA A
 
Protein sequence
MSKSENLYSA ARELIPGGVN SPVRAFTGVG GTPLFIEKAD GAYLYDVDGK AYIDYVGSWG 
PMVLGHNHPA IRNAVIEAAE RGLSFGAPTE MEVKMAELVT NLVPTMDMVR MVNSGTEATM
SAIRLARGFT GRDKIIKFEG CYHGHADCLL VKAGSGALTL GQPNSPGVPA DFAKHTLTCT
YNDLTSVRAA FEQYPQEIAC IIVEPVAGNM NCVPPLPEFL PGLRALCDEF GALLIIDEVM
TGFRVALAGA QDYYGVVPDL TCLGKIIGGG MPVGAFGGRR DVMDALAPTG PVYQAGTLSG
NPIAMAAGFA CLNEVAQPGI HETLDELTTR LAEGLLEAAE EANIPLVVNH VGGMFGIFFT
DAESVTCYQD VMACDVERFK RFFHLMLEEG VYLAPSAFEA GFMSVAHSEE DINNTIDAAR
RVFAKL