Gene SeD_A0221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0221 
SymbolhemL 
ID6871923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp234864 
End bp236144 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content59% 
IMG OID642783468 
Productglutamate-1-semialdehyde aminotransferase 
Protein accessionYP_002214162 
Protein GI198244305 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0001] Glutamate-1-semialdehyde aminotransferase 
TIGRFAM ID[TIGR00713] glutamate-1-semialdehyde-2,1-aminomutase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00426685 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones79 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAGT CTGAAAATCT CTATAGCGCG GCCCGCGAGC TGATCCCCGG CGGCGTGAAC 
TCCCCTGTTC GCGCCTTCAC TGGCGTGGGC GGCACCCCGC TGTTTATCGA AAAAGCGGAC
GGCGCGTATC TGTACGATGT CGATGGCAAA GCGTATATCG ACTATGTCGG TTCCTGGGGA
CCAATGGTAC TGGGGCATAA CCATCCGGCT ATCCGCAATG CGGTGATCGA AGCTGCGGAG
CGCGGTTTAA GCTTCGGCGC GCCAACCGAA ATGGAAGTGA AAATGGCGGA ACTGGTCACC
AACCTGGTGC CGACCATGGA CATGGTGCGC ATGGTGAACT CCGGCACCGA AGCGACCATG
AGCGCTATTC GCCTGGCGCG TGGTTTTACT GGCCGCGATA AGATTATCAA ATTCGAAGGC
TGCTACCACG GCCACGCAGA CTGCCTGCTG GTCAAAGCCG GTTCTGGCGC GCTGACGCTC
GGTCAGCCGA ACTCGCCGGG CGTGCCGGCA GATTTCGCGA AACATACGCT GACCTGCACT
TATAACGATC TGGCGTCAGT ACGCGCGGCG TTTGAACAAT ATCCGCAGGA AATCGCCTGT
ATCATCGTCG AACCCGTAGC GGGCAATATG AACTGCGTCC CGCCGCTGCC GGAATTTCTG
CCAGGTCTGC GCGCCTTGTG CGATGAGTTC GGCGCGCTGC TGATTATCGA CGAAGTGATG
ACCGGTTTTC GCGTAGCGCT GGCCGGAGCC CAGGATTACT ACGGCGTCGT GCCGGACCTG
ACCTGTCTGG GTAAAATCAT CGGCGGCGGG ATGCCGGTAG GCGCGTTTGG CGGTCGTCGC
GATGTAATGG ATGCGCTGGC GCCGACGGGC CCGGTTTACC AGGCGGGCAC CCTTTCCGGC
AACCCGATTG CGATGGCGGC CGGTTTCGCC TGCCTGAATG AAGTCGCCCA GCCCGGCATT
CATGAAACGC TGGATGAGCT CACCACCCGT CTGGCGGAAG GTTTGCTGGA AGCTGCCGAA
GAAGCGAATA TTCCGCTGGT GGTTAACCAT GTCGGCGGCA TGTTCGGGAT TTTCTTCACC
GACGCTGAGA GCGTAACCTG CTATCAGGAC GTGATGGCGT GCGACGTGGA ACGCTTTAAG
CGTTTCTTCC ACCTGATGCT GGAGGAAGGC GTGTATCTGG CGCCATCGGC GTTTGAGGCG
GGCTTTATGT CGGTCGCACA CAGCATGGAC GACATTAATA ATACTATTGA CGCCGCGCGT
CGGGTGTTTG CGAAACTGTA A
 
Protein sequence
MSKSENLYSA ARELIPGGVN SPVRAFTGVG GTPLFIEKAD GAYLYDVDGK AYIDYVGSWG 
PMVLGHNHPA IRNAVIEAAE RGLSFGAPTE MEVKMAELVT NLVPTMDMVR MVNSGTEATM
SAIRLARGFT GRDKIIKFEG CYHGHADCLL VKAGSGALTL GQPNSPGVPA DFAKHTLTCT
YNDLASVRAA FEQYPQEIAC IIVEPVAGNM NCVPPLPEFL PGLRALCDEF GALLIIDEVM
TGFRVALAGA QDYYGVVPDL TCLGKIIGGG MPVGAFGGRR DVMDALAPTG PVYQAGTLSG
NPIAMAAGFA CLNEVAQPGI HETLDELTTR LAEGLLEAAE EANIPLVVNH VGGMFGIFFT
DAESVTCYQD VMACDVERFK RFFHLMLEEG VYLAPSAFEA GFMSVAHSMD DINNTIDAAR
RVFAKL