Gene SeHA_C4546 details


Gene Information

Locus tag: SeHA_C4546
Symbol:
ID: 6492385
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 4424289
End bp: 4425332
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 55%
IMG OID: 642744618
Product: gp47
Protein accession: YP_002048195
Protein GI: 194450026
COG category: [R] General function prediction only
COG ID: [COG3500] Phage protein D
TIGRFAM ID:
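
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 4424289
END_BP = 4425332


def gene_length(start_bp, end_bp):
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, ASSUMING one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1044
print(protein_length(length_bp))  # 347
```

Under these assumptions both results agree with the Gene Length (1044 bp) and Protein Length (347 aa) fields.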


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 70
Fosmid unclonability p-value: 0.494674
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGAGA TAACGGTATC CGGCGGGGTG TTCGCCACCC TGACGCCCAT TTTTACCCTT 
TGGTACGGAC ATAAAGAGAT CACTTACGAC ATCGCGCCTT ATGTCACCAG CATCAGTTAC
AGCGACAGTA TTAAAAACGA GTCGGATGTC ATTGCCATTG CGCTGGAAGA TAGCACCGGG
CGCTGGGTAA ACGAATGGTA TCCGGGAAAA GGCGACACGC TGGCGCTGCG CCTGGGCTAC
CAGGGCGAAG ATCTGCTCGA TTGCGGAATC TATGTCATTG ATAAAATTGA TATCAGCGCG
CCGCCTTCGA CGGTCAATAT CGACGGTATC GCCACCTCGG TCAGCAAAGC GCTACGCACC
AAAAACAGCC AGGGCTTTGA GGAGACTACG CTTTCCGCCA TCGCCAGTCG CATCGCGCAA
AAACACGGTT TAACGCTGGC GGGCAAGATT GCGCCGCTGA CGATTGATCG GGTCACGCAA
TATGCCGAAA CCGATGTAGC GTTTCTCAAA CGGCTGGCGA GTGAATATGG CTATACCGTG
AAAGTGACGG CGACGGAGCT GATCTTTTCG CATCTGCCGA CGCTGCGCTG TCTGGCGCCG
GTGAAGACGC TCAGGCGGAC GGATGTTTCG CACTACACGT TCAAAGATAC CATCAACCGG
ATCTACAAAA ACGCCACCGT GCAGCATCAA AATAGCAAGC AAAAAGAACT GGTTATTTAT
ACCCATGATA GCCAGGAAAA GACCTCGGCG CGCGGTGCGG CGACCAGCGC CGATACCCTG
AAGATCAACA GTCGCGCTCC GGATACCGGC GCGGCGCAGG CTAAAGCCAA TGCCGCGCTG
GACAGCCACA ACGAATACCA GCAGACCGGC ACGCTCAGCT TGATGGGCTG CCCGCAGTTG
ACGGCGGGCA ACAAGATAGA ACTGAGCGAT TTTGGCGTAC TTTCCGGGCA GTGGCTGATT
GATAAATCCA TGCACAAACT CACGCGCAGC GGCGGCTACA CTACCGAAAT CGACATTTCA
CGCGGACCGG CAACCAGCCA GTAA
 
Protein sequence
MAEITVSGGV FATLTPIFTL WYGHKEITYD IAPYVTSISY SDSIKNESDV IAIALEDSTG 
RWVNEWYPGK GDTLALRLGY QGEDLLDCGI YVIDKIDISA PPSTVNIDGI ATSVSKALRT
KNSQGFEETT LSAIASRIAQ KHGLTLAGKI APLTIDRVTQ YAETDVAFLK RLASEYGYTV
KVTATELIFS HLPTLRCLAP VKTLRRTDVS HYTFKDTINR IYKNATVQHQ NSKQKELVIY
THDSQEKTSA RGAATSADTL KINSRAPDTG AAQAKANAAL DSHNEYQQTG TLSLMGCPQL
TAGNKIELSD FGVLSGQWLI DKSMHKLTRS GGYTTEIDIS RGPATSQ
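
The relationship between the gene sequence, the protein sequence and the GC content field can be checked with a short script. This is a minimal sketch, not the tool that produced the record: the helper names are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in which codons may act as starts). Whitespace in the 10-base blocks is stripped before processing.

```python
# Standard codon-to-amino-acid assignments, ordered T, C, A, G at each position.
# Translation table 11 uses the same assignments; only start codons differ.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence above.
first_blocks = "ATGGCTGAGA TAACGGTATC CGGCGGGGTG"
print(translate(first_blocks))  # MAEITVSGGV
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.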