Gene SeHA_C3476 details

Gene Information

Locus tag: SeHA_C3476
Symbol:
ID: 6487706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 3378277
End bp: 3379461
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 55%
IMG OID: 642743605
Product: phage protein
Protein accession: YP_002047219
Protein GI: 194449502
COG category: [S] Function unknown
COG ID: [COG3299] Uncharacterized homolog of phage Mu protein gp47
TIGRFAM ID:
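
The coordinates and lengths listed above can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes a terminal stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information table.
start_bp = 3378277
end_bp = 3379461
gene_length_bp = 1185
protein_length_aa = 394

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(span_bp % 3 == 0)                          # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions, the 1185 bp span corresponds to 395 codons, that is, 394 residues plus one stop codon.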


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.00000000149492
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACGGAAA AGCCACAGGT TGACTTTGAA GAGGTGGTGA AAGCCAGCGG TATGCCGGTG 
ACGGAAGAAG AGATTCGCGA TCGCTTTAAT GCCATTGCGA CGGAGGAGGG AATTATCACG
AATACCTCCC GTATGTCTCC GTTCTGGCGA CTGGTCACGG CCATTGTAAC CGCGCCGGTG
ATGTGGCTGA AGGAGGTTCT GATCTCCACC GTACTGGCAA ATATGTTTGT GGCCACGGCC
AGTGGAAGCA TGTTACGGCT GCTGGCATGG GCGGTGAATA TCACGCCGAA GCCAGCCAGC
GCTGCACAGG GCGTTATCCG TTTTTACAAG GAAGACGCCA GCGCCGTGGT GACGGTGAAG
GCCGGAACGG TGATACAGAC AGAACGTATT AACGGCAGGG TGTATGAACT GGCCATCACG
GAAGATGTGG TGATTGCCTC CGGTACCGCC AGCGCACTGC TGCCGGTAAA GGCAACGGGA
ACGGGCGGCG CATATAACCT TGCGCCGGGA TATTACCGCA TTCTGCCGGT GGCCGTGGAC
GGCATCAGCC ATGTGGCCAG TGAAGAAAAC TGGCTGACCG TACCGGGCGC GGATGAGGAA
AGCGATGATG AACTGCGTGA GCGTTGCCGT AACCAGTTTA ACCTGGTGGG CAACTACCAT
ACGGACGCGG TGTACCGGTC GATGATAGCC GGTGTTGCCG GACTGAGCAT TGACCGGATT
TTCTTTGAGC ACGAAGCACC GAGGGGGCCG GGGACAGCCA ACGCCTATTT ATTGCTGGAC
AGCGGCGTGG CTTCTGCGCC GTTTGTGGAT GCCGTGAATG ACTATATCAA CACTCAGGGA
CATCACGGCC ACGGGGACGA TATGCAGTGT TATGCCATGC CGGAAACCCT GCACGATCTG
GTGGTCACTG TCTGGGTCAG GAACCTGAAC AACATCAGTG ATGATGAACA GAAGCGCCTG
AAGGACGGTA TTGAAAACCT GATCCGGTGC GCCTTCCGGG AAAATACGGA CTATGACGTC
AGAAGGACGT GGCCGTATTC ACGGTTCTCC TTCTCGCAGC TGGGGCGCGA AATCCATAAA
AATTTTCCGG TAACAGAATC GCTGAATTTT TCGCTGGATG ACATTGCCAG TGAGCTGAAT
GTGCCGCGCC TGAAATCGCT TGTGGTGAGT ATTGAGAATG AATGA
 
Protein sequence
MTEKPQVDFE EVVKASGMPV TEEEIRDRFN AIATEEGIIT NTSRMSPFWR LVTAIVTAPV 
MWLKEVLIST VLANMFVATA SGSMLRLLAW AVNITPKPAS AAQGVIRFYK EDASAVVTVK
AGTVIQTERI NGRVYELAIT EDVVIASGTA SALLPVKATG TGGAYNLAPG YYRILPVAVD
GISHVASEEN WLTVPGADEE SDDELRERCR NQFNLVGNYH TDAVYRSMIA GVAGLSIDRI
FFEHEAPRGP GTANAYLLLD SGVASAPFVD AVNDYINTQG HHGHGDDMQC YAMPETLHDL
VVTVWVRNLN NISDDEQKRL KDGIENLIRC AFRENTDYDV RRTWPYSRFS FSQLGREIHK
NFPVTESLNF SLDDIASELN VPRLKSLVVS IENE
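
The relationship between the gene sequence and the protein sequence above can be reproduced with a small translation sketch. It is a minimal example, not the tool used to build this record: it assumes the sequence is read in frame from its first base, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Standard codon assignments, enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence shown above.
first_line = "ATGACGGAAA AGCCACAGGT TGACTTTGAA GAGGTGGTGA AAGCCAGCGG TATGCCGGTG"
last_line = "GTGCCGCGCC TGAAATCGCT TGTGGTGAGT ATTGAGAATG AATGA"

print(translate(first_line))  # MTEKPQVDFEEVVKASGMPV
print(translate(last_line))   # VPRLKSLVVSIENE
```

The last sequence line can be translated on its own because it begins at base 1141, which falls on a codon boundary. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content listed in the table.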