Gene SeHA_C1410 details


Gene Information

Locus tag: SeHA_C1410
Symbol:
ID: 6491706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 1365465
End bp: 1366751
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 52%
IMG OID: 642741642
Product: hypothetical protein
Protein accession: YP_002045289
Protein GI: 194450478
COG category: [S] Function unknown
COG ID: [COG2718] Uncharacterized conserved protein
TIGRFAM ID:
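
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal Python sketch of that relationship. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that Protein Length does not count the stop codon; the record does not state either convention, but both are consistent with the listed values. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1365465, 1366751)
print(length, protein_length_aa(length))
```

For this record the sketch gives 1287 bp and 428 aa, matching the Gene Length and Protein Length fields.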


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.123931
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 66
Fosmid unclonability p-value: 0.301036
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTGGT TCATAGACCG ACGTCTTAAC GGCAAAAATA AAAGCACGGT GAATCGCCAG 
CGCTTTTTGC GCCGTTATAA AGCACAAATT AAGCAGTCGA TTTCCGAAGC GATTAATAAA
CGCTCTGTGA CCGATGTCGA CAGCGGAGAG TCCGTCTCTA TTCCAACCGA TGATATTAGC
GAACCGATGT TTCATCAGGG GCGCGGCGGT CTGCGCCATC GCGTCCATCC GGGTAACGAT
CACTTTATCC AGAATGATCG CATTGAGCGT CCGCAAGGCG GTGGCGGCGG CGGTTCCGGC
AGCGGTCAAG GTCAGGCCAG CCAGGACGGC GAAGGCCAGG ATGAGTTTGT TTTTCAGATT
TCAAAAGATG AGTATCTGGA TCTGCTCTTT GAAGATTTAG CGCTGCCTAA TCTGAAGAAA
AACCAGCATC GCCAGCTTAA CGAGTATAAA ACTCACCGCG CCGGTTTCAC CTCAAACGGC
GTACCGGCCA ATATCAGCGT GGTACGTTCG CTACAAAACT CTCTGGCGCG CCGTACAGCA
ATGACGGCAG GAAAACGCCG CGAACTGCAC GCGCTGGAAA CGGAACTGGA GACCATCAGC
CACAGCGAAC CAGCGCAACT GCTTGAAGAG GAGCGGTTAC GTCGGGAGAT TGCCGAACTA
CGGGCTAAAA TCGAGCGAGT GCCGTTTATC GACACCTTTG ATTTACGCTA TAAAAATTAT
GAAAAACGGC CTGAGCCCTC CAGCCAGGCG GTGATGTTCT GTCTGATGGA CGTCTCGGGG
TCGATGGACC AGGCAACCAA AGATATGGCC AAGCGTTTTT ACATTCTGCT CTATCTGTTT
TTGAGCCGAA CGTATAAGAA CGTAGAGGTG GTTTATATCC GCCACCATAC CCAGGCGAAG
GAAGTGGACG AACATGAGTT CTTTTATTCG CAAGAGACCG GCGGGACGAT TGTCTCCAGC
GCGCTTAAAC TCATGGACGA AGTGGTTAAA GAGCGCTACG ACCCCGGTCA GTGGAACATC
TATGCGGCGC AAGCGTCAGA CGGTGATAAC TGGGCCGACG ATTCACCGCT GTGTCATGAG
ATTCTGGCGA AAAAGCTGCT GCCGGTAGTG CGCTATTACA GCTATATCGA GATTACCCGC
CGCGCCCACC AGACCTTATG GCGCGAGTAT GAACATCTGC AGGCGACGTT CGATAACTTC
GCCATGCAGC ATATTCGCGA TCAGGAGGAT ATTTATCCGG TATTCCGCGA ATTGTTTCAG
AAACAGAGCG CCAATCAAAG TGCATAA
 
Protein sequence
MTWFIDRRLN GKNKSTVNRQ RFLRRYKAQI KQSISEAINK RSVTDVDSGE SVSIPTDDIS 
EPMFHQGRGG LRHRVHPGND HFIQNDRIER PQGGGGGGSG SGQGQASQDG EGQDEFVFQI
SKDEYLDLLF EDLALPNLKK NQHRQLNEYK THRAGFTSNG VPANISVVRS LQNSLARRTA
MTAGKRRELH ALETELETIS HSEPAQLLEE ERLRREIAEL RAKIERVPFI DTFDLRYKNY
EKRPEPSSQA VMFCLMDVSG SMDQATKDMA KRFYILLYLF LSRTYKNVEV VYIRHHTQAK
EVDEHEFFYS QETGGTIVSS ALKLMDEVVK ERYDPGQWNI YAAQASDGDN WADDSPLCHE
ILAKKLLPVV RYYSYIEITR RAHQTLWREY EHLQATFDNF AMQHIRDQED IYPVFRELFQ
KQSANQSA
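
The gene and protein listings are linked by translation table 11. The following is a minimal Python sketch of that translation, assuming the gene listing is the coding strand read in frame 1. It uses the standard codon assignments, which table 11 shares for elongation codons; the alternative start codons of table 11 are not modeled here. The helper names `clean`, `translate`, and `gc_fraction` are illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon-table order: first base varies slowest, third fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and newlines that group the listing into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate frame 1 of a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGACCTGGT TCATAGACCG ACGTCTTAAC"))  # MTWFIDRRLN
```

The printed residues match the first block of the protein listing. Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content field.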