Gene SeHA_C3075 details

Gene Information

Locus tag: SeHA_C3075
Symbol:
ID: 6492029
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 3003628
End bp: 3005409
Gene Length: 1782 bp
Protein Length: 593 aa
Translation table: 11
GC content: 52%
IMG OID: 642743226
Product: cell invasion protein SipB
Protein accession: YP_002046845
Protein GI: 194449012
COG category:
COG ID:
TIGRFAM ID:
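The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information fields above.
START_BP = 3003628
END_BP = 3005409


def gene_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1782 593
```

Under these assumptions the computed values agree with the listed Gene Length (1782 bp) and Protein Length (593 aa).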


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 74
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTAAATG ACGCAAGTAG CATTAGCCGT AGCGGATATA CCCAAAATCC GCGCCTCGCT 
GAGGCGGCTT TTGAAGGCGT TCGTAAGAAC ACGGACTTTT TAAAAGCGGC GGATAAAGCT
TTTAAAGATG TGGTGGCAAC GAAAGCGGGC GACCTTAAAG CCGGAACAAA GTCCGGCGAG
AGCGCTATTA ATACGGTGGG TCTAAAGCCG CCTACGGACG CCGCCCGGGA AAAACTCTCC
AGCGAAGGGC AATTGACATT ACTGCTTGGC AAGTTAATGA CCCTACTGGG CGATGTTTCG
CTGTCTCAAC TGGAGTCTCG TCTGGCGGTA TGGCAGGCGA TGATTGAGTC ACAAAAAGAG
ATGGGGATTC AGGTATCGAA AGAATTCCAG ACGGCTCTGG GAGAGGCTCA GGAGGCGACG
GATCTCTATG AAGCCAGTAT CAAAAAGACG GATACCGCCA AGAGTGTTTA TGACGCTGCG
ACCAAAAAAC TGACGCAGGC GCAAAATAAA TTGCAATCGC TGGACCCGGC TGACCCCGGC
TATGCACAAG CTGAAGCCGC GGTAGAACAG GCCGGAAAAG AAGCGACAGA GGCGAAAGAG
GCCTTAGATA AGGCCACGGA TGCGACGGTT AAAGCAGGCA CAGACGCCAA AGCGAAAGCC
GAGAAAGCGG ATAACATTCT GACCAAATTC CAGGGAACGG CTAATGCCGC CTCTCAGAAT
CAGGTTTCCC AGGGTGAGCA GGATAATCTG TCTAATGTCG CCCGCCTCAC TATGCTCATG
GCCATGTTTA TTGAGATTGT GGGCAAAAAT ACGGAAGAAA GCCTGCAAAA CGATCTTGCG
CTTTTCAACG CCTTGCAGGA AGGGCGTCAG GCGGAGATGG AAAAGAAATC GGCTGAATTC
CAGGAAGAGA CGCGCAAAGC CGAGGAAACG AACCGCATTA TGGGATGTAT CGGGAAAGTC
CTCGGCGCGC TGCTAACCAT TGTCAGCGTT GTGGCAGCTG TTTTTACCGG TGGGGCGAGT
CTGGCGCTGG CTGCGGTGGG ACTTGCGGTA ATGGTGGCCG ATGAAATTGT GAAGGCGGCG
ACGGGGGTGT CGTTTATTCA GCAGGCGCTA AACCCGATTA TGGAGCATGT GCTGAAGCCA
TTAATGGAGC TGATTGGCAA GGCGATTACC AAAGCGCTGG AAGGATTAGG CGTCGATAAG
AAAACGGCAG AGATGGCAGG CAGCATTGTT GGTGCGATTG TTGCCGCTAT TGCGATGGTA
GCGGTCATTG TGGTGGTCGC AGTTGTCGGG AAAGGCGCGG CGGCGAAACT GGGTAACGCG
CTGAGCAAAA TGATGGGCGA AACGATTAAG AAGTTGGTGC CTAACGTGCT GAAACAGTTG
GCGCAAAACG GCAGCAAACT CTTTACCCAG GGGATGCAAC GTATTACTAG CGGCCTGGGT
AATGTGGGTA GCAAGATGGG CCTGCAAACG AATGCCTTAA GTAAAGAGCT GGTAGGTAAT
ACCCTAAATA AAGTGGCGTT GGGCATGGAA GTCACGAATA CCGCAGCCCA GTCAGCCGGT
GGTGTTGCCG AGGGCGTATT TATTAAAAAT GCCAGCGAGG CGCTTGCTGA TTTTATGCTC
GCCCGTTTTG CCATGGATCA GATTCAGCAG TGGCTTAAAC AATCCGTAGA AATATTTGGT
GAAAACCAGA AGGTAACGGC GGAACTGCAA AAAGCCATGT CTTCTGCGGT ACAGCAAAAT
GCGGATGCTT CGCGTTTTAT TCTGCGCCAG AGTCGCGCAT AA
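The GC content reported in the Gene Information section is the fraction of G and C bases in the gene sequence. The sketch below shows one way to compute it from the space-separated 10-base blocks used in this listing. The function name `gc_fraction` is hypothetical, and only the first line of the sequence is used as sample input, so the printed value is for that line alone and is not expected to equal the whole-gene figure.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G/C bases, ignoring whitespace between 10-base blocks."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First line of the gene sequence above, copied verbatim.
first_line = "ATGGTAAATG ACGCAAGTAG CATTAGCCGT AGCGGATATA CCCAAAATCC GCGCCTCGCT"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full 1782-base sequence to the same function should give the whole-gene value for comparison with the GC content field.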
 
Protein sequence
MVNDASSISR SGYTQNPRLA EAAFEGVRKN TDFLKAADKA FKDVVATKAG DLKAGTKSGE 
SAINTVGLKP PTDAAREKLS SEGQLTLLLG KLMTLLGDVS LSQLESRLAV WQAMIESQKE
MGIQVSKEFQ TALGEAQEAT DLYEASIKKT DTAKSVYDAA TKKLTQAQNK LQSLDPADPG
YAQAEAAVEQ AGKEATEAKE ALDKATDATV KAGTDAKAKA EKADNILTKF QGTANAASQN
QVSQGEQDNL SNVARLTMLM AMFIEIVGKN TEESLQNDLA LFNALQEGRQ AEMEKKSAEF
QEETRKAEET NRIMGCIGKV LGALLTIVSV VAAVFTGGAS LALAAVGLAV MVADEIVKAA
TGVSFIQQAL NPIMEHVLKP LMELIGKAIT KALEGLGVDK KTAEMAGSIV GAIVAAIAMV
AVIVVVAVVG KGAAAKLGNA LSKMMGETIK KLVPNVLKQL AQNGSKLFTQ GMQRITSGLG
NVGSKMGLQT NALSKELVGN TLNKVALGME VTNTAAQSAG GVAEGVFIKN ASEALADFML
ARFAMDQIQQ WLKQSVEIFG ENQKVTAELQ KAMSSAVQQN ADASRFILRQ SRA
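The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is a minimal illustration, not the database's own pipeline. It builds the codon table from the usual TCAG ordering; for translation table 11 the codon-to-amino-acid assignments match the standard code (the tables differ in which start codons are allowed, which this sketch does not model). The name `translate` is hypothetical, and only the first and last lines of the gene sequence are used as input. Both lines start on a codon boundary because every full line holds 60 bases.

```python
from itertools import product

# Codon table in the conventional TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence above, copied verbatim.
first_line = "ATGGTAAATG ACGCAAGTAG CATTAGCCGT AGCGGATATA CCCAAAATCC GCGCCTCGCT"
last_line = "GCGGATGCTT CGCGTTTTAT TCTGCGCCAG AGTCGCGCAT AA"

print(translate(first_line))  # prints: MVNDASSISRSGYTQNPRLA
print(translate(last_line))   # prints: ADASRFILRQSRA
```

The two outputs match the first 20 and the last 13 residues of the protein sequence listed above, and the final codon of the gene (TAA) is a stop codon.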