Gene SeHA_C3470 details


Gene Information

Locus tag: SeHA_C3470
Symbol:
ID: 6487822
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 3372207
End bp: 3373910
Gene Length: 1704 bp
Protein Length: 567 aa
Translation table: 11
GC content: 58%
IMG OID: 642743599
Product: phage protein
Protein accession: YP_002047213
Protein GI: 194447335
COG category:
COG ID:
TIGRFAM ID:
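
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length_bp(3372207, 3373910)
print(length)                     # 1704
print(protein_length_aa(length))  # 567
```

Both values agree with the Gene Length and Protein Length fields of this record.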


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.0000394685
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATGAAAC CAGTAAAACG CCTTTACCTT TCAACGGATG AAATACACCT GGCTGACGCC 
AGTCTGGTGC TGGAGCTGAA CAGCTGTGGA CGTGGCTTTA TTACGGCACA GACAACCACA
GACTACACCG GCAAACTGGT ACGGCTGGAT GTGGGGTATT CCGGTTTACT TCTGCGCTGG
TTTACCGGCT ATGTGGAGCG CTCACAGCCT GCCGAAAACG GTTATCAGCG TCTGTTTGTC
CGCGAGCTGG CTGGCGTGTT TGAGCGGATG TGGCCATGCT CATTTCAGCA TCCCACACTG
CGCGATGTGG CCGGATGGCT GGAGGAAAAC AGCGGGATCA GCATTGCGGT ACCGGATGTG
CCGTACAGTG ATAAACCGAT CCCCCATTTC ACCCATAACG GGACGGGATA CCAGCTGCTG
AATAACCTGG GCAGGGCATT CAGTATCACG GATTACATCT GGTATCCATT GCCGGATGGT
TCGCTGTATG TCGGCGGCGC AGAAAAGGCG CTGTTTGCCG GACGCCCGGT AGAAATCCCG
GCAGAGTTCA GCCAGGGAAC GGCGGGCGGT AATTCCATGA CATTGCCGGT GATCCAGAGT
CTTCGTCCGG GCGTGGACGT GAACGGGGAA CGCGTGACCA AAGTTCATCT GACGAATGAC
ACAATGACCA TCACCTGGAC ACCACGGAAC CGCGCCACAG GTCAGCCATT GCAGAAAACA
CCGGCGCAGC GTCAGATAGA AAACCATTAC CCGGAACTGG CTTCAGGTCT TCACCTGCCC
AAACTGGCCA GGGTGGTGGC ACCCAGCGAG GCCGTAAAAA GCGGTAATTT TGCCGACCCG
TTCCGGCCAC GGTACGCTGT TGACGTGCAG CTGCTTGACG CGGACGGCAA CCCGGACAAC
CAGACGCCGG TATATTCCGC CGTACCGCTG CCAGTGCCAA TGGCCGGTAA CGATTCGGGA
ATGTTCCAGT TTCCACCGGA AGGAACGCTG GTAGAGGTGG CGTTTACGGG CGGCAGGCCG
GATAAGCCCT TTATCAGGCA GACGCTGCCG GATGGCACCA GTCTGCCGGA CATTAAGCCC
GGCGAACAGC TGCAACAGCA GCGGGCGGAA GTCTCGCAAC GCGTGACACA GGCAGGAGAC
TGGGTACGCC AGACGGATCA GACCATCAGT GAAACATCGA TGGCGCGGAC GGTGAAAGCC
GATACGGAAC GGCGCGAACT GGTCAGCCGT GAAACCACGG TGAAAGCCAC GGATAAAATC
ACAGTACTGG GTACCGCCAC GCTGATGGCT GGAGCCATAC AGCAGGTCAG CGCTGGCGAC
TTCAGCCAGG CGGTAAAAGG AAACCGGCTG GCCAGTATTA CAGGAAATGA AGAAACCGAA
ATCGCCGGGC AGCAGTCCAC GAAAGTGGCC GGTGCCATGA ATGTTGATGT GGGGGGAACC
CTGACAGAAA AGATTGCCGC ATTGCGTAAG TCGGTGGCAT CGGGCGGTCA GCAAATTATG
GGGCCAACCG TCCATATTGG CAGTGAGAGC GTCAACACAC TGACCATGAT GCTGGACACC
ATTGATTTAC TGGCAGAGCT GGCGCAGCAG TGCGCGAGCC ATTCACACCC CAGTGTTGGC
ACGCCGACCA ATGCCGGAGC ATTCAACCAG ACGGCAGTAA AGGCCGGGCA GACCCGGAGC
AAGTACCAGA ACATCATCGC CTGA
 
Protein sequence
MMKPVKRLYL STDEIHLADA SLVLELNSCG RGFITAQTTT DYTGKLVRLD VGYSGLLLRW 
FTGYVERSQP AENGYQRLFV RELAGVFERM WPCSFQHPTL RDVAGWLEEN SGISIAVPDV
PYSDKPIPHF THNGTGYQLL NNLGRAFSIT DYIWYPLPDG SLYVGGAEKA LFAGRPVEIP
AEFSQGTAGG NSMTLPVIQS LRPGVDVNGE RVTKVHLTND TMTITWTPRN RATGQPLQKT
PAQRQIENHY PELASGLHLP KLARVVAPSE AVKSGNFADP FRPRYAVDVQ LLDADGNPDN
QTPVYSAVPL PVPMAGNDSG MFQFPPEGTL VEVAFTGGRP DKPFIRQTLP DGTSLPDIKP
GEQLQQQRAE VSQRVTQAGD WVRQTDQTIS ETSMARTVKA DTERRELVSR ETTVKATDKI
TVLGTATLMA GAIQQVSAGD FSQAVKGNRL ASITGNEETE IAGQQSTKVA GAMNVDVGGT
LTEKIAALRK SVASGGQQIM GPTVHIGSES VNTLTMMLDT IDLLAELAQQ CASHSHPSVG
TPTNAGAFNQ TAVKAGQTRS KYQNIIA
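
The relationship between the gene sequence and the protein sequence can be sketched as follows. This is an illustrative helper, not the method the database itself uses: it builds the codon assignments shared by the standard code and translation table 11, strips the 10-base spacing used in the listing, and translates up to the first stop codon. The names `clean`, `translate`, and `gc_percent` are hypothetical. Table 11 also permits alternative start codons, which this sketch ignores because the gene begins with ATG.

```python
# Codon assignments in NCBI "TCAG" order; identical for tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGATGAAAC CAGTAAAACG CCTTTACCTT"))  # MMKPVKRLYL
```

Passing the full gene sequence to `translate` should give the protein sequence listed above, and `gc_percent` can be used to compare against the reported GC content.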