Gene SeHA_C4002 details


Gene Information

Locus tag: SeHA_C4002
Symbol:
ID: 6487717
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 3878991
End bp: 3880946
Gene Length: 1956 bp
Protein Length: 651 aa
Translation table: 11
GC content: 57%
IMG OID: 642744103
Product: hypothetical protein
Protein accession: YP_002047708
Protein GI: 194451278
COG category: [S] Function unknown
COG ID: [COG3533] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
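
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the gene record above.
START_BP = 3878991
END_BP = 3880946


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1956
print(protein_length(length_bp))  # 651
```

Both results match the Gene Length and Protein Length fields of the record.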


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 69
Fosmid unclonability p-value: 0.563325
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGTAC TGGAAGTCGA TCTGCATAAA CTGACGGTCA GCGATCCGTT CCTCGGACAG 
TATCAACAAC TGGTTCGCGA TGTGGTTATT CCTTACCAGT GGGATGCGTT AAACGATCGT
ATTCCAGAGG CTGAACCCAG CCATGCCATT GAAAATTTCC GCATTGCCGC AGGCCAGCAG
ACGGGCGACT TTTACGGCAT GGTCTTTCAG GACAGCGACG TGGCGAAATG GCTGGAAGCG
GTTGCCTGGT CACTGTGCCA GAAGCCCGAT CCCGCGCTTG AGAAAACCGC CGATGAGGTG
ATTGAACTGG TGGCCGCCGC GCAGTGTGAC GATGGCTATC TCAATACGTA CTTTACGGCA
AAAGCCCCGC AAGAACGCTG GAGCAACCTG GCGGAGTGCC ACGAGCTTTA TTGCGCCGGG
CATCTGATTG AAGCAGGCGT CGCCTTCTTT CAGGCCACCG GTAAGCGTCG GCTGCTAGAC
GTCGTTTGTC GCCTGGCCGA TCATATCGAC AACACTTTCG GCCCTGGCGA AAATCAGCTG
CACGGCTATC CGGGCCACCC GGAAATTGAG CTGGCGCTGA TGCGTCTGTA TGAGGTAACA
GAGCAGCCGC GCTATATGGC GCTGGCAAGC TACTTTATCG GACAGCGCGG CACCCAACCA
CACTTCTACG ACGAAGAGTA CGAAAAACGC GGCCAAACCT CTTACTGGCA TACCTACGGC
CCGGCGTGGA TGGTCAAAGA CAAAGCCTAC AGCCAGGCGC ATCTGCCAAT TTCGCAGCAG
CAGACGGCCA TTGGTCACGC GGTACGTTTT GTCTATCTGA TGACTGGCGT GGCGCATCTC
GCTCGCCTGA GCAACGATGA AGGCAAACGC CAGGACTGCC TGCGTCTGTG GAAAAATATG
GCGCAGCGTC AGCTGTATAT CACCGGAGGC ATCGGTTCAC AGAGCAGTGG GGAAGCCTTT
AGCAGCGATT ACGATTTACC GAATGATTCG GTCTATGCGG AAAGCTGCGC TTCAATCGGC
CTGATGATGT TCGCCCGCCG GATGCTGGAA ATGGAAGCCG ATAGTCAGTA CGCCGACGTG
ATGGAGCGCG CGCTGTACAA CACCGTCCTC GGCGGTATGG CGCTCGATGG CAAGCATTTC
TTCTACGTCA ACCCGCTGGA AGTGCATCCA AGATCGTTAA AATTCAACCA TATTTACGAT
CACGTTAAGC CCATCCGCCA GCGCTGGTTT GGCTGCGCCT GCTGCCCGCC GAACATCGCC
CGCGTGCTCA CCTCTCTCGG TCACTACATC TACACGCCGC GTGCGGATGC GCTGTATATC
AATATGTACG TGGGTAACAG CCTGGAAGTA CCCGTTGAAA ATGGCGCGCT CAAACTGCGG
ATTGGCGGGA ACTACCCGTG GCATGAGCAG GTGAAGATTG CCATCGACTC TGTGCAGCCG
GTACGTCACA CGCTGGCGCT ACGCCTGCCG GACTGGTGCC CTGAGGCAAA AGTGACGCTC
AACGGGCTGG AAGTGGAGCA GGATATTCGC AAAGGTTATC TGCATATCCG TCGGACCTGG
CAGGAGGGCG ATACGATAAC CCTGACGCTG CCGATGCCGG TTCGCCGCGT GTATGGCAAT
CCGCTGGCGC GTCACGTCGC CGGTAAGGTC GCCATTCAGC GCGGGCCGCT GGTCTATTGC
CTTGAGCAGG CCGATAACGG CGAAGAACTG CATAATCTGT GGTTACCGAA AGAGAGTGAG
TTCCGGGTCT TTGAGGGCAA AGGGCTTTTT GCGCATAAGA TGCTGATTCA GGCTGAAGGC
GAGAAGCAAA GCGCCCCAGA TGCGCAGCAT CAGGCGTTGT GGCATTACGA CAACGCGCCA
TCATCGCGCC AGCCGCAGAC GCTAACGTTC ATTCCGTGGT TTAGCTGGGC CAACCGTGGC
GAGGGCGAAA TGCGGATTTG GGTTAACGAG CGGTAA
 
Protein sequence
MNVLEVDLHK LTVSDPFLGQ YQQLVRDVVI PYQWDALNDR IPEAEPSHAI ENFRIAAGQQ 
TGDFYGMVFQ DSDVAKWLEA VAWSLCQKPD PALEKTADEV IELVAAAQCD DGYLNTYFTA
KAPQERWSNL AECHELYCAG HLIEAGVAFF QATGKRRLLD VVCRLADHID NTFGPGENQL
HGYPGHPEIE LALMRLYEVT EQPRYMALAS YFIGQRGTQP HFYDEEYEKR GQTSYWHTYG
PAWMVKDKAY SQAHLPISQQ QTAIGHAVRF VYLMTGVAHL ARLSNDEGKR QDCLRLWKNM
AQRQLYITGG IGSQSSGEAF SSDYDLPNDS VYAESCASIG LMMFARRMLE MEADSQYADV
MERALYNTVL GGMALDGKHF FYVNPLEVHP RSLKFNHIYD HVKPIRQRWF GCACCPPNIA
RVLTSLGHYI YTPRADALYI NMYVGNSLEV PVENGALKLR IGGNYPWHEQ VKIAIDSVQP
VRHTLALRLP DWCPEAKVTL NGLEVEQDIR KGYLHIRRTW QEGDTITLTL PMPVRRVYGN
PLARHVAGKV AIQRGPLVYC LEQADNGEEL HNLWLPKESE FRVFEGKGLF AHKMLIQAEG
EKQSAPDAQH QALWHYDNAP SSRQPQTLTF IPWFSWANRG EGEMRIWVNE R
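
The protein sequence above is the translation of the gene sequence. A minimal sketch of that relationship follows, using only the first 30 and last 6 nucleotides of the gene. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as starts); `CODON_TABLE` and `translate` are illustrative names, not a library API.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, in TCAG order.
# Assumed here to apply to translation table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-nt blocks of the gene sequence, copied from the record.
first_30_nt = "ATGAACGTAC TGGAAGTCGA TCTGCATAAA"
print(translate(first_30_nt))  # MNVLEVDLHK

# Last 6 nt of the gene sequence: one codon followed by the stop codon.
print(translate("CGGTAA"))     # R
```

The first output matches the first ten residues of the protein sequence, and the second matches its final residue.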