Gene SeHA_C0021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C0021 
Symbol 
ID6490703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp20059 
End bp23055 
Gene Length2997 bp 
Protein Length998 aa 
Translation table11 
GC content55% 
IMG OID642740315 
Productputative hydroxymethyltransferase 
Protein accessionYP_002043989 
Protein GI194449860 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones81 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATTT CCATCCATGC CAGCGCATTT GACGTCAACA GCTGGTATCA AAAAATCACC 
TTAACCTTCA TCAATGAGAG CGGTAATGCG GTCGATATGA ACCATGCCGC AATATCATTC
ACGGCTTCCG GGCACATCGA TCCATGGGGA AATAGCGGCG GTACGCTCAA AGGGAACCTG
CCGCTTACGC TGAATGATAG TTCGTATGGC ACGCTGGAAA CTAACAACAT CATCATTAAT
AACAGCGATG CATTACTTCT TCAGCCGGGC GAACGCGGGA CGCTCTCTTT CAGCCTCGCG
GCGACGCAGG TGCCGGTAAA AATGTCCGCC ATCACCTTGA CGCTGGCGTC ATCGTCATCC
GAAGACGCAG AGTCTGAAAC CCCATCCGAT CAGGAGACGC CAGCGATACC CGCCGCAGAC
GAACAACCCG CCGAACCCGA TGTGCCGGAA AAGGACAATG ACCTTCAGGA ACGCGGCCTT
ACGCTTAACG TTAGCGAGTT GAATACCGCG AGTTGGTATC AACGCGTCAC CTTTACGCTG
ACCAACCTCT ACGCTCAGGC GGTAGATCTC AATCAGCTTC AACTGAATTT TACGGCCAGC
GCGCACCCCG ATCCCTACAG TCCGTTTCAG GGAACAATGC TGGGGAATCA GGCCGTGACG
CTGGCCAGCG ATGGGGGATG GCCCATCGAG AAGAATACCA TCACCATTAA TCATGACGGC
GCGCTGATGC TGGCGGCGGG GGATACCGTC GAATTACAGT GCTATCTGGC CGCCACGCAG
ATGCCAGTTG CCATCAGCGA TTTGAGCGCG ACGTTGGCCC ATGACCCTGC CCGTCAGGGA
AAAATTTGTG TTCATTTTCC TGCCATGACG CAGACCGTGG CGCTCAAACC GGCGATTGAG
CTGCTGTTTC CTGCCGGCGA AACCCGGCGC TTTGTCGGTG AGTGGGGAGA GGTTCTGACA
ATAAGCGATC TTAGCGCAGG AACGTATCGG CTTACCGTAC CGGTACTGGC GAATGATGAG
ATGCAAATCG CGCCAGTCGA GTCCTCTTTT ACCGTTACGC TGCAATCCGG CGATGCCGCC
GCGCAGGTCC AGGTATCCTG TCTGCCGATT GTCCGTTATG CCAGCGCGCG TCTGATGATT
GACGCCCCTG CGCTTGGTAA TGCGAAATTG ACCGTTGATA TCGCTGATAC TACGCAAGCG
GATGAGCGTA CCGTCACGCT TATCGCCAAC CAACCGCAGT TAATCACCCG GCTACTGGCG
GGGCATCATT ATACGGTCAA TCTGCAGCCT GCGATGATTA ATAACCGCTT TATATCGGCA
CCCATACAGC TTACGGGGTT TATCCCTGCT GCGGCGCAGG TTGCCGAGGT TGCTGTCGCT
TACCAACAGT CAGCGCTCGA CACGGCGAGT TTCGTGACGG TGGATGCCAC TCTACTGGGC
CTGCCCGATG GCGTCGCGCC GCAGCGTTAT CTGTTCAGCA GCGGTAAATA TCAGTACTCA
TTAATGCTGG AGAGCGGCAG CGATCGGCAG ACGTTGGCAT TACGCTTTGC GCCCGGGCTG
TATGATGTTC AGACGGACGA TATTTTCATC GACAGCGTGC CGTGGCGTTG TGAACAGGCC
GGGCCGCTAC GGTTGTTGCA AAAGGTCAAC CATGTGGCGC TGGAGTTTCT GCCCGGCGTG
ACGCTACAGG TAAAAGGTTG GCCTGATTAC CTTGCTCATG GCGGCGTGAC GGTTAACGCG
CCAGAGACGG TTTCTCTTTA TCGCGATATA CCGTTTAGCG CGTTGTTTAA ATACGATGGT
TTTGACGGCG GCGGCGATCC GGTTCCGGCC GCGGAGGTTG ACGTGAACGG GGATGGTTTT
CTGGATTACG CGACGTTACC GATCCATAAA ACCGTTGCGC TGGTGCGCCA GATAGAAAAA
GAAGCCGGGC GTAGCGTCAT GCCGGTAATG GTCATTTATA CCGCGAATGC CAGCGGCGGT
AGCGCCCTGG CGGATTTACA GGATGCGCAA AAGCTACGTA ACCATTTTGG TAACTTTATT
ACCCAGTGTC TGGCGGCGCA GTCATACAAA GATGAGACGC ATCCTGTCCC GGCCACCTTT
GTGCTTAATC CGGATTTTCT CGGGGCGCTA CAGCAAGGAC CGTATGGCTA TACCGTAGTA
CGGCAAAAAA ACAGTGTGCC GGTGAATGCC CAACTGGCGG CGGCGATACA AGCTTTACCG
GCGATGGCTG GCTTTATCGC GCCTTCGTTG CCGACGTTTA GCGACGATCT CTACGGCTAT
ATTCAGGCGG TGAACTATCT TGTTCGTCAG TTTGCCCCGG ATGTGGCTTT TGGCTGGCAG
ACGAATGTCT GGGCGACAGG AACGGCGGAC TGGGTGCTGC GCGATACCGC TGATCCGGTA
GCTGAAGGGC AGGCGATCGC CGAATTTATT CATGAACTGG GCGTTTATAG CGGAGAATAT
GCGCCGGACT TTATTGCGTT TGATAAATTT GAGCGTGACT GTTTCAGTCC TGATGCGCTT
GCCCACTATG GCTGGAATGC GACATGCTGG CTTAATTACC TGGCGATGGT CAAACAGGTG
ACGAAATCGC TGCTGACGCC CGCCATGCTG TGGCAAATCC CTGGCGGCCA TATGCCTACA
GTAGAAGAGG GCGTCAGTAA AATCAGCGCT GCGCACTTTG CATCCGGCGG AACCTTTTTT
ATGGGGGACG CCCGCATTGG CAGCGATCCT GACACGCTCT CTTTGCAGCT ACTCAATACG
GCCTTAAATA GCGCGACTTA CGGCGTCCCG ACCGTCGGCG ACTTTTTACG TAAAGATAAA
GGGTATGACT GGGGCCAGAT GCAGGCGCTG AATCTACCGG ACTTTAACGT CTTTTCGATC
TTATGGGGCG GTGGTTCTAC TATCAGTATT ACGACAATCC ATTCTAACGG TGAAGACGGC
GGCTGGCTGG CGGATAAAAT GGTAGAGTAT TATGCTGCTC CACGCTATTT CAGATAA
 
Protein sequence
MTISIHASAF DVNSWYQKIT LTFINESGNA VDMNHAAISF TASGHIDPWG NSGGTLKGNL 
PLTLNDSSYG TLETNNIIIN NSDALLLQPG ERGTLSFSLA ATQVPVKMSA ITLTLASSSS
EDAESETPSD QETPAIPAAD EQPAEPDVPE KDNDLQERGL TLNVSELNTA SWYQRVTFTL
TNLYAQAVDL NQLQLNFTAS AHPDPYSPFQ GTMLGNQAVT LASDGGWPIE KNTITINHDG
ALMLAAGDTV ELQCYLAATQ MPVAISDLSA TLAHDPARQG KICVHFPAMT QTVALKPAIE
LLFPAGETRR FVGEWGEVLT ISDLSAGTYR LTVPVLANDE MQIAPVESSF TVTLQSGDAA
AQVQVSCLPI VRYASARLMI DAPALGNAKL TVDIADTTQA DERTVTLIAN QPQLITRLLA
GHHYTVNLQP AMINNRFISA PIQLTGFIPA AAQVAEVAVA YQQSALDTAS FVTVDATLLG
LPDGVAPQRY LFSSGKYQYS LMLESGSDRQ TLALRFAPGL YDVQTDDIFI DSVPWRCEQA
GPLRLLQKVN HVALEFLPGV TLQVKGWPDY LAHGGVTVNA PETVSLYRDI PFSALFKYDG
FDGGGDPVPA AEVDVNGDGF LDYATLPIHK TVALVRQIEK EAGRSVMPVM VIYTANASGG
SALADLQDAQ KLRNHFGNFI TQCLAAQSYK DETHPVPATF VLNPDFLGAL QQGPYGYTVV
RQKNSVPVNA QLAAAIQALP AMAGFIAPSL PTFSDDLYGY IQAVNYLVRQ FAPDVAFGWQ
TNVWATGTAD WVLRDTADPV AEGQAIAEFI HELGVYSGEY APDFIAFDKF ERDCFSPDAL
AHYGWNATCW LNYLAMVKQV TKSLLTPAML WQIPGGHMPT VEEGVSKISA AHFASGGTFF
MGDARIGSDP DTLSLQLLNT ALNSATYGVP TVGDFLRKDK GYDWGQMQAL NLPDFNVFSI
LWGGGSTISI TTIHSNGEDG GWLADKMVEY YAAPRYFR