Gene SeD_A0020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0020 
Symbol 
ID6871671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp20060 
End bp23056 
Gene Length2997 bp 
Protein Length998 aa 
Translation table11 
GC content55% 
IMG OID642783280 
Productputative hydroxymethyltransferase 
Protein accessionYP_002213974 
Protein GI198242908 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.130275 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATTT CCATTCACGC CAGCGCATTT GACGTCAACA GCTGGTATCA AAAAATCACC 
TTAACCTTCA TCAATGAGAG CGGTAATGCG GTCGATATGA ACCATGCCGC AATATCATTC
ACGGCTTCCG GGCACATCGA TCCATGGGGA AATAGCGGCG GTACGCTCAA AGGGAACCTG
CCGCTTACGC TGAATGAGAG TTCGTATGGC ACGCTGGAAA CTAACAACAT CATCATTAAT
AACAGCGATG CATTACTTCT TCAGCCGGGC GAACGCGGGA CGCTCTCTTT CAGCCTCGCG
GCGACGCAGG TGCCGGTAAA AATGTCCGCC ATCACCTTGA CGCTGGCGTC ATCGTCATCC
GAAGACGCAG AGTCTGAAAC CCCATCCGAT CAGGAGACGC CAGCGATACC CGCCGCAGAC
GAACAACTCG CCGAACCCGA TGTGCCGGAA AAGGACAATG ATCTTCAGGA ACACGGCCTT
ACGCTTAACG TTAGCGAGTT GAATACCGCA AGTTGGTATC AACGCGTCAC CTTTACGCTG
ACCAACCTCT ACGCCCAGGC GGTAGATCTC AATCAGCTTC AACTGAATTT TACGGCCAGC
GCGCACCCCG ATCCCTACAG TCCGTTTCAG GGAACAATGT TGGGGAATCA GGCCGTGACG
CTGGCCAGCG ATGGGGGATG GCCCATCGAG AAGAATACCA TCACCATTAA TCATGACGGC
GCGCTGATGC TGGCGGCGGG GGATACCGCC GAATTACAGT GCTATCTGGC CGCCACGCAG
ACGCCAGTTG CCATCAGCGA TTTGAACGCG ACGTTGGCCC ATGATCCTGC CCGTCAGGGA
AAAGTTTGCG TTCATTTTCC TGCCATGACG CAGACCGTGG CGCTCAAACC GGTGATTGAG
CTGCTGTTTC CTGCCGGCGA AACCCGGCGC TTTGTCGGTG AGTGGGGCGA GGTTCTGACA
ATAGGCGATC TTAGCGCAGG AACGTATCGG CTTACCGTAC CGATACTGGC GAATGATGAG
ATGCAAATCG CGCCAGTCGA GTCCTCTTTT ACCGTTACGC TGCAATCCGG CGATGCCGCC
GCGCAGGTCC AGGTATCCTG TCTGCCGATT GTCCGTTATG CCAGCGCGCG TCTGATGATT
GACGCCCCTG CGCTTGGTAA TGCGAAATTG ACCGTTGAGA TCGCCGATGC TACGCAAGCG
GATGAGCGTA CCGTCGCGCT GATCGCCAAC CAACCGCAGT TAATCACCCG GCTACTGGCG
GGGCATCATT ATACGGTCAA TCTGCAGCCT GCGATGATTA ATAACCGCTT TATATCGGCA
CCCATACAGC TTACGGGATT TATCCCTGCT GCGGCGCAGA TTGCCGAGGT TGCTGTCGCT
TACCAACAGT CGGCGCTCGA CACGGCGAGT TTCGTGACGG TGGATGCCAC TATACTGGGC
CTGCCCGATG GCGTCGCGCC GCAGCGTTAT CTGTTCAGCA GCGGTAAATA TCAGTACTCA
TTAATGCTGG AGAGCGGCAG CGATCGGCAG ACGTTGGCAT TACGCTTTGC GCCCGGGCTG
TATGATGTTC AGACGGACGA TATTTTCATC GACAGCGTGC CGTGGCGTTG TGAACAGGCC
GGGCCGCTAC GGTTGTTGCA AAAGGTCAAC CATGTGGCGC TGGAGTTTCT GCCCGGCGTG
ACGCTACAGG TAAAAGGTTG GCCTGATTAC CTTGCTCATG GCGGCGTGAC GGTTAACGCG
CCAGAGACGG TTTCTCTTTA TCGCGATATA CCGTTTAGCG CGTTGTTTAA ATACGATGGT
TTTGACGGCG GCGGCGATCC GGTTCCGGCC GCGGAGGTTG ACGTGAACGG GGATGGTTTT
CTGGATTACG CGACGTTACC GATCCATAAA ACCGTTGCGC TGGTGCGCCA GATAGAAAAA
GAAGCCGGGC GTAGCGTCAT GCCGGTAATG GTCATTTATA CCGCGAATGC CAGCGGCGGT
AGCGCGCTGG CGGATTTACA GGATGCGCAA AAGCTACGTA ACCATTTTGG TAACTTTATT
ACCCAGTGTC TGGCGGCGCA GTCATACAAA GATGAGACGC ATCCTGTCCC GGCCACCTTT
GTGCTTAACC CGGATTTTCT CGGGGCGCTA CAGCAAGGAC CGTATGGCTA TACCGTAGTA
CGGCAAAAAA ACAGTGTGCC GGTGAATGCC CAACTGGCGG CGGCGATACA AGCATTACCG
GCGATGGCTG GCTTTATCGT GCCTTCGTTG CCGACGTTTA GCGACGATCT CTACGGCTAT
ATTCAGGCGG TGAACTATCT TGTTCGTCAG TTTGCCCCGG ATGTGGCTTT TGGCTGGCAG
ACGAATGTCT GGGCGACAGG AACGGCGGAC TGGGTGCTGC GCGATACCGC TGATCCGGTA
GCTGAAGGGC AGGCGATCGC CGGATTTATT CATGAACTGG GCGTTTACAG CGGAGAATAT
GCGCCGGACT TTATTGCGTT TGATAAATTT GAGCGTGACT GTTTCAGTCC TGATGCGCTT
GCCCACTATG GCTGGAATGC GACATGCTGG CTTAATTACC TGGCGATGGT CAAACAGGTG
ACGAAAGCGC TGCTGACGCC CGCCATGCTG TGGCAAATCC CTGGCGGCCA TATGCCTACA
GTAGAAGAGG GCGTCAGTAA AATCAGCGCT GCGCACTTTG CATCCGGCGG AACCTTTTTT
ATGGGGGACG CCCGCATTGG CAGCGATCCT GACACGCTCT CTTTGCAGCT ACTCAATACG
GCCTTAAATA GCGCGACTTA CGGCGTCCCG ACCGTCGGCG ACTTTTTACG TAAAGATAAA
GGGTATGACT GGGGCCAGAT GCAGGCGCTG AATCTACCGG ACTTTAACGT CTTTTCGATC
TTATGGGGCG GTGGTTCTAC TATCAGTATT ACGACAATCC ATTCTAACGG TGAAGACGGC
GGCTGGCTGG CGGATAAAAT GGTAGAGTAT TATGCTGCCC CTCGCTATTT CAGATAA
 
Protein sequence
MTISIHASAF DVNSWYQKIT LTFINESGNA VDMNHAAISF TASGHIDPWG NSGGTLKGNL 
PLTLNESSYG TLETNNIIIN NSDALLLQPG ERGTLSFSLA ATQVPVKMSA ITLTLASSSS
EDAESETPSD QETPAIPAAD EQLAEPDVPE KDNDLQEHGL TLNVSELNTA SWYQRVTFTL
TNLYAQAVDL NQLQLNFTAS AHPDPYSPFQ GTMLGNQAVT LASDGGWPIE KNTITINHDG
ALMLAAGDTA ELQCYLAATQ TPVAISDLNA TLAHDPARQG KVCVHFPAMT QTVALKPVIE
LLFPAGETRR FVGEWGEVLT IGDLSAGTYR LTVPILANDE MQIAPVESSF TVTLQSGDAA
AQVQVSCLPI VRYASARLMI DAPALGNAKL TVEIADATQA DERTVALIAN QPQLITRLLA
GHHYTVNLQP AMINNRFISA PIQLTGFIPA AAQIAEVAVA YQQSALDTAS FVTVDATILG
LPDGVAPQRY LFSSGKYQYS LMLESGSDRQ TLALRFAPGL YDVQTDDIFI DSVPWRCEQA
GPLRLLQKVN HVALEFLPGV TLQVKGWPDY LAHGGVTVNA PETVSLYRDI PFSALFKYDG
FDGGGDPVPA AEVDVNGDGF LDYATLPIHK TVALVRQIEK EAGRSVMPVM VIYTANASGG
SALADLQDAQ KLRNHFGNFI TQCLAAQSYK DETHPVPATF VLNPDFLGAL QQGPYGYTVV
RQKNSVPVNA QLAAAIQALP AMAGFIVPSL PTFSDDLYGY IQAVNYLVRQ FAPDVAFGWQ
TNVWATGTAD WVLRDTADPV AEGQAIAGFI HELGVYSGEY APDFIAFDKF ERDCFSPDAL
AHYGWNATCW LNYLAMVKQV TKALLTPAML WQIPGGHMPT VEEGVSKISA AHFASGGTFF
MGDARIGSDP DTLSLQLLNT ALNSATYGVP TVGDFLRKDK GYDWGQMQAL NLPDFNVFSI
LWGGGSTISI TTIHSNGEDG GWLADKMVEY YAAPRYFR