Gene SeHA_C4703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4703 
Symbol 
ID6489730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp4584087 
End bp4585319 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content43% 
IMG OID642744762 
Productdeoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_002048339 
Protein GI194450873 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG
[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones101 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACTGCA ATGAATTAAA AGAATTTGCG CAAGAGCAGG AACAAAAACT TATCAGTGCG 
CTTGCTGATA CCAGAGAGCA CCCGTCCCCC GATGGTGAAA CAAGAAGCCG GGATAATTAT
CAACGGGATT ACGCGCGTAT TCTGTATTCA TCATCATTTC GCAGGCTGCA GGGAAAGATG
CAACTTTTTG AAATTGATCC TGAAAAATTC AACAGAAACA GATTAACGCA CAGTCTTGAG
GTTGCTCAGA TCGCCCGAAG TATCGCGTCA GACCTCAAAC TGATTAACCC GGTGGTGGTT
GAACTGGCAG CGCTGGCGCA TGACATCGGA AATCCTCCTT TCGGTCATTC CGGTGAAAAA
CTGCTTAATG AACTGTCTGA AGAAATCGGC GGCTATGAAG GTAATGCACA GGCGCTACGT
ATCCTGAGAA AACTGGAGAA GAAATTTTCA TACTGCAACG GATTAAATCT GACTCACCGT
AGTTTACTTT CGGTTGTTAA GTATCCCATC CCCCGCGCTG CAGCTACTGC CGGCAAGTTC
ATTTATGATG ATGATTACTA TTTTTACATT AACCTGCTTG CTGAAAATCA GCTCGATCTG
AATCCCGGAG AGAAAACGAT TGATGCGCAG ATAATGGATC TCGCGGATGA GATTGCCTAC
GCCGCGCATG ACCTGGAAGA TGCCCTTAGC AGGAACATGG TCACGATTGA AGATATTGAA
TATGAGTTTC AGATTTCTGA CGAATTCCGG GGAGCGAGGG AACAGTTCAG GGAAATCGTT
ACTCAGTCGA GAAACACCGC TTTTCAGGCT AACTTACTGA AAACCTCAGA AGAATTTGCC
ATCATATTCC GCAAGGAGTT AACTTCAAAT ATTGTCAATC GCCTCGTTGC AGATATTTCT
GTAGTAACGA ACCTGAATGG TTTTCAGGAA CTGGGGTTCG GGAAACTGAA TGCGTTATCC
GAAGGACTCA AAAAACTTCT CTTTAAAGTC ATCATGCGAA AACGTAATAT CCTCACCTAT
GAGTTCAGGG GAAATAAAAT AATCAGGGAT TTATATGACT TTTACAATGA AGGAGAGAAT
TATAAATTTC TGCCTCCTGA ACTTAAATTC ACCTTACCCC AACCAGATTC CTGTATATTT
GAAATCAGCA AAAAACGAGC AGTGGTTGAC TATATTTCAG GTATGATGGA TACATTTGCA
GTCAAGGAAT GGGAAACTCA CTGTCTGAAG TAA
 
Protein sequence
MYCNELKEFA QEQEQKLISA LADTREHPSP DGETRSRDNY QRDYARILYS SSFRRLQGKM 
QLFEIDPEKF NRNRLTHSLE VAQIARSIAS DLKLINPVVV ELAALAHDIG NPPFGHSGEK
LLNELSEEIG GYEGNAQALR ILRKLEKKFS YCNGLNLTHR SLLSVVKYPI PRAAATAGKF
IYDDDYYFYI NLLAENQLDL NPGEKTIDAQ IMDLADEIAY AAHDLEDALS RNMVTIEDIE
YEFQISDEFR GAREQFREIV TQSRNTAFQA NLLKTSEEFA IIFRKELTSN IVNRLVADIS
VVTNLNGFQE LGFGKLNALS EGLKKLLFKV IMRKRNILTY EFRGNKIIRD LYDFYNEGEN
YKFLPPELKF TLPQPDSCIF EISKKRAVVD YISGMMDTFA VKEWETHCLK