Gene SeHA_C4439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4439 
Symbol 
ID6489003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp4321160 
End bp4322716 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content51% 
IMG OID642744521 
Product5'-Nucleotidase domain-containing protein 
Protein accessionYP_002048110 
Protein GI194449759 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value0.088476 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTAA AACTGCTTGC TGCCGGTATT TTGTTCACGC TGCCGTTCTG GGCCTGCGCC 
AAAGATGTCA CCATTATTTA CACCAACGAT CTACACGCTC ATGTGGAGCC TTATAAAGTG
CCGTGGATTG CTGACGGTAA ACGCGATATT GGCGGTTGGG CCAATATCAC TACGCTGGTG
AAGCAGGAAA AAGCCAAAAA TAAGGCGACG TGGTTTTTTG ATGCCGGAGA TTACTTTACC
GGACCGTATA TCAGCAGCCT GACGAAGGGT AAAGCGAATA TCGATATTTT GAATACCATG
CAGTATGACG CTGCCACTAT CGGTAATCAT GAGTTCGATC ATGGCTGGGA CAATACATTG
TTGCAACTGA GCCGAGCAAA ATTCCCTATC GTACAGGGCA ATATTTTTTA TGAGGACAGC
AGTAAATCCT TCTGGGATAA GCCGTACACC ATTGTTGAAA AAGATGGCGT CAAGATTGGC
GTAATCGGCT TACACGGTGT CTTTGCTTTT AATGATACGG TTTCTGCCGC GACGCGCGTG
GGCATTGAGG CACGCGATGA AATTAAGTGG CTGCAACGTT ACATTGATGA ACTTAAAGGT
AAAGTCGATC TGACCGTCGC GCTGATCCAC GAAGGCACCC CGGCCCGCCA GTCCAGCATG
GGGAATACCG ATGTGCGACG CGCGCTGGAT AAAGATATTC AGACCGCAAG TCAGGTAAAA
GGGCTGGATA TTTTGATTAC CGGCCACGCG CATGTCGGTA CGCCGGAACC GATTAAAGTC
GGTAATACGC TGATTCTTTC AACGGACAGC GGCGGCATTG ATGTGGGTAA ACTGGTGCTG
GATTACAAAG AGAAACCACA CCACTTTACG GTGAAGAACT TCGAGCTGAA GACCATTTTT
GCTGATGAGT GGAAGCCCGA TCCGCAAACG AAACAGGTGA TCGACGGCTG GAATAAAAAG
CTCGATAAAG TCGTGCAGCA GACGGTGGCG CAATCGCCGG TTGAGCTGAC CCGCGCGTAT
GGCGAATCGT CGTCGCTGGG GAATCTGGCG GCGGATGCGC TGCTTTTTAC GGCGGGGAAA
GACACCCAGT TAGCGCTTAC TAACTCTGGC GGTATCCGCA ACGAAATCCC GGCTGGCGCG
GTGACGATGG GGGCGGTAAT CAGTACCTTC CCGTTCCCTA ATGAACTGGT CACGATGGAT
TTAACCGGTA AACAATTGCG CAGCCTGATG GAGCATGGCG CTGGATTAAG CAACGGCGTA
TTGCAGGTGT CTAAAGGGCT GGAGATGAAG TATGACAGCA GCAAACCTGT CGGCCAGCGG
GTTACCGTGC TGACGCTCAA TGGCAAACCG ATTGACGATG CTACGGTTTA TCATATTGCC
ACCAACAGCT TCCTTGCCGA CGGCGGCGAT GGTTTTGCGG CGTTCACGGA AGGCCAGGCG
CGGAATACCT CCGGCGGCTA CTATGTGTCG AATGCGATAG TTGATTACTT TAAGGCGGGC
AACACCATCA CGGATGAGCA GCTCAAAGGG ATGCGCGTTG CGGATGTGAA GAAGTAA
 
Protein sequence
MKVKLLAAGI LFTLPFWACA KDVTIIYTND LHAHVEPYKV PWIADGKRDI GGWANITTLV 
KQEKAKNKAT WFFDAGDYFT GPYISSLTKG KANIDILNTM QYDAATIGNH EFDHGWDNTL
LQLSRAKFPI VQGNIFYEDS SKSFWDKPYT IVEKDGVKIG VIGLHGVFAF NDTVSAATRV
GIEARDEIKW LQRYIDELKG KVDLTVALIH EGTPARQSSM GNTDVRRALD KDIQTASQVK
GLDILITGHA HVGTPEPIKV GNTLILSTDS GGIDVGKLVL DYKEKPHHFT VKNFELKTIF
ADEWKPDPQT KQVIDGWNKK LDKVVQQTVA QSPVELTRAY GESSSLGNLA ADALLFTAGK
DTQLALTNSG GIRNEIPAGA VTMGAVISTF PFPNELVTMD LTGKQLRSLM EHGAGLSNGV
LQVSKGLEMK YDSSKPVGQR VTVLTLNGKP IDDATVYHIA TNSFLADGGD GFAAFTEGQA
RNTSGGYYVS NAIVDYFKAG NTITDEQLKG MRVADVKK