Gene SeHA_C2985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C2985 
Symbol 
ID6489038 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp2924242 
End bp2925576 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content57% 
IMG OID642743141 
ProductGntR family transcriptional regulator 
Protein accessionYP_002046765 
Protein GI194447824 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value0.12333 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCGCT ATCAGCACAT CGCTCGTCAG TTAAAAACGG CCATTGAGCA AGGAGAACTC 
GCGCCCGGAA CGCGTTTGCC TTCCAGTCGG ACGTGGGCGC AGGAACTTGG CGTTTCTCGC
GCCACGGTGG AAAATGCCTA TGGCGAGCTG GTGGCGCAGG GCTGGCTGGA GCGACGTGGC
CAGGCAGGCA CGTTTGTGAG CAACGCTCTA CGGTTTGAGA CGGCGCCGCC GATACCCGCT
GTTTTTGCCG GAGAAAGTCC GGAACCGAAA CCCTTTCAGA TGGGGTTACC GGCGCTGGAT
CTCTTTCCAC GCGAGAAGTG GGCGCGAGTA ATGGGGCGTC GGTTGCGCAC GCAGACACGC
TTCGATCTGG CATTAGGCGA CGTCTGCGGC GAGGCGATTT TGCGCCAGGC GATAGTCGAT
TACCTGCGGG TTTCGCGTAG CATTGAATGT CTGCCGGAAC AGGTATTTAT TACCTCCGGA
TATGCGGATT CTATGCGGCT AATCCTGCGT ACATTGTCTG TGCCGGGAGA CAGCATGTGG
GTGGAAGATC CCGGCTTTCC GTTAATTCGC CCGGTGATAA CGCAGGAGGG GATTACGCTG
GCGCCGATTC CGGTCGATGC CGATGGGCTG AATGTCGCGG CGGGGATGCG GGATTGCCCG
CAGGGACGCT TTGCATTGGT GACGCCCGCC CATCAAAGTC CGTTGGGGGT GGCGCTGTCG
TTAACTCGCC GACGGCAACT TCTGGCATGG GCGGCGAATG TGCAGGCCTG GATTATTGAA
GATGACTACG ACAGCGAATT TCGTTATCAC GGTAAACCGC TTCCGCCGCT CAAGAGTCTG
GATGCCCCGC AGCGAGTGAT TTACGCCGGA ACGTTCAGTA AGTCGCTCTT TCCGGCATTA
CGTACCGCCT GGCTGGTGGT GCCGATAAAG CAGATTGAGC ATTTCCGCCA GCAGGCATCG
CTGATGCCCT GTAGCGTACC GTTGTTATGG CAGCACACGC TGGCTGATTT TATCCGTGAT
GGCCATTTCT GGCGGCATCT GAAAAAGATG CGCCAACATT ATGCTCAGCG ACGGTTATGG
ATTGAAGAGG CGCTGGCAGA ACAGGGATTT GTCGTGACAT TACAGAAAGG CGGTATTCAA
TTGGTTATTG AGGTTGAAGG TGATGATAAA GCGCAGGTAG CAAAAGCGAA TCAGGCCGGA
CTGGCGGTAC AGGCGCTAAG CCGTTGGCGA GTGGTTTCGT CAGGAAAGGG GGGCATTTTA
CTGTCGTTTA CCAATATTAC TTCCGCTGGC ATGGCGAAAC AGGTCGCATG TCAGCTTCGA
CAGGCGATAC AGTAA
 
Protein sequence
MPRYQHIARQ LKTAIEQGEL APGTRLPSSR TWAQELGVSR ATVENAYGEL VAQGWLERRG 
QAGTFVSNAL RFETAPPIPA VFAGESPEPK PFQMGLPALD LFPREKWARV MGRRLRTQTR
FDLALGDVCG EAILRQAIVD YLRVSRSIEC LPEQVFITSG YADSMRLILR TLSVPGDSMW
VEDPGFPLIR PVITQEGITL APIPVDADGL NVAAGMRDCP QGRFALVTPA HQSPLGVALS
LTRRRQLLAW AANVQAWIIE DDYDSEFRYH GKPLPPLKSL DAPQRVIYAG TFSKSLFPAL
RTAWLVVPIK QIEHFRQQAS LMPCSVPLLW QHTLADFIRD GHFWRHLKKM RQHYAQRRLW
IEEALAEQGF VVTLQKGGIQ LVIEVEGDDK AQVAKANQAG LAVQALSRWR VVSSGKGGIL
LSFTNITSAG MAKQVACQLR QAIQ