Gene SeHA_C2049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C2049 
Symbol 
ID6492099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp1992420 
End bp1993703 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content52% 
IMG OID642742250 
Producthypothetical protein 
Protein accessionYP_002045893 
Protein GI194450916 
COG category[S] Function unknown 
COG ID[COG2995] Uncharacterized paraquat-inducible protein A 
TIGRFAM ID[TIGR00155] integral membrane protein, PqiA family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones90 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCTGA ACACCTCACA CGTCACGCCA ACAAAAAAGC TAACGATCAG GTCAATTAGC 
GAAGCGCTGC CGCGCAGCCA CTACCAGCGC TGCCCTGAAT GCGACATGCT GTTCAGCTTG
CCGGAGATGA GCGCTCATCA AAGCGCTTAT TGTCCTCGTT GCCAGGCCAA AATTCGCGAT
GGGCGCGACT GGTCGCTGAC GCGGCTGACC GCGATGGCAG TAACCATGCT GCTATTGATG
CCGTTTGCCT GGAGCGAACC GCTACTCCAT ATCTACCTGT TGGGCGTACG CATTGATGCC
AATGTGATGC ACGGCATCTG GCAAATGACG CAGCAGGGCG ATCCGTTAAC CGCCGCAATG
GTGCTCTTTT GCGTGGTGGG CGCGCCGCTT ATTCTGGTTT TTTCAATTGC TTATCTGTGG
TTTGGCAGCC TTCTCGGCAT GAATCTGCGT CCAGTCCTGC TGATGCTGGA AAAACTGAAA
GAGTGGGTGA TGCTGGACAT CTATCTGGTC GGTATTGGCG TTGCCTCTAT CAAAGTGCAG
GACTATGCCT TTCTGCAGCC GGGCATCGGG CTTTTAGCGT TCGTCTCGTT GGTGGTTCTT
AGCATTCTGA CTATGATTCA TCTGAATGTG GAGCAACTAT GGGAACGATT TTATCCGCAG
CGCCCTGCTC AACGTGCGGA CGAAAGATTG CGCGTCTGTC TTGGCTGCCA CTTTAGCGGC
TATCCGGATG CGAAAGGACG CTGCCCGCGT TGTCATATTC CGCTACGGTT ACGCAGAAAA
CAGAGCATAC AGAAGTGTTG GGCGGCCTTG CTGGCGTCTA TTGTCTTTTT GCTGCCGGCA
AACCTGCTGC CTATCTCGGT AATCTACATT AATGGCGGGC GTCAGGAAGA TACTATCCTG
TCGGGCATTA TGTCGCTTGC CAGCAGCAAT ATCGCCGTCG CCGCCGTCGT TTTTATCGCC
AGTATTTTGG TGCCGTTTAC CAAAGTCATC GTGATGTTTA CGCTACTGTT GAGTATCCAT
TTTAAATGCC AACAGGGACT GCGGACGCGA ATTCTGTTGC TGCGTCTGGT GACATGGATA
GGCCGCTGGT CGATGCTGGA TCTTTTCGTT ATCTCGTTAA CCATGTCTCT GATTAATCGC
GATCAGATTC TGGCTTTTAC TATGGGACCG GCTGCGTTTT ATTTCGGCGC AGCGGTAATT
TTGACTATTC TTGCAGTGGA ATGGCTGGAT AGCCGCTTAC TTTGGGATGC ACATGAGTCA
GGAAACGCCC GCTTCGAAGA CTGA
 
Protein sequence
MALNTSHVTP TKKLTIRSIS EALPRSHYQR CPECDMLFSL PEMSAHQSAY CPRCQAKIRD 
GRDWSLTRLT AMAVTMLLLM PFAWSEPLLH IYLLGVRIDA NVMHGIWQMT QQGDPLTAAM
VLFCVVGAPL ILVFSIAYLW FGSLLGMNLR PVLLMLEKLK EWVMLDIYLV GIGVASIKVQ
DYAFLQPGIG LLAFVSLVVL SILTMIHLNV EQLWERFYPQ RPAQRADERL RVCLGCHFSG
YPDAKGRCPR CHIPLRLRRK QSIQKCWAAL LASIVFLLPA NLLPISVIYI NGGRQEDTIL
SGIMSLASSN IAVAAVVFIA SILVPFTKVI VMFTLLLSIH FKCQQGLRTR ILLLRLVTWI
GRWSMLDLFV ISLTMSLINR DQILAFTMGP AAFYFGAAVI LTILAVEWLD SRLLWDAHES
GNARFED