Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C0331 |
Symbol | |
ID | 6492017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 341675 |
End bp | 346381 |
Gene Length | 4707 bp |
Protein Length | 1568 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642740608 |
Product | Rhs family protein |
Protein accession | YP_002044276 |
Protein GI | 194451156 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAGAAG CATTCTGGGC GGCAAGGGAA GGCGACGCGC TGCTGCATAC GTCCTTTCTG GCAGATCTGG TGGGCAGTGC GCTGGAATTC GCCATCAATG CGGTTATTGA TTTTGCGGCA CTGGCTGTTG TGGCGCTGGC GACGGGGGCT ACGGTGGCAA CGCTGGGATG CAGCGCCGTA TTACTTGTGG GGACCGTGGT CGGCGCGACG ATGTTGCTGA GTGGCGCCGG GGAGAAAATC AGTAAAGCCT GTGAAGACAT AGCCAACAGC CTTTTTCCGC CGAAAATTGA GGGATATATC CTGACCGGCT CAGGCGATAC CCGGATTAAC AGCAAACGGG CGGCGCGGGC GGCGGCAACG GCATTATCAC GCCATGATGT AGAAACCCTG GATGCGCAGG CGCAGGCAGA GGCAGAAGAA GAAAAACGCA GGCAGGACGC CAAATCAATG TGGGATGTGG CCGGGGAATA TCTGGATGCA GTAAAAGGCC GGGCAGAGAA TATGGCTAAG GGCGTCGTTT CCGTGGTGAG CCGGATGGGA TCAGCGGAAG GATGGGCCTC GCTGGGAAAT GATGCTCTGG CAGAGGCGGG CGATATGCTT TCCGACACGG GCCACTTTAT TTCAGAGATG TGGCAGCCCA CGGTGGCAAC GGCATATCCG GGGTCTGACC CGAAACAGGA TGATAAAATC GACTGCCACA AACATCCCTC TTCATTCACC CAGTTTATGG CACAGAAACT GCAGGCGCTG ACGGATGACC CGGTGGGGAC GGTACTGGGG GCGATGAACA CTTTCGATGT GCTGGAAGCC GGTTTTCAGG CGGCAAGCGC GCTTATCGGC AGCGTGTCGA ACCTGTTCAA AGGCGACGAT GAGCCGCCCG CAGCAGAATA TATCGCCGAA GGGACGCGGG ATGTGCGTAT CAACAGCCAG CCTGCGGCGC GCAGCGGGGT ACGCTGTACC TGCGAGGCAA AAGTGGTGGA TGAGCCTGAG AACGGCGTAC ATGTATCCGG TGATGTACGT ATCGGCGGGC CGCTTCTGGT GGTACGGGAT ATCAAAAGCG GTAAGTCACA AATAACCCTG GTGACCACCA TTGCGCTGAC GTTCATGCAG CCCGGACGGG CGTCAGCTAA AAGTGCCTGC TTTATGATGG GATTGGGCAT TAACATGATG GTTCAGAAGG CGGGGAGTGC GCTGAACCGC CCGGTAAACG CCGCCACCGG AGCCAAGTAC CTGGCAGGCG ATGATGACGT TGATTTCAGT CTGCCCGGCC ACTTCCCGCT GGAGTGGCAG CGGACCTACA GCAGCCGTGA TGAACGGACG GAGGGGATGT TCGGGCGGGG CTGGAGCGTG CTGTATGAAG TGTGCCTGGA GCGTACGCCG GACAACCCGG ATGAAAACTG CATGACGTAT GTATCCCCAA TGGGACGGCG AATTGACCTG CAGGCGGTGG AGCCGGGGAG CGGTTTTTAC AGCCCCGGCG AGGGGCTGGC AGTGCGGCGC AGCGAACAGG GCCACTGGCT TATCAGCAGT GATGACGGCG TGTACAGGCT GTTTGAAGCG GACCCGTTCA GCCCACAGCG TCGGCGGCTG AAAATGCTGG GCGACCGCAA CAGTAACTGC CAGCACCTGA CCTACGACAA CCACGGGCGT CTGGTGGAAA TCAGCGGCGA TCGGCAGCGC CCCTGCATCC GGCTGCACTA TGAGCTGGCA GCGCACCCGC AGCGCGTGAC GCGCATCTTC CGCCATTACC CGGAAGGGGA GCCGGAGCTG CTGCGGCGTT ACCGCTACGA TGAGGCGGGG CGGCTGAACG GGGTGGTGGA CAACGCAGGT CAGTATCAGC GCGAGTTTGC GTATGACGAC AACGACTGCA TGACGATGCA CCGGGAGCCG GGCGGCGAAC GGTATTACTA CTCCTGGGCG TGGTTTGAAG GCCCGGACGA TGCGGCGTGG AGGGTGACGG GCCATCATAC GGACAGCGGC GAGCAGTACC GTCTGGACTG GAATCTGGCA GAACGTTCGC TGTGCGTGAC GGATAGTCTG GGGCGTACGC GCTGCCACTG GTGGGATGCG CAGGGCCTGG TGACGGCGTA CCGGGACGAG GCCGGGCAGA TGACCACTTT CCGCTGGAGC GATGAAGAGC GGTTACTGCT GGGGATGACG GACGCGCAGG GCGGCAAATG GCGTTATGTC TATGACCGTC TCGGCCATCT GACGGAGACG CATGACCCGC TGGGCCGGGT TGAGCAGACG CAGTGGCATC CGGTGTGGCA CCAGCCGGAA ACGGAGGTGG ATGCCGCGGG GGCGGCGTGG CGTTATGAGT ATGATGAGCG GGGCAACCTG CAGGCGGTCA GCGACCCGCT GCACCAGCGC ACGGTATACG GGTATGACCG GCACGGCCAG GTGGTGCGGA TAACCGACGC GCGGGGCGGA GATAAATACC TGCAGTGGAA CGAAGACGGG CAGCTTATGC GCCACACGGA CTGTTCTGGC TCGCAGACGG CATGGTTTTA TGATGAACGC ACGCGGCTGG AAAGGGTGAC GGACGCGGAG AGTAACAGTA CGCGTTACAG CTATGACGGC AACGGACATC TGACGGAGGT CATGTTCGCG GACGGGCGTA CGGAGCGTTA CCAGCCGGAT GCGGCGGGAC GGCTGGTGAA ATACACCAGC CCGGCGGGGC AGATAACACG CTGGCAGCGG GACGGCCAGG GGCGGGTGCG CAGGCAGACG GATGCGACGG GGCGCAGGAC GGCGTATGAG TACGACGCTT ACGGGCGGCT GACCACGCTC ACGAACGAGA ACGGGGAAAG CTACCGGTTC CGGTACGATG TTCTGGACCG GGTGACGGAA CAGACGGACC CCGGCGGCAG CCGCCGGGCA TACGGGTATA ACGCGCTGAA TGCGGTGACG GCGGTGATAT ACGGCGGGGA GCGCGGGGGA GAAATCCGCC ACGGTCTGGA GCGTGATGCG GCGGGGAGGC TGACGGTGAA AATCACGCCG GAGACGCGCA CGGAATACCG GTACGACGCG GCGGACCGTC TGCTGGAAAT CCGCCGCAGG CAGCATGATG CGGCGGAAGG CGGAGAGCCG GAAGTTATCC GGTTCAGCTA CGACAGTGCG GGTAACCTGC TGAGCGAGGA GACGGCGCAG GGCGTGCTGC AGAACCGGTA CGATGTTCAG GGCAACCGCA CAGAAACGCA GATGCCGGAC GGGCGGACGC TGCGGTACCT GTACTACGGG AGCGGCCATC TCCAGCAAAT CAACCTGGGG CGTGATGTCA TCAGCGAGTT CACGCGCGAC CACCTGCACC GTGAGGTGCA GCGGAGCCAG GGGCGGCTGG ACACGCGGCG GATGTACGAC CGGACGGGCC GGTTAACGCG GAAACTGACC TGTAAAGGAA TGCGCGGTGT GGTGCCGGAG ACGTTTATCG ACCGGGAATA TGCGTACAGC GGCCAGGATG AGCTGCTGAA AAAGCGGCAC AGCCGGCAGG GGGTGACGGA TTATTTTTAC GACACGACGG GGCGCATCAC GGCGTGCCGG AATGAGGCAT ACCTGGACAG CTGGCAGTAC GACGCGGCGG CGAACCTGCT GGACAGGCGG CAGGGAGAGA CCGCGCAGGC GGGTGCAGGC AGCGTGGTGC CGTTCAACCG GATAACGTCA TACCGTGGGC TGCATTACCG TTACGATGAA TATGGCCGGG TGGTGGAAAA GCGGGGCCGC AACGGTACGC AGCACTATCG CTGGGATGCG GAGCACCGGC TGACGGAAGT GGCGGTCATC CGGGGGAGCA CCGTACGGCG TTACGGGTAC GTGTACGACG CGCCGGGCAG GCGGGTGGAG AAGCACGAAC TGGACGCGGA AGGAAAGCCG TATAACCGGA CGACGTTTTT ATGGGACGGA ATGCGGCTGG CGCAGGAGTG CAGGCTGGGG AGAAGCAGCA GCCTGTATAT CTACAGCGAC CAGGGGAGCC ACGAGCCGCT GGCGCGGGTG GACAGGGCGG CGCCGGGCGA AGCGGATGAG GTGCTGTATT ACCATACGGA CGTAAACGGC GCGCCGGAGG AGATGACGGA CGGCGGGGGC AATATTGTCT GGGAAGCGGG CTATCAGGTA TGGGGGAACC TGACGCATGA AAAAGAAACC CGGCCCGTAC AGCAGAACCT GCGTTTCCAG GGGCAATATC TGGACAGGGA AACGGGGCTG CATTACAATT TGTACAGATT TTATGATCCG GATATCGGGA AGTTTATATC GGGCGATCCA ATCTCGCTGA GGGGCGGAAT AAACTTATAT GCGTATGCAC AGAATCCGTT AAGCTGGATC GATCCGCTGG GATTGACAGG GGAGTGGGTT AATCCCAAAG ATATTAATTT CTCTCAAAGA ACAATATCAC CTCATGATTA TGCCGAAATT ATGCGTAATG GTGGTTGGGA TTGGGATAGG TCTCCATTAA GAGTAATTGA TATTGATGGG CAATTAGTCT CTTATGATAA TAGGCGATTA GACGCAGCTC TAGAGGCCGG ATTAGATAAG GTTAAAGTTA TTAGGATTGA CCCTAATGCT CCTCATCCTG ACTCATCTAC TGGAAAAACA TGGTTGCAAA AATTTCGAGA GAGATTTAGA GACAGAAGAA ATATAAAGGC CGGTGGAATA GTACCAGATA AAGGATTAAA TTCTCGACCG GAAAGAACAT CAAGAGGTTG CAAATGA
|
Protein sequence | MGEAFWAARE GDALLHTSFL ADLVGSALEF AINAVIDFAA LAVVALATGA TVATLGCSAV LLVGTVVGAT MLLSGAGEKI SKACEDIANS LFPPKIEGYI LTGSGDTRIN SKRAARAAAT ALSRHDVETL DAQAQAEAEE EKRRQDAKSM WDVAGEYLDA VKGRAENMAK GVVSVVSRMG SAEGWASLGN DALAEAGDML SDTGHFISEM WQPTVATAYP GSDPKQDDKI DCHKHPSSFT QFMAQKLQAL TDDPVGTVLG AMNTFDVLEA GFQAASALIG SVSNLFKGDD EPPAAEYIAE GTRDVRINSQ PAARSGVRCT CEAKVVDEPE NGVHVSGDVR IGGPLLVVRD IKSGKSQITL VTTIALTFMQ PGRASAKSAC FMMGLGINMM VQKAGSALNR PVNAATGAKY LAGDDDVDFS LPGHFPLEWQ RTYSSRDERT EGMFGRGWSV LYEVCLERTP DNPDENCMTY VSPMGRRIDL QAVEPGSGFY SPGEGLAVRR SEQGHWLISS DDGVYRLFEA DPFSPQRRRL KMLGDRNSNC QHLTYDNHGR LVEISGDRQR PCIRLHYELA AHPQRVTRIF RHYPEGEPEL LRRYRYDEAG RLNGVVDNAG QYQREFAYDD NDCMTMHREP GGERYYYSWA WFEGPDDAAW RVTGHHTDSG EQYRLDWNLA ERSLCVTDSL GRTRCHWWDA QGLVTAYRDE AGQMTTFRWS DEERLLLGMT DAQGGKWRYV YDRLGHLTET HDPLGRVEQT QWHPVWHQPE TEVDAAGAAW RYEYDERGNL QAVSDPLHQR TVYGYDRHGQ VVRITDARGG DKYLQWNEDG QLMRHTDCSG SQTAWFYDER TRLERVTDAE SNSTRYSYDG NGHLTEVMFA DGRTERYQPD AAGRLVKYTS PAGQITRWQR DGQGRVRRQT DATGRRTAYE YDAYGRLTTL TNENGESYRF RYDVLDRVTE QTDPGGSRRA YGYNALNAVT AVIYGGERGG EIRHGLERDA AGRLTVKITP ETRTEYRYDA ADRLLEIRRR QHDAAEGGEP EVIRFSYDSA GNLLSEETAQ GVLQNRYDVQ GNRTETQMPD GRTLRYLYYG SGHLQQINLG RDVISEFTRD HLHREVQRSQ GRLDTRRMYD RTGRLTRKLT CKGMRGVVPE TFIDREYAYS GQDELLKKRH SRQGVTDYFY DTTGRITACR NEAYLDSWQY DAAANLLDRR QGETAQAGAG SVVPFNRITS YRGLHYRYDE YGRVVEKRGR NGTQHYRWDA EHRLTEVAVI RGSTVRRYGY VYDAPGRRVE KHELDAEGKP YNRTTFLWDG MRLAQECRLG RSSSLYIYSD QGSHEPLARV DRAAPGEADE VLYYHTDVNG APEEMTDGGG NIVWEAGYQV WGNLTHEKET RPVQQNLRFQ GQYLDRETGL HYNLYRFYDP DIGKFISGDP ISLRGGINLY AYAQNPLSWI DPLGLTGEWV NPKDINFSQR TISPHDYAEI MRNGGWDWDR SPLRVIDIDG QLVSYDNRRL DAALEAGLDK VKVIRIDPNA PHPDSSTGKT WLQKFRERFR DRRNIKAGGI VPDKGLNSRP ERTSRGCK
|
| |