Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A0317 |
Symbol | |
ID | 6872593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 337906 |
End bp | 340680 |
Gene Length | 2775 bp |
Protein Length | 924 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642783558 |
Product | Rhs family protein |
Protein accession | YP_002214246 |
Protein GI | 198244314 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.325486 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 0.374096 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATGC ACCGGGAGCC GGGCGGCGAA CGGTATTACT ACACCTGGGC GTGGTTTGAA GGCCCGGACG ATGCGGCGTG GCGGGTGACG GGCCATCATA CGGACAGCGG CGAGCAGTAT CGTCTGGACT GGAATCTGGC AGAACGTTCG CTGTGCGTGA CGGATAGTCT GGGGCGTACG CGCTGCCACT GGTGGGATGC GCAGGGCCTG GTGACGGCGT ACCGGGACGA GGCCGGGCAG ATGACCACTT TCCGCTGGAG CGATGAAGAG CGGTTACTGC TGGGGATGAC GGACGCGCAG GGCGGCAAAT GGCGTTATGT CTATGACCGT CTCGGCCACC TGACGGAGAC GCATGACCCG CTGGGCCGGG TTGAGCAGAC GCAGTGGCAT CCGGTGTGGC ACCAGCCGGA AACGGAGGTG GATGCCGCGG GGGCGGCGTG GCGTTATGAG TATGATGAGC GGGGCAACCT GCAGGCGGTC AGCGACCCGC TGCACCAGCG CACGGTATAC GGGTATGACC GGCACGGCCA GGTGGTGCGG ATAACCGACG CGCGGGGCGG AGATAAATAC CTGCAGTGGA ACGAAGACGG GCAGCTTATG CGCCACACGG ACTGTTCTGG CTCGCAGACG GCATGGTTTT ATGATGAACG CACGCGGCTG GAAAGGGTGA CGGACGCGGA GAGTAACAGT ACGCGTTACA GCTATGACGG CAACGGACAT CTGACGGAGG TCATGTTCGC GGACGGGCGT ACGGAGCGTT ACCAGCCGGA TGCGGCGGGA CGGCTGGTGA AATACACCAG CCCGGCGGGG CAGATAACAC GCTGGCAGCG GGACGGCCAG GGGCGGGTGC GCAGGCAGAC GGATGCGACG GGTCGCAGGA CGGCGTATGA GTACGACGCT TACGGGCGGC TGACCACGCT CACGAACGAG AACGGGGAAA GCTACCGGTT CCGGTACGAT GTTCTGGGCC GGGTGACGGA ACAGACGGAC CCCGGCGGCA GCCGCCGGGT ATACGGGTAT AACGCGCTGA ATGCGGTGAC GGCAGTGATA TACGGCGGGG AGCGCGGGGG CGAAATCCGC CACGGTCTGG AGCGTGATGC GGCGGGGAGG CTGACGGCGA AAATCACGCC GGAGACGCGC ACGGAATACC GGTACGACGC GGCGGACCGT CTGCTGGAAA TCCGCCGCAG GCAGCATGAT GCGGCGGAAG GCGGAGAGCC GGAAGTTATC CGGTTCAGCT ACGACAGTGC GGGTAACCTG CTGAGCGAGG AGACGGCGCA GGGCGTGCTG CAGAACCGGT ACGATGTTCA GGGCAACCGC ACAGAAACGC AGATGCCGGA TGGGCGGACG CTGCGGTACC TGTACTACGG GAGCGGCCAT CTCCAGCAAA TCAACCTGGG GCGTGATGTC ATCAGCGAGT TCACGCGTGA CCACCTGCAC CGTGAGGTGC AGCGGAGCCA GGGGCGGCTG GACACGCGGC GGATGTACGA CCGGACGGGC CGGTTAACGC GGAAACTGAC CTGTAAAGGA ATGCGCGGTG TGGTGCCGGA GACGTTTATC GACCGGGAAT ATGCGTACAG CGGCCAGGAT GAGCTGCTGA AAAAGCGGCA CAGCCGGCAG GGGGTGACGG ATTATTTTTA CGACACGACG GGGCGCATCA CGGCGTGCCG GAATGAGGCA TACCTGGACA GCTGGCAGTA CGACGCGGCG GCGAACCTGC TGGACAGGCG GCAGGGAGAG ACCGCGCAGG CGGGTGCAGG CAGCGTGGTG CCGTTCAACC GGATAACGTC ATACCGTGGG CTGCATTACC GTTACGATGA ATATGGCCGG GTTGTGGAAA AGCGGGGCCG CAACGGCACG CAGCACTACC GCTGGGACGC GGAGCACCGG CTGACGGAAG TGGCGGTCAC CCGGGGGGAC ACCGTACGGC GTTACGGGTA CGTGTACGAC GCGCCGGGCA GGCGGGTGGA GAAGCACGAG CTGGACGCGG AAGGAAAGCC GTATAACCGG ACGACGTTTT TATGGGACGG AATGCGGCTG GCACAGGAGT GCAGGCTGGG AAGAAGCAGC AGCCTGTATA TCTACAGCGA CCAGGGGAGC CACGAGCCGC TGGCGCGGGT GGACAGGGCG GCGCCGGGCG AAGCGGATGA GGTGCTGTAT TACCATACGG ACGTAAACGG CGCGCCGGAG GAGATGACGG ACGGCGGGGG CAATATTGTC TGGGAAGCGG GCTATCAGGT ATGGGGGAAC CTGACGCATG AAAAAGAAAC CCGGCCCGTA CAGCAGAACC TGCGTTTCCA GGGGCAATAT CTGGACAGGG AAACGGGGCT GCATTACAAT TTGTACAGAT TTTATGATCC GGATATCGGG AAGTTTATAT CGGGCGATCC AATCTCGCTG AAGGGTGGAA TAAACTTATA TGCGTATGCA CCGAATCCTC TGTCATGGAT CGATCCTTTA GGTCTTAAAT GTGGATCTTC GTATGAGCAG GCCAGAAATA AAGCCCTCAA ATGGTTAGAA GAACGTGGAT TTAAAGCGGA GAGAGTTAAT ATAGGTAAGT TCGGTTCGAC AAGAGGAAAA CCTGTAGGAA TGACAACCGC CGATGGCAAA ACAGGATTTA GGATCGAATA TGATGAACGA AGTGGTGCCC ATATTAATGT CTTCAGTGGA AAGGATAAAG GCGAACATTT CTTATTTGAT GCAAGTGAAT CTATCGTAAC AAAACTCCAA AAATTATTTG ATCTTCCATC CAAACCTCAA AGGCCAATAT CATGA
|
Protein sequence | MTMHREPGGE RYYYTWAWFE GPDDAAWRVT GHHTDSGEQY RLDWNLAERS LCVTDSLGRT RCHWWDAQGL VTAYRDEAGQ MTTFRWSDEE RLLLGMTDAQ GGKWRYVYDR LGHLTETHDP LGRVEQTQWH PVWHQPETEV DAAGAAWRYE YDERGNLQAV SDPLHQRTVY GYDRHGQVVR ITDARGGDKY LQWNEDGQLM RHTDCSGSQT AWFYDERTRL ERVTDAESNS TRYSYDGNGH LTEVMFADGR TERYQPDAAG RLVKYTSPAG QITRWQRDGQ GRVRRQTDAT GRRTAYEYDA YGRLTTLTNE NGESYRFRYD VLGRVTEQTD PGGSRRVYGY NALNAVTAVI YGGERGGEIR HGLERDAAGR LTAKITPETR TEYRYDAADR LLEIRRRQHD AAEGGEPEVI RFSYDSAGNL LSEETAQGVL QNRYDVQGNR TETQMPDGRT LRYLYYGSGH LQQINLGRDV ISEFTRDHLH REVQRSQGRL DTRRMYDRTG RLTRKLTCKG MRGVVPETFI DREYAYSGQD ELLKKRHSRQ GVTDYFYDTT GRITACRNEA YLDSWQYDAA ANLLDRRQGE TAQAGAGSVV PFNRITSYRG LHYRYDEYGR VVEKRGRNGT QHYRWDAEHR LTEVAVTRGD TVRRYGYVYD APGRRVEKHE LDAEGKPYNR TTFLWDGMRL AQECRLGRSS SLYIYSDQGS HEPLARVDRA APGEADEVLY YHTDVNGAPE EMTDGGGNIV WEAGYQVWGN LTHEKETRPV QQNLRFQGQY LDRETGLHYN LYRFYDPDIG KFISGDPISL KGGINLYAYA PNPLSWIDPL GLKCGSSYEQ ARNKALKWLE ERGFKAERVN IGKFGSTRGK PVGMTTADGK TGFRIEYDER SGAHINVFSG KDKGEHFLFD ASESIVTKLQ KLFDLPSKPQ RPIS
|
| |