Gene Information |
Locus tag | SeD_A3643 |
Symbol | infB |
ID | 6872416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 3494737 |
End bp | 3497415 |
Gene Length | 2679 bp |
Protein Length | 892 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642786623 |
Product | translation initiation factor IF-2 |
Protein accession | YP_002217259 |
Protein GI | 198242350 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0532] Translation initiation factor 2 (IF-2; GTPase) |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain; [TIGR00487] translation initiation factor IF-2 |
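The coordinate and length fields in this record agree with each other, and the check is simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and assumes that the gene length counts the stop codon while the protein length does not; the record does not state either convention. The function names `gene_length` and `protein_length` are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon (assumed present).
    return gene_len_bp // 3 - 1


length = gene_length(3494737, 3497415)
print(length)                  # 2679
print(protein_length(length))  # 892
```

Under these assumptions the results match the Gene Length (2679 bp) and Protein Length (892 aa) fields. The strand does not affect the length calculation.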
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.102541 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 73 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGATG TAACCCTAAA AGCGCTGGCC GCAGAGAGAC AGGTTTCCGT GGATCGCCTG GTACAGCAGT TTGCTGATGC AGGTATCCGG AAATCTGCTG ACGACTCTGT GTCCGCACAA GAAAAACAGA CTTTACTGGC GCACCTGAAC CGCGAAGCGG TTTCTGGTCC CGATAAACTG ACGCTGCAGC GTAAGACGCG CAGCACCCTG AACATTCCAG GTACCGGTGG AAAAAGTAAA TCGGTACAAA TCGAAGTCCG CAAGAAGCGC ACCTTTGTGA AACGCGATCC GCAAGAGGCT GAACGCCTGG CCGCGGAAGA GCAGGCGCAG CGTGAAGCGG AAGAGCAAGC CCGTCGTGAG GCAGAAGAAC AGGCCAAACG CGAGGCGCAA CAAAAAGCTG AACGCGAGGC CGCAGAACAA GCTAAGCGTG AAGCCGCTGA AAAAGCGAAA CGTGAAGCTG CGGAAAAAGA CAAAGTGAGC AATCAACAGA CTGACGATAT GACCAAAACC GCCCAGGCCG AAAAAGCCCG CCGTGAGAAT GAAGCTGCAG AGCTGAAGCG TAAAGCGGAA GAAGAAGCAC GTCGCAAACT CGAAGAAGAA GCGCGTCGTG TAGCAGAAGA AGCTCGCCGC ATGGCGGAAG AAAACAAATG GACCGCAACG CCTGAGCCAG TAGAAGACAC CAGTGACTAT CATGTCACCA CTTCTCAGCA CGCTCGCCAG GCCGAAGACG AAAACGATCG TGAAGTTGAA GGCGGTCGTG GTCGTGGCCG TAATGCAAAA GCAGCGCGTC CGGCGAAAAA AGGCAAACAT GCCGAATCCA AAGCCGATCG CGAAGAAGCG CGTGCTGCGG TTCGCGGCGG TAAAGGCGGC AAGCGTAAAG GGTCTTCTTT ACAGCAAGGC TTCCAGAAGC CCGCTCAGGC CGTTAACCGT GACGTGGTGA TCGGTGAAAC CATCACCGTT GGCGAACTGG CGAACAAGAT GGCGGTGAAA GGTTCTCAGG TCATCAAAGC GATGATGAAG CTGGGCGCCA TGGCCACCAT CAACCAGGTT ATCGATCAGG AAACCGCACA GCTTGTTGCC GAAGAGATGG GCCACAAAGT TATCCTGCGT CGTGAAAACG AACTGGAAGA AGCGGTAATG AGCGACCGTG ATACCGGCGC TGCGGCTGAA CCACGCGCAC CGGTTGTGAC CATCATGGGT CACGTTGACC ACGGTAAAAC CTCTCTGCTG GACTACATTC GTTCCACGAA AGTAGCCTCT GGCGAAGCGG GCGGCATTAC CCAGCACATC GGTGCTTACC ACGTTGAAAC TGACAACGGG ATGATCACCT TCCTGGACAC CCCGGGCCAC GCCGCATTTA CTTCTATGCG TGCTCGCGGC GCCCAGGCAA CGGATATCGT GGTTCTGGTG GTAGCGGCAG ACGACGGCGT GATGCCGCAG ACTATCGAAG CTATCCAGCA CGCGAAAGCG GCAGGTGTGC CGGTCGTGGT TGCGGTGAAC AAAATCGATA AGCCAGAAGC CGATCCGGAT CGCGTTAAGA ACGAACTGTC CCAGTACGGC ATTCTGCCGG AAGAGTGGGG CGGCGAGAGC CAGTTCGTTC ACGTGTCTGC GAAAGCAGGT ACCGGTATCG ACGAACTGCT GGACGCTATC CTGCTGCAGG CCGAAGTTCT GGAACTGAAA GCGGTACGCA AAGGTATGGC GAGCGGTGCG GTTATCGAAT CCTTCCTGGA TAAGGGTCGT GGTCCGGTGG CTACCGTGCT GGTTCGTGAA GGTACGCTGC ATAAGGGCGA TATCGTGCTG 
TGTGGCTTCG AATACGGCCG CGTGCGTGCG ATGCGTAACG AACTGGGTCA GGAAGTGCTG GAAGCGGGTC CGTCCATTCC GGTGGAAATC CTCGGCCTGT CCGGCGTTCC GGCTGCGGGT GACGAAGTGA CCGTCGTTCG CGACGAGAAG AAAGCCCGTG AAGTTGCTCT GTATCGTCAG GGTAAATTCC GTGAAGTTAA ACTGGCGCGT CAACAGAAAT CTAAACTCGA GAACATGTTC GCCAACATGA CCGAAGGCGA AGTTCACGAA GTGAACATCG TGCTGAAGGC CGACGTACAG GGTTCTGTGG AAGCGATCTC CGACTCCTTG CTGAAACTGT CTACCGACGA AGTGAAAGTG AAGATCATCG GTTCTGGCGT AGGTGGTATC ACCGAAACCG ACGCGACCCT GGCTGCGGCG TCCAACGCCA TTCTGGTTGG CTTCAACGTT CGTGCCGATG CCTCTGCGCG TAAAGTGATC GAATCTGAAA GCCTGGATCT GCGTTACTAC TCCGTCATCT ATAACCTGAT CGACGAAGTG AAAGCGGCGA TGAGCGGTAT GCTGTCTCCG GAACTGAAAC AGCAGATCAT CGGTCTGGCT GAAGTGCGCG ATGTGTTCAA ATCGCCGAAA TTCGGCGCGA TCGCGGGCTG TATGGTTACC GAAGGTACGA TTAAACGCCA TAACCCAATC CGCGTTCTGC GCGACAACGT GGTTATCTAT GAAGGCGAGC TGGAATCCCT GCGCCGCTTC AAAGATGACG TTAACGAAGT CCGTAACGGC ATGGAATGTG GTATCGGCGT GAAGAACTAC AACGACGTTC GCGTTGGCGA TATGATTGAA GTGTTCGAGA TTATCGAGAT CCAACGTACC ATCGCTTAA
|
Protein sequence | MTDVTLKALA AERQVSVDRL VQQFADAGIR KSADDSVSAQ EKQTLLAHLN REAVSGPDKL TLQRKTRSTL NIPGTGGKSK SVQIEVRKKR TFVKRDPQEA ERLAAEEQAQ REAEEQARRE AEEQAKREAQ QKAEREAAEQ AKREAAEKAK REAAEKDKVS NQQTDDMTKT AQAEKARREN EAAELKRKAE EEARRKLEEE ARRVAEEARR MAEENKWTAT PEPVEDTSDY HVTTSQHARQ AEDENDREVE GGRGRGRNAK AARPAKKGKH AESKADREEA RAAVRGGKGG KRKGSSLQQG FQKPAQAVNR DVVIGETITV GELANKMAVK GSQVIKAMMK LGAMATINQV IDQETAQLVA EEMGHKVILR RENELEEAVM SDRDTGAAAE PRAPVVTIMG HVDHGKTSLL DYIRSTKVAS GEAGGITQHI GAYHVETDNG MITFLDTPGH AAFTSMRARG AQATDIVVLV VAADDGVMPQ TIEAIQHAKA AGVPVVVAVN KIDKPEADPD RVKNELSQYG ILPEEWGGES QFVHVSAKAG TGIDELLDAI LLQAEVLELK AVRKGMASGA VIESFLDKGR GPVATVLVRE GTLHKGDIVL CGFEYGRVRA MRNELGQEVL EAGPSIPVEI LGLSGVPAAG DEVTVVRDEK KAREVALYRQ GKFREVKLAR QQKSKLENMF ANMTEGEVHE VNIVLKADVQ GSVEAISDSL LKLSTDEVKV KIIGSGVGGI TETDATLAAA SNAILVGFNV RADASARKVI ESESLDLRYY SVIYNLIDEV KAAMSGMLSP ELKQQIIGLA EVRDVFKSPK FGAIAGCMVT EGTIKRHNPI RVLRDNVVIY EGELESLRRF KDDVNEVRNG MECGIGVKNY NDVRVGDMIE VFEIIEIQRT IA
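The sequences above are displayed in space-separated blocks of ten characters. The following minimal sketch shows one way to turn such a display into a contiguous string and to compute a GC fraction from it. The helper names `normalize_sequence` and `gc_fraction` are hypothetical. The example uses only the first two blocks of the gene sequence, so its value is not expected to equal the 55% reported for the whole gene.

```python
def normalize_sequence(display: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(display.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence shown above, used as a short example.
blocks = "ATGACAGATG TAACCCTAAA"
seq = normalize_sequence(blocks)
print(seq)               # ATGACAGATGTAACCCTAAA
print(gc_fraction(seq))  # 0.35
```

Running the same two functions over the full gene sequence, once its lines are joined, would give a figure that can be compared against the GC content field.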
|
| |