Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1416 |
Symbol | infB |
ID | 4446052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 1576415 |
End bp | 1579321 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639689227 |
Product | translation initiation factor IF-2 |
Protein accession | YP_830910 |
Protein GI | 116669977 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0532] Translation initiation factor 2 (IF-2; GTPase) |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR00487] translation initiation factor IF-2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.919668 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCCAAGG TCCGCGTACA TGAGCTTGCT AAAGAGCTCG GTATTACTTC CAAAGATGCA GTAACCAAAC TGCAGGAACT GGGCGAATTC GTTCGCTCTG CCTCTTCCAC CATTGAGGCC CCCGTTGTGA GGAAACTCCG CAACGCATAC CCCGCCGCGG GCGCTTCGAA GTCCGAAGCT CCCGCTGCAG CGCCCAAGGC GCCCGCCAGC CCCGCAGCTA CCCGACCGGC CCCCGCGCCG GGCCCGGCAG CACCCAAGGC TCCGGAACCC AAGGCAGAAG CTCCGGCTGC TGCCTCCGCT CCGTCGGCGC CTGCTCCGGC AGCACCTGCT CCCGCGGCCC CGGCTGCGGC AGCCTCTGCT CCGTCGGCGC CTGCTCCGGC CGCACCGTCC ACCGGTGCAA AGCCCGGCGC ACGCCCGGCC CCCAAGGCTG AAGCTCCGGC TGCCCCGGCC CGCTCCGGCG GCCAGGGCTC GGCACCCCGT CCGGGCGGCC CCCGTCCCGG CAACAACCCG TTCGCCACGT CCCAGGGCAT GCCCCGGGGC CGCGGCGGCG ACAACGAGCG TCCGCCGCGT CCGGGCAACA ACCCGTTCGC TCCTTCCCAG GGCATGCCCC GTCCGGGCGG AAGCCGCACC GAGGGCGAAC GCCCCGGCGG CCCGCGTCCG GCAGCCGGCG CAGGAGGTCC CCGTCCGGGT GCTCCGCGTC CCGGCGGCAC CCAGGGTGCA CGTCCCGGCG CTCCGCGTCC GGCCGGTGCT CCCGGTGCAC GCCCCGGTGC AGGCGGCGGA AACCGTCCTA CTCCGGGCAT GATGCCTAAC CGCACCGAAC GACCCGCACC CGCTGGTGCA GGCCGTCCCG GCGGCGGAGG CCGCGGTCCC GGACGCCCCG GTGGCGCACC GGGTACCGGC GGCGCTCCCG GCGCCGGCGG CGGTGCTCCG GCCGGCGGTG GCTTCGGCAA GGGTGGCCGC GGTCGCGGTG GCACCCAGGG TGCCTTCGGT AAGGGCGGCG CAGGCCGTGG CAAGCAGCGC AAGTCGAAGC GTGCCAAGCG CCAGGAACTC GAGCAGATGA GTGCTCCGTC GCTGGGTGGC GTCAGTGTGC CCCGCGGCGA CGGCAACACC GTAGTCCGGC TCCGCCGCGG CTCGTCCATC ACGGACTTTG CCGACAAGAT CGAGGCAAAC CCCGCTGCAC TGGTGACCGT GCTCTTCCAC CTCGGCGAAA TGGCCACGGC CACGCAGTCG CTGGATGAAG AGACCTTCGC CCTGCTCGGC GAGGAGCTTG GCTACAAGCT CCAGGTCGTG TCGCCGGAGG ACGAGGAGCG CGAGCTGCTC TCCGGCTTCG ACATCGACTT CGACGCCGAG CTGGAGGCCG AAGGCGACGA GGAACTCGAG GCACGTCCTC CTGTTGTCAC CGTCATGGGT CACGTTGACC ACGGTAAGAC CCGCCTGCTG GATGCCATCC GTAACTCCGA CGTCGTCGCG GGTGAACACG GCGGCATCAC GCAGCACATT GGTGCTTACC AGATCACCAC CGAGCACGAG GGCGCCGAAC GCAAGATTAC GTTCATCGAT ACTCCGGGCC ACGAGGCGTT CACCGCCATG CGTGCCCGTG GTGCGAAGGT CACCGACATT GCCATCCTGG TGGTCGCAGC GGACGACGGC GTTATGCCGC AGACCGTTGA AGCCCTCAAC CACGCGCAGG CCGCCAACGT GCCGATCGTC GTGGCCGTGA ACAAGATCGA TAAGGAAGGC GCCAACCCGG ACAAGGTCCG CGGCCAGCTG ACCGAGTACG GGCTCGTTCC GGAAGAATAC GGTGGCGACA CCATGTTCGT GGAGGTCTCT GCACGCCAGA ACCTCAACAT CGACGAGCTG CTCGAGGCTG TTCTGCTCAC CGCAGACGCA GCCCTGGACA TGCGCGCCAA CCCGAACAAG GACGCCCGCG GTATTGCGAT CGAAGCCAAC CTGGACAAGG GCCGCGGTGC GGTTGCTACC GTCCTGGTTC AGTCCGGTAC GCTTCACGTC GGCGACACCA TCGTGGCAGG CACGGCCCAC GGCCGCGTCC GTGCGATGTT CGACGACGAC GGCAGCGTCC TGACCGAGGC CGGCCCGTCC CGCCCCGTGC AGGTACTGGG TCTGTCCAAC GTCCCGCGCG CCGGCGACAC CTTCTTTGTG ACCGCTGACG AGCGCACCGC CCGCCAGATC GCCGAGAAGC GTGAAGCAGC AGACCGCAAC GCCGCTTTGG CCAAGCGCCG CAAGCGCATC AGCCTGGAAG ACTTCGACCA GGCCGTCGCC GAAGGCAAGA TCGACACCCT CAACCTCATC CTCAAGGGTG ACGTGTCCGG TGCCGTGGAA GCCCTCGAAG ACGCGCTGCT CAAGATCGAC GTCGGCGAAG GTGTCCAGCT CCGCGTTATC CACCGCGGTG TCGGTGCGAT CACGCAGAAC GACGTCAACC TGGCAACGGT GGACAGCGCC GTCATCATCG GCTTCAACGT CAAGCCCGCC GAGCGTGTTG CCGAACTGGC AGACCGCGAA GGCGTGGACA TGCGCTTCTA CTCCGTCATC TACGCAGCAA TCGATGACAT TGAGATGGCC CTCAAGGGCA TGCTCAAGCC GGAGTACGAA GAAGTCCAGC TTGGCACCGC CGAGGTCCGC GAAGTGTTCC GTTCCTCCAA GTTCGGCAAC ATCGCCGGTT CCATCGTTCG CTCGGGTGTT ATCCGACGCA ACTCGAAGGC CCGTATCAGC CGCGACGGCA AGATCATCGG TGACAACCTC ACCGTTGAGA CGCTCAAGCG CTTCAAGGAC GACGCCACCG AGGTCCGCAC GGACTTCGAG TGTGGTATCG GTCTTGGCTC GTACAACGAC ATCAACGAGG GTGACATCAT CGAGACCTTC GAGATGCGCG AGAAGCCGCG CGTCTAG
|
Protein sequence | MAKVRVHELA KELGITSKDA VTKLQELGEF VRSASSTIEA PVVRKLRNAY PAAGASKSEA PAAAPKAPAS PAATRPAPAP GPAAPKAPEP KAEAPAAASA PSAPAPAAPA PAAPAAAASA PSAPAPAAPS TGAKPGARPA PKAEAPAAPA RSGGQGSAPR PGGPRPGNNP FATSQGMPRG RGGDNERPPR PGNNPFAPSQ GMPRPGGSRT EGERPGGPRP AAGAGGPRPG APRPGGTQGA RPGAPRPAGA PGARPGAGGG NRPTPGMMPN RTERPAPAGA GRPGGGGRGP GRPGGAPGTG GAPGAGGGAP AGGGFGKGGR GRGGTQGAFG KGGAGRGKQR KSKRAKRQEL EQMSAPSLGG VSVPRGDGNT VVRLRRGSSI TDFADKIEAN PAALVTVLFH LGEMATATQS LDEETFALLG EELGYKLQVV SPEDEERELL SGFDIDFDAE LEAEGDEELE ARPPVVTVMG HVDHGKTRLL DAIRNSDVVA GEHGGITQHI GAYQITTEHE GAERKITFID TPGHEAFTAM RARGAKVTDI AILVVAADDG VMPQTVEALN HAQAANVPIV VAVNKIDKEG ANPDKVRGQL TEYGLVPEEY GGDTMFVEVS ARQNLNIDEL LEAVLLTADA ALDMRANPNK DARGIAIEAN LDKGRGAVAT VLVQSGTLHV GDTIVAGTAH GRVRAMFDDD GSVLTEAGPS RPVQVLGLSN VPRAGDTFFV TADERTARQI AEKREAADRN AALAKRRKRI SLEDFDQAVA EGKIDTLNLI LKGDVSGAVE ALEDALLKID VGEGVQLRVI HRGVGAITQN DVNLATVDSA VIIGFNVKPA ERVAELADRE GVDMRFYSVI YAAIDDIEMA LKGMLKPEYE EVQLGTAEVR EVFRSSKFGN IAGSIVRSGV IRRNSKARIS RDGKIIGDNL TVETLKRFKD DATEVRTDFE CGIGLGSYND INEGDIIETF EMREKPRV
|
| |