Gene Information |
Locus tag | Acid345_4217 |
Symbol | |
ID | 4073143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4992500 |
End bp | 4995535 |
Gene Length | 3036 bp |
Protein Length | 1011 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637986248 |
Product | translation initiation factor 2 |
Protein accession | YP_593291 |
Protein GI | 94971243 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0532] Translation initiation factor 2 (IF-2; GTPase) |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain; [TIGR00487] translation initiation factor IF-2 |
|
|
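The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon (the record lists "Is gene spliced | No"). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 4992500
END_BP = 4995535
GENE_LENGTH_BP = 3036
PROTEIN_LENGTH_AA = 1011


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codons in an unspliced CDS, minus the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons agree with the listed gene length and protein length.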
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.726068 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0540264 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATTC GAATTAACGA TTTAGCACGA GAGCTGGAAG TGAAGAGCAA GGCTATTCTC GATGCGCTGA CCAAGGTCGG CGTGACCGAG AAGAAGACCC ACTCCAGTTC GATTGAAGAT CACGAAGCCG TGCTGGTGAA GAAGTACATC CATGAGCACG GGACCGAAGA ATCGCCGCGT CGGCGAAGCG CCGGAGAAGA CGAGTTCAAG CCGAAGATCG ATCTCTCAAA GATTTCGAAG CCCGGCGATG TGCTCAAGGC GCTCACGCAG AAGGCTGCCC CTCCGCCGCC ACCTCCGCCT CCGCCACGCC CAGCCGTGAA GGCGCCGAGC CCTGTTTCGC AGGAGCCGCG TCCGCCGGCC GTTCCGCCCG CACCTCAGAA GCCCGCCGTG TTTGCGCGTC CGGCCTCCGA GACGGTGCAT ACGCCGCCTG AGCCACCAAA GCCGCGTTTC ATTACGCCAG CGAGCGTTGC CGCGCAGCGT CCGGTGATCA CGCCTCCGAA GCCGCCAGTT CCGCCCGCGC CTCCGGTAGC CGTCGCGCCG CCGGCAGTGA TTGAACCGGC CGCTCCGGCC GAAGAGCCAA AGGCCGCTGC GCCAGCTACG ACTGCGCCGG AAGCGCCCGA AGTTAAAGCG CCGGTTTCGC CGGAGCGAGT CGCTCCCGCC GCGGACACTG GCGCACACGT AACCGCAAAG CCGGAAGCCC CAGCAGCTCC AGGCGCAGCC ACTCCCGCGC CTACGCCGGG ACGTCCGCTG CCGGGTGTGC CGTTGCGCCA ACAGACGCCG GGTCGTCGCA TGATCGTTCC GCAAACCGGA CCACGTCCGG TTTACAGCGC GCCGCCGCCG GCACCACCGC GTCCGACTCC GCCGCCGCAA ATGTCGCAGG GAGCAGGTAC GCGTCCGGGT ATGCCGGTGC GCGGTCAGCC CATTTTCCAG CGCCGTCCGC AAAGCGGTCC TGGTGGTGGT TCGGGAGGTC CAGGTGGATT CCAGCGTCCT GGCGGTCCGC CGCGTCCGGG GGATCGCCCG CGTGGTCCGC ATCCAACGCG GCAGTTCCCC AGCGGTCCCC GTCCGATGGG CGGGATCGGC CTAGCGCCTC CGGGAGCACC CGCGAATAAG CCGGCAGGCC GTCCGGCACC GGCACGGCGT CCGGGCCAGC GTTATGTCCC GCGTGGACAA AAAGAAGGCC CAATGAAGGG CTTTGTTCCG CCACCGCGGT TGTCGCTCTC CAATGAGCCG CTACCGATCA CGCGGAACAT CACGATCTCC GAAGGTATCA GCGTGAAAGA TCTCGCTGAG AAGCTCGGGA TTCGCGCGAA AGACCTCATC GCCCGTTTGT TGGCGCGTGG CGTATTCGCT ACCGTCAACC AGACGCTCGA AGCCAGTCTT GCCAGTGAAA TGGCGAACCA CTTCGGCGCC TCGACGGACG TCATTACCTT CGAGGACCAA CTTGCGCAGG AGACTGCCAA GGCTGCCGGT GAGACTCCGG AAGAAGCGGC CGCGAACGCT GTCGTGCGTC CTCCGGTCGT CACCATCATG GGCCACGTTG ATCACGGTAA GACGAGCTTG CTCGACGCAA TCCGCGCGAC CGACGTAGCG GGTGGCGAAG CCGGTGGCAT CACGCAGCAC ATCGGCGCTT ACAAGGTAGC GATCGGTGAT CCGAACTCTC CGGCGTTTGG CCGCGAGATC GTATTCCTCG ATACCCCAGG TCACGAGGCG TTTACCCGCA TGCGTGCCCG CGGCTCGAAG ATCACGGACA TCGTTGTGAT CGTTGTCGCT GCTGATGACG GCGTCATGCC GCAGACGGTC 
GAGGCCATCG ACCACGCGAG AGCGGCGAAC GTGCCGATCA TCGTGGCGGT GAACAAGATC GACAAGCCAG ACGCTATGCC CGAGCGCGTG AAGAAGCAAC TCGCTGATCG TGGCCTGATG CCGGAAGATT GGGGTGGCAA CACCGTGTTC GTCGACGTAT CGGCGAAACA GAAGACCAAT CTCAACCTGC TGATGGAAAT GATCTGCCTG GTTGCCGACC TCGGCGACCT GAAGGCGAAT CCCGATCGCA TGGCGAGCGG TACAGTTGTG GAAGCGAAAC TTGATCGCGG ACGCGGTCCG GTTGCAACCG TGCTGGTTCA GAATGGCACG CTCAGGACCA GCGACAACTT CGTGGTCGGC AACGCATTCG GCAAAGTCCG CGCCATGTTT AACGATCGTG GTGTGTCGCT CGACACCGCT GGACCTTCGA CTCCGGTCGA GATCATTGGT CTCGAGACAC TGCCGCAAGC CGGCGACCAG TTCACGGTCG TAGCCGATCG TGAGAAGGCC CGCGACATCT CCGAGTACCG CGAAGGCCGC GCTCGCGAAG CACAGCTTGC GAAGAGCTCG CGCGTTTCAC TCGAAGGCTT GGCTGAACAG CTCAAGACCG CCGGACAGAA GGACCTGCCG ATCATCCTCA AGGGCGATGT GCAGGGCTCG GTCGAAGTGC TGAATGACTT GCTGAGCAAG ATGTCGACGG AAAAGGTGAA GATCACCATG ATCCGTAGCG GAGTGGGTGC GATCACCGAA TCCGACGTGC TGCTGGCCTC GGCGTCGAAC GCGATCATCA TCGGGTTCAA CGTGCGACCG GAGCGCAAGG CGCAAGAGCT CGCCGTACAG GAGGGGGTCG ACATCCGCCT GCACTCGATC ATCTACGAGT TGCAGGACGA GATGAAGAAA GCCATGCTCG GCTTGCTCGA ACCGATCATC AAGGAAACCT ACCAGGGGCG CGCGGACGTC AAAGACACCT TCCGCATCCC GAAGGTGGGT ACCATCGCCG GTTGCCAGGT TGCGGATGGC ATCATCAAAC GCGACTCGCA CGTGCGCTTG GTGCGTGACA ACGTGGTGAT CTACACCGGC AAGATCGGAT CGCTGAAGCG TTTCAAAGAC GACGCCAGCG AGGTCCGTAA CGGCATGGAG TGCGGTATCG GTATCGCGGG TTACGGCGAC ATTCGCAGCG GGGACGTGAT CGAAGCGTTC ACCAGCGAAA AGATTGCTGC CGACTCGCTG CACTAG
|
Protein sequence | MKIRINDLAR ELEVKSKAIL DALTKVGVTE KKTHSSSIED HEAVLVKKYI HEHGTEESPR RRSAGEDEFK PKIDLSKISK PGDVLKALTQ KAAPPPPPPP PPRPAVKAPS PVSQEPRPPA VPPAPQKPAV FARPASETVH TPPEPPKPRF ITPASVAAQR PVITPPKPPV PPAPPVAVAP PAVIEPAAPA EEPKAAAPAT TAPEAPEVKA PVSPERVAPA ADTGAHVTAK PEAPAAPGAA TPAPTPGRPL PGVPLRQQTP GRRMIVPQTG PRPVYSAPPP APPRPTPPPQ MSQGAGTRPG MPVRGQPIFQ RRPQSGPGGG SGGPGGFQRP GGPPRPGDRP RGPHPTRQFP SGPRPMGGIG LAPPGAPANK PAGRPAPARR PGQRYVPRGQ KEGPMKGFVP PPRLSLSNEP LPITRNITIS EGISVKDLAE KLGIRAKDLI ARLLARGVFA TVNQTLEASL ASEMANHFGA STDVITFEDQ LAQETAKAAG ETPEEAAANA VVRPPVVTIM GHVDHGKTSL LDAIRATDVA GGEAGGITQH IGAYKVAIGD PNSPAFGREI VFLDTPGHEA FTRMRARGSK ITDIVVIVVA ADDGVMPQTV EAIDHARAAN VPIIVAVNKI DKPDAMPERV KKQLADRGLM PEDWGGNTVF VDVSAKQKTN LNLLMEMICL VADLGDLKAN PDRMASGTVV EAKLDRGRGP VATVLVQNGT LRTSDNFVVG NAFGKVRAMF NDRGVSLDTA GPSTPVEIIG LETLPQAGDQ FTVVADREKA RDISEYREGR AREAQLAKSS RVSLEGLAEQ LKTAGQKDLP IILKGDVQGS VEVLNDLLSK MSTEKVKITM IRSGVGAITE SDVLLASASN AIIIGFNVRP ERKAQELAVQ EGVDIRLHSI IYELQDEMKK AMLGLLEPII KETYQGRADV KDTFRIPKVG TIAGCQVADG IIKRDSHVRL VRDNVVIYTG KIGSLKRFKD DASEVRNGME CGIGIAGYGD IRSGDVIEAF TSEKIAADSL H
|
| |
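The gene and protein sequences above are related by translation table 11. As a minimal sketch, the code below strips the 10-base grouping used in the record, translates a coding sequence, and computes GC content. It assumes the listed gene sequence is already given in the coding orientation (it begins with ATG and ends with TAG, although the gene lies on the minus strand). It also relies on table 11 assigning the same amino acids to codons as the standard code and differing only in its permitted alternative start codons, which does not matter here because this gene starts with ATG. All function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(raw.split()).upper()


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, as grouped in the record.
    head = clean_sequence("ATGAAGATTC GAATTAACGA TTTAGCACGA")
    print(translate(head))
```

Applied to the full gene sequence, `translate` should reproduce the listed protein sequence and `gc_fraction` should come out close to the listed GC content, provided the assumptions above hold.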