Gene Acid345_4217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4217 
Symbol 
ID4073143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4992500 
End bp4995535 
Gene Length3036 bp 
Protein Length1011 aa 
Translation table11 
GC content63% 
IMG OID637986248 
Producttranslation initiation factor 2 
Protein accessionYP_593291 
Protein GI94971243 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0532] Translation initiation factor 2 (IF-2; GTPase) 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00487] translation initiation factor IF-2 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.726068 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0540264 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATTC GAATTAACGA TTTAGCACGA GAGCTGGAAG TGAAGAGCAA GGCTATTCTC 
GATGCGCTGA CCAAGGTCGG CGTGACCGAG AAGAAGACCC ACTCCAGTTC GATTGAAGAT
CACGAAGCCG TGCTGGTGAA GAAGTACATC CATGAGCACG GGACCGAAGA ATCGCCGCGT
CGGCGAAGCG CCGGAGAAGA CGAGTTCAAG CCGAAGATCG ATCTCTCAAA GATTTCGAAG
CCCGGCGATG TGCTCAAGGC GCTCACGCAG AAGGCTGCCC CTCCGCCGCC ACCTCCGCCT
CCGCCACGCC CAGCCGTGAA GGCGCCGAGC CCTGTTTCGC AGGAGCCGCG TCCGCCGGCC
GTTCCGCCCG CACCTCAGAA GCCCGCCGTG TTTGCGCGTC CGGCCTCCGA GACGGTGCAT
ACGCCGCCTG AGCCACCAAA GCCGCGTTTC ATTACGCCAG CGAGCGTTGC CGCGCAGCGT
CCGGTGATCA CGCCTCCGAA GCCGCCAGTT CCGCCCGCGC CTCCGGTAGC CGTCGCGCCG
CCGGCAGTGA TTGAACCGGC CGCTCCGGCC GAAGAGCCAA AGGCCGCTGC GCCAGCTACG
ACTGCGCCGG AAGCGCCCGA AGTTAAAGCG CCGGTTTCGC CGGAGCGAGT CGCTCCCGCC
GCGGACACTG GCGCACACGT AACCGCAAAG CCGGAAGCCC CAGCAGCTCC AGGCGCAGCC
ACTCCCGCGC CTACGCCGGG ACGTCCGCTG CCGGGTGTGC CGTTGCGCCA ACAGACGCCG
GGTCGTCGCA TGATCGTTCC GCAAACCGGA CCACGTCCGG TTTACAGCGC GCCGCCGCCG
GCACCACCGC GTCCGACTCC GCCGCCGCAA ATGTCGCAGG GAGCAGGTAC GCGTCCGGGT
ATGCCGGTGC GCGGTCAGCC CATTTTCCAG CGCCGTCCGC AAAGCGGTCC TGGTGGTGGT
TCGGGAGGTC CAGGTGGATT CCAGCGTCCT GGCGGTCCGC CGCGTCCGGG GGATCGCCCG
CGTGGTCCGC ATCCAACGCG GCAGTTCCCC AGCGGTCCCC GTCCGATGGG CGGGATCGGC
CTAGCGCCTC CGGGAGCACC CGCGAATAAG CCGGCAGGCC GTCCGGCACC GGCACGGCGT
CCGGGCCAGC GTTATGTCCC GCGTGGACAA AAAGAAGGCC CAATGAAGGG CTTTGTTCCG
CCACCGCGGT TGTCGCTCTC CAATGAGCCG CTACCGATCA CGCGGAACAT CACGATCTCC
GAAGGTATCA GCGTGAAAGA TCTCGCTGAG AAGCTCGGGA TTCGCGCGAA AGACCTCATC
GCCCGTTTGT TGGCGCGTGG CGTATTCGCT ACCGTCAACC AGACGCTCGA AGCCAGTCTT
GCCAGTGAAA TGGCGAACCA CTTCGGCGCC TCGACGGACG TCATTACCTT CGAGGACCAA
CTTGCGCAGG AGACTGCCAA GGCTGCCGGT GAGACTCCGG AAGAAGCGGC CGCGAACGCT
GTCGTGCGTC CTCCGGTCGT CACCATCATG GGCCACGTTG ATCACGGTAA GACGAGCTTG
CTCGACGCAA TCCGCGCGAC CGACGTAGCG GGTGGCGAAG CCGGTGGCAT CACGCAGCAC
ATCGGCGCTT ACAAGGTAGC GATCGGTGAT CCGAACTCTC CGGCGTTTGG CCGCGAGATC
GTATTCCTCG ATACCCCAGG TCACGAGGCG TTTACCCGCA TGCGTGCCCG CGGCTCGAAG
ATCACGGACA TCGTTGTGAT CGTTGTCGCT GCTGATGACG GCGTCATGCC GCAGACGGTC
GAGGCCATCG ACCACGCGAG AGCGGCGAAC GTGCCGATCA TCGTGGCGGT GAACAAGATC
GACAAGCCAG ACGCTATGCC CGAGCGCGTG AAGAAGCAAC TCGCTGATCG TGGCCTGATG
CCGGAAGATT GGGGTGGCAA CACCGTGTTC GTCGACGTAT CGGCGAAACA GAAGACCAAT
CTCAACCTGC TGATGGAAAT GATCTGCCTG GTTGCCGACC TCGGCGACCT GAAGGCGAAT
CCCGATCGCA TGGCGAGCGG TACAGTTGTG GAAGCGAAAC TTGATCGCGG ACGCGGTCCG
GTTGCAACCG TGCTGGTTCA GAATGGCACG CTCAGGACCA GCGACAACTT CGTGGTCGGC
AACGCATTCG GCAAAGTCCG CGCCATGTTT AACGATCGTG GTGTGTCGCT CGACACCGCT
GGACCTTCGA CTCCGGTCGA GATCATTGGT CTCGAGACAC TGCCGCAAGC CGGCGACCAG
TTCACGGTCG TAGCCGATCG TGAGAAGGCC CGCGACATCT CCGAGTACCG CGAAGGCCGC
GCTCGCGAAG CACAGCTTGC GAAGAGCTCG CGCGTTTCAC TCGAAGGCTT GGCTGAACAG
CTCAAGACCG CCGGACAGAA GGACCTGCCG ATCATCCTCA AGGGCGATGT GCAGGGCTCG
GTCGAAGTGC TGAATGACTT GCTGAGCAAG ATGTCGACGG AAAAGGTGAA GATCACCATG
ATCCGTAGCG GAGTGGGTGC GATCACCGAA TCCGACGTGC TGCTGGCCTC GGCGTCGAAC
GCGATCATCA TCGGGTTCAA CGTGCGACCG GAGCGCAAGG CGCAAGAGCT CGCCGTACAG
GAGGGGGTCG ACATCCGCCT GCACTCGATC ATCTACGAGT TGCAGGACGA GATGAAGAAA
GCCATGCTCG GCTTGCTCGA ACCGATCATC AAGGAAACCT ACCAGGGGCG CGCGGACGTC
AAAGACACCT TCCGCATCCC GAAGGTGGGT ACCATCGCCG GTTGCCAGGT TGCGGATGGC
ATCATCAAAC GCGACTCGCA CGTGCGCTTG GTGCGTGACA ACGTGGTGAT CTACACCGGC
AAGATCGGAT CGCTGAAGCG TTTCAAAGAC GACGCCAGCG AGGTCCGTAA CGGCATGGAG
TGCGGTATCG GTATCGCGGG TTACGGCGAC ATTCGCAGCG GGGACGTGAT CGAAGCGTTC
ACCAGCGAAA AGATTGCTGC CGACTCGCTG CACTAG
 
Protein sequence
MKIRINDLAR ELEVKSKAIL DALTKVGVTE KKTHSSSIED HEAVLVKKYI HEHGTEESPR 
RRSAGEDEFK PKIDLSKISK PGDVLKALTQ KAAPPPPPPP PPRPAVKAPS PVSQEPRPPA
VPPAPQKPAV FARPASETVH TPPEPPKPRF ITPASVAAQR PVITPPKPPV PPAPPVAVAP
PAVIEPAAPA EEPKAAAPAT TAPEAPEVKA PVSPERVAPA ADTGAHVTAK PEAPAAPGAA
TPAPTPGRPL PGVPLRQQTP GRRMIVPQTG PRPVYSAPPP APPRPTPPPQ MSQGAGTRPG
MPVRGQPIFQ RRPQSGPGGG SGGPGGFQRP GGPPRPGDRP RGPHPTRQFP SGPRPMGGIG
LAPPGAPANK PAGRPAPARR PGQRYVPRGQ KEGPMKGFVP PPRLSLSNEP LPITRNITIS
EGISVKDLAE KLGIRAKDLI ARLLARGVFA TVNQTLEASL ASEMANHFGA STDVITFEDQ
LAQETAKAAG ETPEEAAANA VVRPPVVTIM GHVDHGKTSL LDAIRATDVA GGEAGGITQH
IGAYKVAIGD PNSPAFGREI VFLDTPGHEA FTRMRARGSK ITDIVVIVVA ADDGVMPQTV
EAIDHARAAN VPIIVAVNKI DKPDAMPERV KKQLADRGLM PEDWGGNTVF VDVSAKQKTN
LNLLMEMICL VADLGDLKAN PDRMASGTVV EAKLDRGRGP VATVLVQNGT LRTSDNFVVG
NAFGKVRAMF NDRGVSLDTA GPSTPVEIIG LETLPQAGDQ FTVVADREKA RDISEYREGR
AREAQLAKSS RVSLEGLAEQ LKTAGQKDLP IILKGDVQGS VEVLNDLLSK MSTEKVKITM
IRSGVGAITE SDVLLASASN AIIIGFNVRP ERKAQELAVQ EGVDIRLHSI IYELQDEMKK
AMLGLLEPII KETYQGRADV KDTFRIPKVG TIAGCQVADG IIKRDSHVRL VRDNVVIYTG
KIGSLKRFKD DASEVRNGME CGIGIAGYGD IRSGDVIEAF TSEKIAADSL H