Gene Acid345_1861 details

Gene Information

Locus tag: Acid345_1861
Symbol:
ID: 4069203
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2237304
End bp: 2240114
Gene Length: 2811 bp
Protein Length: 936 aa
Translation table: 11
GC content: 60%
IMG OID: 637983870
Product: isoleucyl-tRNA synthetase
Protein accession: YP_590936
Protein GI: 94968888
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0060] Isoleucyl-tRNA synthetase
TIGRFAM ID: [TIGR00392] isoleucyl-tRNA synthetase
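
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residues encoded by a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2237304, 2240114)
print(length, protein_length_aa(length))  # prints: 2811 936
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.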


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGCCACTGG AGTTGAAAGA CACCATCAAC CTGCCCAAAA CCGACTTTGC CATGAAAGCC 
AACCTTCCCC TGAACGAGCC CAAAATGCTC GCCCGCTGGG AAGAGCAGCG AATCTACGAA
TTGATTCGTG AGTCGCGGCA GGGCAAGCCC AGCTACATCC TCCACGACGG CCCTCCCTAC
GCGAACGGAC CGATCCACCT CGGCCACGCC CTGAACAAGT GTCTAAAAGA TTTCGTCGTG
AAGTCGAAGA CCATGGCTGG CTTCGACGCG CCCTATATCC CCGGATGGGA CTGCCACGGG
CTGCCGATCG AGATCAAGGT TGACGAACAA CTCGGGCGTA AGAAGCTCGA AATGGACCCG
CTCGACGTTC GTGCGGCATG CGCAAAGTAC GCGCTGAAGT ATCTCGACAC CCAGCGCGAG
CAGTTCAAGC GTCTAGGGGT CTTCGGCCAG TGGGACAAAC CGTACTCGAC GATGACGCCC
GAGTACGAGT CCGTAGTGCT GCGCATCTTC TACGACTTCC TCGAACAGGG CGCGGTCTAC
AAAGGACTCC GGCCTGTGTA CTGGTGCATC CACGACAAGA CCGCTCTGGC GGAAGCTGAA
GTCGAATACG AGATGCACAC CAGCCCCAGC GTGTACGTGC GTTACATGAT GACTAGCGAT
CCGGGCGGCA TCGATCCGGC ACTCGCGGGT AAGCAGGCCG CCGCCATCAT CTGGACGACC
ACGCCTTGGA CGCTGCCCGC TTCCATGGCG ATCGCCTTCA GTCCGAACGC GGAGTACGTC
GGCCTTGAGC ACGATGGGCT CGTGTACATC GTGGCCGGAG AACTCGCCGA AGCGACGAAG
GCTAAGACCG ATCTTCATGA CGCGAAGGAA ATCGCTCGCT TCGCAGGCAG CAAACTCGAG
CGCGCTACCT TCCAGCACCC GTTCCTCGAT CGCTCCATCC TCGGCGTGCT CGCCGACTAC
GTCACCATGG ACACCGGCAC CGGCGCGGTA CACACCGCCC CTGCCCATGG CGCTGACGAC
TTCTACACCG GCGTGAAGTA CGGCATCGAC CAGACCTGCA ATGTTGACGA GGCGGGACGT
TTGCGCAATG GCCTGCCTGA GTACGACGGC ATGACCGTCT TCAAGGCCAA TCCGGTCATC
GTGCAATTGC TGCGCGAGCG CGGCGTGCTA CTCGGTTTCG AAAACATCGA GCACTCGTAT
CCGCATTGCT GGCGCTGCCA TAATCCCATC ATCTTCCGCG CCACCGAACA GTGGTTCATC
GCCATGGAAG CCAAGATGAG CAACGGCACG CTTCGTTCCG TCGCCCTCGA CGAAATCAAG
AAGGTCAAAT GGGACCCTTC GTGGGGTGAA GAGCGCATCT CCAACATGAT CGCCACCCGG
CCCGATTGGT GCATCTCACG GCAGCGCCTC TGGGGCGTCC CCATCGCGGT GTTCTTCTGC
GAGGGCTGCA ACACGCTCGT CAGCGACAAG GCCGTCAACG CCGGAGTCGT GGAACGAGTC
GTAAAAGAGG GTGGGGATAC CTGGTACAAG CACCAGGCCA GCGAACTTCT GCCCTCGGGC
TACAAGTGCT CCAAATGCGG CGGCACGAGC TTCCGCAAGG AGATGGACAT CATCGACGTG
TGGTTCGAGA GCGGCTCCAG CAAGCTGGCC GTGATCGGTG AGCCTACGGC CGATTTCTAC
ACCGAAGGCG GCGATCAGCA TCGCGGATGG TTCCACTCTT CCCTGCTCTG CCACATTGGC
GCACAGGGTC ACGCGCCTTA CAAACATGTG GCCACCAGCG GGTGGACGCT CGATCCACAG
GGTCGCGCTA TGTCAAAGTC GCTAGGCAAC GTCGTCGATC CAGTCGATAT CGCGAAGCGC
CTCGGCGCCG AGATCGTTCG CCTCTGGGTA GCCAGCGTGG ATTTCCGCGA GGACGTACGC
GCCTCAGAAG AGCTGATGCA GCGCGTTGCG GAGAACTACA AGAAGATCCG CAACACCTTC
CGCTACATCC TCGGCAACTT GAAGAACTTC GATCCGGCAA AAGACGCGCT CAAGTTCGAG
GAACTGCAGC CCTTCGATCA ATACATCCTG CTGCGCTTAG CTGAAGTCAT CGGCGACGTT
CGCGACTGGT ACGACGAGAT GAGCTTCCAC AAGCTCTTCA TGCGCCTGAA GGATTTCTGC
GTGGTGGACC TCAGCGCAGT GTACTTCGAC GTCATCAAGG ATAGGCTCTA CATCTCTTTA
CCCGATGCGA AAGCACGCCG CTCGGCCCAG ACCGCGATCT GGACGATCGG AGAGGCCCTC
GTCCGCCTGC TCGCTCCGCT GATGAGCTTT ACCGCCGAAG AACTGTGGCA GTTCTTTCCG
GCGGTCGAAG GTCGTCCAAC AACCGTACAT GCGGCCTACT TCCCCAAAGC CGAAGACGTT
GCCGACAACC GCAGCGGCGA AGCGGCAAAG ACGATTGAGT CTGAGTACGA GCGCCTGATC
GCCGTCCGAA CCGACGTCCT CAAGGCGCTC GAAGAAGCCC GCAATGCCAA GCTTATTGGC
AGTGGTCTCG AAGCACAGGT CGTGCTCACC GCCCCGGCGG AACTCGTCCC ACTGCTGGAG
AAGCACAAGG CAGAGCTGCG CTACCTGTTC ATCGTCTCCG ACGTCCAACT CGCGACGGGC
GGAACCAATG GCTCAGGATT GCAGGTGCAG GTGAATAAGG CTCCCGGACA GAAGTGCGAG
CGCTGCTGGA ACTACTCGAC CCATGTTGGC GAGGACGCTG AGTATCCGAC GGTCTGTGAA
CGCTGCAGCC CGGTCCTGCA TAAATTGGAG GCAACGGCAG GAGCTCACTA G
 
Protein sequence
MPLELKDTIN LPKTDFAMKA NLPLNEPKML ARWEEQRIYE LIRESRQGKP SYILHDGPPY 
ANGPIHLGHA LNKCLKDFVV KSKTMAGFDA PYIPGWDCHG LPIEIKVDEQ LGRKKLEMDP
LDVRAACAKY ALKYLDTQRE QFKRLGVFGQ WDKPYSTMTP EYESVVLRIF YDFLEQGAVY
KGLRPVYWCI HDKTALAEAE VEYEMHTSPS VYVRYMMTSD PGGIDPALAG KQAAAIIWTT
TPWTLPASMA IAFSPNAEYV GLEHDGLVYI VAGELAEATK AKTDLHDAKE IARFAGSKLE
RATFQHPFLD RSILGVLADY VTMDTGTGAV HTAPAHGADD FYTGVKYGID QTCNVDEAGR
LRNGLPEYDG MTVFKANPVI VQLLRERGVL LGFENIEHSY PHCWRCHNPI IFRATEQWFI
AMEAKMSNGT LRSVALDEIK KVKWDPSWGE ERISNMIATR PDWCISRQRL WGVPIAVFFC
EGCNTLVSDK AVNAGVVERV VKEGGDTWYK HQASELLPSG YKCSKCGGTS FRKEMDIIDV
WFESGSSKLA VIGEPTADFY TEGGDQHRGW FHSSLLCHIG AQGHAPYKHV ATSGWTLDPQ
GRAMSKSLGN VVDPVDIAKR LGAEIVRLWV ASVDFREDVR ASEELMQRVA ENYKKIRNTF
RYILGNLKNF DPAKDALKFE ELQPFDQYIL LRLAEVIGDV RDWYDEMSFH KLFMRLKDFC
VVDLSAVYFD VIKDRLYISL PDAKARRSAQ TAIWTIGEAL VRLLAPLMSF TAEELWQFFP
AVEGRPTTVH AAYFPKAEDV ADNRSGEAAK TIESEYERLI AVRTDVLKAL EEARNAKLIG
SGLEAQVVLT APAELVPLLE KHKAELRYLF IVSDVQLATG GTNGSGLQVQ VNKAPGQKCE
RCWNYSTHVG EDAEYPTVCE RCSPVLHKLE ATAGAH
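
The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG can act as an initiator codon read as methionine. The sketch below illustrates that relationship on the first 30 nucleotides of the gene sequence. It is an illustration under stated assumptions rather than the database's own pipeline: the function name is hypothetical, and only a subset of the table 11 initiator codons (ATG, GTG, TTG) is handled.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only these initiator codons are treated as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq):
    """Translate a CDS; whitespace (such as 10-base grouping) is ignored."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[c] for c in codons]
    # An initiator codon is read as methionine regardless of its usual meaning.
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    protein = "".join(residues)
    # Drop a single terminal stop codon if present.
    return protein[:-1] if protein.endswith("*") else protein


# First 30 nucleotides of the gene sequence above.
print(translate_cds("GTGCCACTGG AGTTGAAAGA CACCATCAAC"))  # prints: MPLELKDTIN
```

The output matches the first ten residues of the protein sequence listed above.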