Gene Acid345_0691 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0691 
Symbol 
ID4071336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp849651 
End bp852590 
Gene Length2940 bp 
Protein Length979 aa 
Translation table11 
GC content61% 
IMG OID637982697 
Producthypothetical protein 
Protein accessionYP_589770 
Protein GI94967722 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACTCATC GTCTCTTGCG TACCGCGCTC GTCTGCGTTT CGTTCGTCTC GATTTTCGGA 
GTTCTCACCT CTGCCCAGGA ATTTCGCGGC GGACTAGCTG GTAGGGTCCA GGACGCCTCT
GGCGCCCGCG TCGCCGCAGC CCACGTCCAG GTGCAGGCTC CAGAGTCTTC CGTGCAACGC
GAGACCGCCA CAGACGCAAG CGGCAACTTC CGCTTCAGCG ATCTTCCCGT CGGACGCTAC
CAGGTGACCG TCAACGCGCA GGGTTTCGGT GTCGCGACCT CCGAAGTCGC GGTGCTGGTC
GGCTCAACCC GCGATGTCTT CGTGACCCTC CATCCGCCTT CGGTGAAGGA GAGTGTCGCG
GTCTCCGGTG AAGCATCCTC CATCACCATG CAGCCCCTCG ATACCTCAAG CCCCGTCCAC
CAGGCGGTGG TCTCGGCACA TGATCTCGAG GAACTCCCGC TCGCCAACCG CAGCTTCGCC
AACATGGCGT ATCTCGCACC CGGCACCCAG CCCATCGAAC CCAGCGACCC GACCAAGGCA
CGCATCACCG CCGTCGGCAC CGGTGGCAGT TCCGGGTTGA ATAACGAGAA CTCCGTGGAC
GGAGGCGACA ACTCCGACGA TTACATCGGC GGCTTCCTCC AGAACTTCTC CACCGACAGC
ATCCAGCAGT TCGCCTTCCG CGTCGCGCAG GAAGACGCCG ACACTGGCCG CACGACCGGC
GGCTCCGTCG TCATTACAAC CAAGCGCGGC ACCAACGACT GGCACGGCCT ATTCGGCTTC
TACGACCGCA CCTCTGGCCT CACCGCTCGC TACCCCATCG ACAATCCTGA GCCCAACCCG
AAGCAGCCTT TCTCGCGCCA GAACTACATC TTCAACGGCG GCGGGCCGAT TAAGAAAGAC
AAGCTCTGGG GTTTCGGATC GTTGGAATAC GTCCACGAGC GCGCCAGCAT CGCGTACAGC
AACGACAGCC TCGCACAGTT CAACGCGCTG GCCTCGCTCG CGCAGGCCGG CTATATTCCC
GGCGCTCCCG ATATCGCCGT TCCGCCCTAT GTCATCACTC CCTTTAACGA CTACATCGGC
GATGCCCGTC TCGATTGGGC GCAATCCGAT CATTCGCAGT GGTTCTTGCG CGGCGCCACC
GACCGTTACA CCACCGAGAA CGACATGGTG CAGCAGGGCA CGCTCCCGTC GGTCGGCGCC
ACGACGCGTT CGCTCTACTG GAATTTCGCG CTTAACAACC AGTACCAGTT CTCGAACACC
TGGCTCGGTT CCTTCACTTT CGACGCTTCC ATCCTGCATC GCACCGTCAA TCGCAACCAG
TACTACGGCT TCTCGCTCGA CTTCCCATTC ACCACCACGC CCAGCGTCAT CACCGGCGCC
GACACCTTCG GCGACAACTC CTTCGTAACG CCGATCACTG CCTTCCCAGT CCTGCGTAAC
CAGCAGAAGT ATCAGTTCCG CTACGATCTT TCCCACTCGA CGCCGAAGCA CACCATGAAG
TTCGGTGTGA ACTTGATCCA CGAGCCGGTC ATTGGGGGAG TATTGGCCTC ACAAGCCGAA
ACCGTCATCG CTTACTCGCA GAATCCCGTG GACTACGCGG CCAACCCTGC CAGTTTCGCG
TTTACGTCGG CGTATCTCAC CAATCCTGAT ACCTGCAACG AAAACGCTCT CGATCCCGAC
ACGGTTTGTA CCGCGACGCC TGCGGGCGAT GGGAGTTTTT GGCAGAACGT GCAGCGCCTG
GGCATCTACG CCGAGGACAT CTGGCGCGTG ACCCCGCACC TCACGCTGAA CTACGGCCTG
CGTTGGGACA CAACCTTCGG CCTCTTCGAC GTGGGTGGCC GCAGCCAGAA CGCGAACGTC
GCCCTGCAAA CCATTGCCTA CCCGCAGTAC AACGGCGTCC CGCTCGACAA CCGCAAGCAG
TTTGGTCCGC GCGTCGGCGC CATTTATTCA CCGGGAGATA GCGGCAAACT CGTTCTTCGC
GCTGGCTTTG GCATGTTCTA CAACGATCTC GCGCAAAACG GCTGGGTGGA TGCGTTGATG
GCCGTTAACC CCGGCAATGC CAACGTCAAC AGCACCGGCG CGATCATCGA CCCGCACTAC
CACACGCCCT ACGCCATCGA CGCTAGCGCC GGCGTGGAAT ACGCCTTCGA TCAAGACTGG
ATGGGTGCCG TCGAATTCAC GCACCAGACC GGCATGCACG GCTATCGTCG CTACGACTAT
CCCGATGTCT CAGTCTTCCG CAGCGACAAT CGCTCCGCCT ACGACGGCCT CGTCCTGCGC
GTACAGGGCA ACGTCTCCAA GCACTACAGC CTGACTGCCC ACTACACCTT TGCCAAGGCC
CAGACCTGGG GCTGCCAACT CGGCGAACTC TTTGACTACG TGAACGGCGT CTGCGACCCC
TTTAACGCCT TTGGCCCGGG GGACTACGGT CCAGCCGGTG AAGACGTACG CCACCGCTTC
GTGCTCGCAG GCACATGGCA CGCCCCGCTC GGCATCGAGC TTTCCACCAT GACGCAAGCC
GAGAGCGGTC GCCCCTTTAC CATTACGAAT CCCGACGGCT CCGGCCGCGC CGTCATTAAC
GGCGTCACCA CTACCATGGA CCAGTTCCGC GGACGCCCAT ACTTCCAGGT GGACCTCCGC
GTCTCCCGTC CGTTCCACAT CCAGGAACGC TGGCAAGTCA CCCCGTTCTT CGAGATGTTT
AACCTCTTCA ACCGCAACAA TCCCGGCGCG TTCTACCGCG CCAACATGGC GGATTTGCCG
GTCAACGATC CTGACAATGC CACAGCCATC TGTTTGAACG CTGATTGCAG CCAGACCAAA
CCCATCACCA GCCCGAACCA ACTCCGCATT CCAGCCGGCG CTTTCGGCGA CTTCTTCGGC
CCCGGTACTA CCGTCGGCAT TCCATTCAGC GGGCAATTCG GTGTTAGAGT GAGCTTCTGA
 
Protein sequence
MTHRLLRTAL VCVSFVSIFG VLTSAQEFRG GLAGRVQDAS GARVAAAHVQ VQAPESSVQR 
ETATDASGNF RFSDLPVGRY QVTVNAQGFG VATSEVAVLV GSTRDVFVTL HPPSVKESVA
VSGEASSITM QPLDTSSPVH QAVVSAHDLE ELPLANRSFA NMAYLAPGTQ PIEPSDPTKA
RITAVGTGGS SGLNNENSVD GGDNSDDYIG GFLQNFSTDS IQQFAFRVAQ EDADTGRTTG
GSVVITTKRG TNDWHGLFGF YDRTSGLTAR YPIDNPEPNP KQPFSRQNYI FNGGGPIKKD
KLWGFGSLEY VHERASIAYS NDSLAQFNAL ASLAQAGYIP GAPDIAVPPY VITPFNDYIG
DARLDWAQSD HSQWFLRGAT DRYTTENDMV QQGTLPSVGA TTRSLYWNFA LNNQYQFSNT
WLGSFTFDAS ILHRTVNRNQ YYGFSLDFPF TTTPSVITGA DTFGDNSFVT PITAFPVLRN
QQKYQFRYDL SHSTPKHTMK FGVNLIHEPV IGGVLASQAE TVIAYSQNPV DYAANPASFA
FTSAYLTNPD TCNENALDPD TVCTATPAGD GSFWQNVQRL GIYAEDIWRV TPHLTLNYGL
RWDTTFGLFD VGGRSQNANV ALQTIAYPQY NGVPLDNRKQ FGPRVGAIYS PGDSGKLVLR
AGFGMFYNDL AQNGWVDALM AVNPGNANVN STGAIIDPHY HTPYAIDASA GVEYAFDQDW
MGAVEFTHQT GMHGYRRYDY PDVSVFRSDN RSAYDGLVLR VQGNVSKHYS LTAHYTFAKA
QTWGCQLGEL FDYVNGVCDP FNAFGPGDYG PAGEDVRHRF VLAGTWHAPL GIELSTMTQA
ESGRPFTITN PDGSGRAVIN GVTTTMDQFR GRPYFQVDLR VSRPFHIQER WQVTPFFEMF
NLFNRNNPGA FYRANMADLP VNDPDNATAI CLNADCSQTK PITSPNQLRI PAGAFGDFFG
PGTTVGIPFS GQFGVRVSF