Gene Acid345_4269 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4269 
Symbol 
ID4071841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5073537 
End bp5075501 
Gene Length1965 bp 
Protein Length654 aa 
Translation table11 
GC content52% 
IMG OID637986301 
Producthypothetical protein 
Protein accessionYP_593343 
Protein GI94971295 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGGATT TCGAACCATG CTATTTTCGC GGTACCGGTG CACGAAAGAG GTCCATCGGC 
GTAGATGGAT TCGCATTCGA TGACGCCGAT GGGTCTGTGC GCCTCTTCAT CGCCGAGTTT
GGCGGTGGTG AGGAGCCCGA GACCCTCACC CAAACCGACG CCAAGTCCCA TTTCGCTCGC
CTTCAGGCTT TTTGTGAAGA GGCTGTATCT GGCAAGCTCC ACCGTGAAAT TGAAGAGAGC
AACCCTGCTG CCGGACTCGC GCAATTACTA TTTAAAGAGA GAGGAGCTGT CACACGATTT
CGCCTCTATC TGATTACCGA CGCCGAGATG AGTTCCCGTA TAAGGGATTG GCCTGAATCT
GAAATCTCGA ACATCAAAGC TGAGTTTCAC ATATGGGACA TCGTCCGTTT TCAAAGAGCA
TTCGAATCAC GCACCGGCAA GGATGAACTG GAGGTAGATT TGCAGGAGCT CGTTGAAGGT
GGTGTTCCTT GCTTGGGCGC CAGCGTTGAT TCTGACGAAT ATCTTGCGTA TCTATGTGTT
ATCCCAGGCG AGGCCCTCGC GAACATCTAC GACGAATACG GGAGTAGGTT ACTGGAAGGA
AATGTGCGCT CATTCTTGAG CACGAAGGGC CGGGTCAACA AGGGAATTCG ACAGACAATC
CTTACACGAC CTCACATGTT TTTCGCGTTC AACAATGGGA TTGCATGCAC CGCGTCTAAA
GTGGATGTCG TCACCGGCCC ATCTGGGTTG CGGATCACTA AAGCGTCAGA CTTACAAATC
GTGAATGGAG GTCAGACGAC CGCATCGCTG GCAGCGGCGA AACGAAACGA CAAAGCAGCG
TTGGATCACG TTTTCGTCCA GATGAAGCTT TCAGTGGTGC CACCGGAGCG CTCGGGTCAG
GTCATACCAG AGATTTCGCG ATGCGCTAAC AGTCAGAATC GAGTGAGTGA CGCGGACTTC
TTCTCAAACC ATGAATTTCA TCGAAGAATT GAACAGATTT CTCGAAAATT ATGGGCGCCT
GCTGTCGGAG GAGCACAGCA CGGAACTCAA TGGTTCTACG AACGCGCAAG AGGTCAATAC
CTGAATGAAC AATCCGGCTT GTCATTGTCC GACCGCAAAC GCTTTGTTCT CCAGCATCCG
CGCCATCAGG TGATCGCCAA GACGGACTTG GCGAAGTACG AAAATGCATG GCGGCAACTT
CCGCATCTCG TCAGTCAGGG CGCGCAGAAG AACTTCCTCT CATTCAGTTC GTACGCTTCC
GACGCCTGGG ACAAGAACGA GGTTCAGTTT AACGATGAGT ATTTCAAGAG GGTAGTTGCG
AAGGCAATCT TGTTCCGTCG CACCGAACAA ATCGTGTCAA AACAACGTTG GTATCAGGGC
GGATATCGCG CGAATATAGT CGCTTATTCT ATCTCTAAGT TGTCACGGAT GATCGAAGTC
GAAGCACCGG GTCGCGCGCT AGATTTCCGA AGCATCTGGT TACGGCAAGC ATTAACGCCC
GCGACCGAGA CTCAGATTGC AAAAATCGCG GAATCAGTTT TCGACGTGAT TGTAAATCCT
GCTGGTGGAT TTCAAAACAT AACTGAATGG GGCAAGAAGG AGCTCTGCTG GAAGCGAGTT
GCTGAACTCG AGATTCCGCT CGATTCGACG TTCTACAAAG AACTGGCCGA TCATGAATCT
GATCTCCAGC AAAAAAAGGA CTCTGCAGTT GATCAGAAGA TCGAGATCGG GATTGAACAA
CAGGCGTCGG TGTTACAGCT TGGCGCGTCG TATTGGAAGC AGATCCGGGA GTTCGGCGCG
CGTGAAGGCT TGTTGAGCCC CGACGACATT TCCATTCTCG GGTTGGCGTG CCTAATTCCG
AACAAGGTTC CGTCTGAAAA ACAGAGTTCA CGATTGTTAC AGATGAAAAC TCGCATGGAG
TCGGAGGGTT TCCCCATTCG GACGATAGCG ACTGGAGCGT CCTAG
 
Protein sequence
MSDFEPCYFR GTGARKRSIG VDGFAFDDAD GSVRLFIAEF GGGEEPETLT QTDAKSHFAR 
LQAFCEEAVS GKLHREIEES NPAAGLAQLL FKERGAVTRF RLYLITDAEM SSRIRDWPES
EISNIKAEFH IWDIVRFQRA FESRTGKDEL EVDLQELVEG GVPCLGASVD SDEYLAYLCV
IPGEALANIY DEYGSRLLEG NVRSFLSTKG RVNKGIRQTI LTRPHMFFAF NNGIACTASK
VDVVTGPSGL RITKASDLQI VNGGQTTASL AAAKRNDKAA LDHVFVQMKL SVVPPERSGQ
VIPEISRCAN SQNRVSDADF FSNHEFHRRI EQISRKLWAP AVGGAQHGTQ WFYERARGQY
LNEQSGLSLS DRKRFVLQHP RHQVIAKTDL AKYENAWRQL PHLVSQGAQK NFLSFSSYAS
DAWDKNEVQF NDEYFKRVVA KAILFRRTEQ IVSKQRWYQG GYRANIVAYS ISKLSRMIEV
EAPGRALDFR SIWLRQALTP ATETQIAKIA ESVFDVIVNP AGGFQNITEW GKKELCWKRV
AELEIPLDST FYKELADHES DLQQKKDSAV DQKIEIGIEQ QASVLQLGAS YWKQIREFGA
REGLLSPDDI SILGLACLIP NKVPSEKQSS RLLQMKTRME SEGFPIRTIA TGAS