Gene Acid345_4346 details


Gene Information

Locus tag: Acid345_4346
Symbol:
ID: 4071764
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5154984
End bp: 5156651
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 11
GC content: 63%
IMG OID: 637986379
Product: hypothetical protein
Protein accession: YP_593420
Protein GI: 94971372
COG category:
COG ID:
TIGRFAM ID:
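
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon, which is consistent with the values listed in this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(5154984, 5156651)
print(length)                           # 1668
print(expected_protein_length(length))  # 555
```

Both results agree with the Gene Length (1668 bp) and Protein Length (555 aa) fields.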


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.372687
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTCAG ACTGCAAGTT TTATGTGTTT CGTGAAGGCC GCAGGACTGT TCCGGGCGAG 
CAGTTACTCA CCGGGTTGCG CGGGAGTTTG TTCCGGGCGA AAGACGAAGA CTCGTGGACC
GATGCGATGC TCCGGTGCGG CGAGCTCGAA TGCGCCCTGG AAGATGCGGG CGATCCTCAG
GCGCGGCATG TGGCGGCGGT GAGCAACGCG TGCGCCGATA AGCTCGTGCA CCGAGATTAT
TTCGGAGGCC GCGAACTTGG CGGCTGGTTG CCGGTGCGGC TGGAAGGCGA GGTCACGGTC
GCGACGCCGG AGGGGTTCGC GTATTACGCG CTGCATCCGC GACAGTATGC GGATGTGGCG
GCGAAGTTCG GCGTCTTTCG GCATGGCGCG AAGACGCACG AGGCCGCGCC CGAAGTGGTG
GTGATTGGGA TACGCAGCAT CGGGACGACG CTGAGCGCGA TGACGACGGC GGCGTTGCGA
TTGCGGGGGC TGCACGTGGA GCGATTCACG GTGCGGCCAG CGGGGCATCC GTTCGACCGT
AAAGTGAAGT TCAATCCGGC ACAATCGCAC ACCATCCGGA CGGCGCGATT GCAGGATGCA
TTGTTCGTGA TCGTGGATGA AGGACCGGGG TTGAGCGGGT CATCGTTTTT GTCGGTGGCG
GAGGCGCTAG TGGCGGAGCG TGTGCCGTCA GGACAGATTG TGCTGGTGCC GAATCATGCG
CCGCATTTGC CGTGGTTGCG GGCCAACAAT GCGGCGGTGC GATGGCAGCG CTATCGCACG
GTGACGCCGG CCGCGGGACG GTCTCCGGAG GGGGAGTGGA TAGGAGGAGG CGAGTGGCGG
AAAAAAACGT TCGCGAATGA GAGTGAGTGG CCGGGCGTGT GGACGAGCAT GGAGCGTGCG
AAGTTCGCGG ATGATCGCGT GCTGTGGAAG TTTGAAGGCA TCGGGCCGTA TGGTGCGCGT
GCACGCAGTA CTGCCCGTGC GCTGGCGGAT GCTGGCTTCG CGCCGAACGC GGTGCGGGAT
GAACATGGCT ATGTGGGTTA TGACCTCATA CGCGGCCGCG CGGCGAAGGC ACAAGACCTG
AGTGATGAGA GATTGAAACG GATCGCGGAG TACTGCGCGT TTCGAAGCGA AGAGTGCAAG
ACCGAAGTGA CCGAGGCGCA GCAGAAAGAT CTTGCGACGA TGCTGCGGGT GAACTACGAG
CGCGGCTTCG ACCGGAAGCT CGCGCCGCAA TTCAGGAATC TGCCGGTGGA GCGGCCGACG
GTTTGCGACG GCAAAATGTC GCCGCACGAG TGGCTGCTGA CGGAAGATGG ACGCATGCTG
AAGCTGGATG CGACCTCGCA CGGCGACGAT CATTTCTTCC CCGGCCCGTG CGACGTGGCG
TGGGACCTGG CGGGCGCGAT CGTGGAGTGG GGGATGGACC GTGCGACGGG CGAGCAATTC
CTGCGCCAGT ACACGGCGCT GACCGGCGAC AACGTGACCG GGCGGATGAG GAATTACCTG
CTGGCGTATG CGATGTTTCG CATGGCATGG ACGCACATGG CCGCTGCGGC GATGAAGGGC
ACGGCCGAGG CGACACGGCT GATGCGAGAT TCAGATCACT ATCGCGAGTA CGTGTCGGGG
CTGGTTATGG GAGCCGCAAA AGCGGTTCCA GTGGCGCGTG CTTCGTAA
 
Protein sequence
MNSDCKFYVF REGRRTVPGE QLLTGLRGSL FRAKDEDSWT DAMLRCGELE CALEDAGDPQ 
ARHVAAVSNA CADKLVHRDY FGGRELGGWL PVRLEGEVTV ATPEGFAYYA LHPRQYADVA
AKFGVFRHGA KTHEAAPEVV VIGIRSIGTT LSAMTTAALR LRGLHVERFT VRPAGHPFDR
KVKFNPAQSH TIRTARLQDA LFVIVDEGPG LSGSSFLSVA EALVAERVPS GQIVLVPNHA
PHLPWLRANN AAVRWQRYRT VTPAAGRSPE GEWIGGGEWR KKTFANESEW PGVWTSMERA
KFADDRVLWK FEGIGPYGAR ARSTARALAD AGFAPNAVRD EHGYVGYDLI RGRAAKAQDL
SDERLKRIAE YCAFRSEECK TEVTEAQQKD LATMLRVNYE RGFDRKLAPQ FRNLPVERPT
VCDGKMSPHE WLLTEDGRML KLDATSHGDD HFFPGPCDVA WDLAGAIVEW GMDRATGEQF
LRQYTALTGD NVTGRMRNYL LAYAMFRMAW THMAAAAMKG TAEATRLMRD SDHYREYVSG
LVMGAAKAVP VARAS
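
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. The following is a minimal sketch, not the tool used to generate this record: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code and differs only in which codons may act as start codons; since this gene begins with ATG, no special start-codon handling is included.

```python
from itertools import product

# Codon assignments in the NCBI ordering (bases T, C, A, G), shared by
# translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGAATTCAG ACTGCAAGTT TTATGTGTTT"))  # MNSDCKFYVF
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein Length and GC content fields to be checked in the same way.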