Gene Acid345_1354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1354 
Symbol 
ID4070892 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1642696 
End bp1644639 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content58% 
IMG OID637983363 
Producthypothetical protein 
Protein accessionYP_590430 
Protein GI94968382 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTCG TGTGCGCAGT CGTCGTACTG CTTGCCATTA CCTTGGTTGG TTTTTCGCAA 
ACGCAGCCTG CGACGGAAGT GGTTGTACCG CATCTCATTC GTTTTGCCGG TACCGTAAAA
GGCGCTACCG GCCGGGTTGC CATCACCTTC AGCTTGCATA AGAGCGATCG CGACGATGCC
GCACTGTGGA CTGAAACACA GAACGTGCAG CTTGAAGACG GCAAGTACAC CATCCTGCTT
GGCGCAACTA AAGCCGAGGG ACTACCCTTC GACCTGTTTA CTTCCGGTGA AGCGCAATGG
CTCTCGATTC GCGTGGAAGG CCGTCCTGAG CAGCGCGTAC TGCTGGTCAG CGTGCCTTAT
GCGTTGAAAG CCGCCGAGGC CGAGACCCTC GCTGGACATA GCGCCAGCGA GTTCGTGACG
ACCGAAAAAG TCACCAACCT GGTGCAACAG CAGTTGCAGC AGCAACAAAC TGCACCCACG
AAATCCGCAA CAACGAAGAA AGACGCGGGC GCAAAAGGCA ACGTCTTAAC GAGTACTGCC
ACGAACTTCA CTGACAACAC CGCAAACCAG GTTGTCCTCG TCACACAGAA AGGCGCCGGC
AATGGCCTGG TGTCGAATTC GATTTCTGCC AACGGCGTTT CCGGAACGAC CGCGTCGAGT
GCGGGGGTTG GTGTCTCTGG CGCGAACACT GCGGCAACGG GCCTCGCGAT TGGTGTGCGT
GGATCAACGG TTGCTGATAG CGGCATCTCC GTATACGGCA CAGCGGGCGG AACTAGCGGC
ACGGCGACTG GCGTGAAAGG AATTTCGGCG GCGCCGAATG GCTACGGCGT CTTCGGCCAG
AACACTGCGA CCACCGGGCT GGCAATCGGC TTCCGTGGCA CAACGGCTTC CACCAGCGGC
ATCGGCATCT ATGGCACTTC GACGGCGACA ACAGGCAGCA CTGTCGGCGT ACGTGCGTCT
GTGGCGAGCG TCAGCGGCAC CTCTGCCATC CTGCAAAACA CCGCGGGCGG AAAATTATTG
AGCGGCCTCT CTGGCAGCGG ATCGAGCGAA GTCTTCAGCG TGCTTGGCAA CGGGAACCTC
ACCGCCGGCG CAGCGAGTTT CAACGGGCCC GTCACCTTCG CTTCCGGCCA GACTTTCCCC
GGAACGCTAC CCAATTTCGG CGTGCAAACC ATGAATGGAG ACCTGATCCT GCAAGGTTTA
TCCAGTCCTA CGCCGTTGAA CGCAGCCAGT GACAGTACCG CCGGAACCTA CTTCACGCTC
TTCAACAACA GCGGCAGCGG AAGTGGCTGG CAGTTCGTCA CCACCGGCAC GGGCGCCTCG
CAAGGAGCCG GACACTTGCT GTTCTACGGA GGCCCAAATC CCCAGTCGGT GATCATCCAG
GCGCCTGTAA GCGCCGGCGA TGTCACATCG AGCTTCCTGC ATTCGAACAC TACGGTGCGC
GCGGAAACCG GGCTCTCCCT CGGCGGCAAT GCAACTTTGA AAGTCGATGC GCCGGGCATT
GTCGGCGGGC AGTTCGTGGT TCAGAACGGC ACCATGAGCA TTAATCAGGA CGTTCCCGTG
AGCAGCAATT CGCGCATGGT GTTCACCGGT TACCTGTTCG GCGATACCGG CGACTCCGGC
CTGCTCGGGT CGACTCTCGG TTACATCCAT CCGGAACGCG ACATCGTGGT CACCGGAATT
TTTGGATCCA CCAACAACAA GGGAGTTGGG AACTGCGGCA ACGACGCGAT CATCACGCTC
GAGCAACCAG GAAATCCAAG TACACCCAAG GTCAACCTGG ACATCATCGA GGGCATCCCG
ACGTGGTCAA ACATGTTCCT GAGTGTCCCA TTCAACTCGG TTTACGATCT TCAAATCGTG
CTGACGCAAA ATTCAGGAGG CTGTGCGCCG TTTTCGCATA CCACGAATAA CCCGGTGATC
TCGGTGGTCT ACTACATGAA GTGA
 
Protein sequence
MKFVCAVVVL LAITLVGFSQ TQPATEVVVP HLIRFAGTVK GATGRVAITF SLHKSDRDDA 
ALWTETQNVQ LEDGKYTILL GATKAEGLPF DLFTSGEAQW LSIRVEGRPE QRVLLVSVPY
ALKAAEAETL AGHSASEFVT TEKVTNLVQQ QLQQQQTAPT KSATTKKDAG AKGNVLTSTA
TNFTDNTANQ VVLVTQKGAG NGLVSNSISA NGVSGTTASS AGVGVSGANT AATGLAIGVR
GSTVADSGIS VYGTAGGTSG TATGVKGISA APNGYGVFGQ NTATTGLAIG FRGTTASTSG
IGIYGTSTAT TGSTVGVRAS VASVSGTSAI LQNTAGGKLL SGLSGSGSSE VFSVLGNGNL
TAGAASFNGP VTFASGQTFP GTLPNFGVQT MNGDLILQGL SSPTPLNAAS DSTAGTYFTL
FNNSGSGSGW QFVTTGTGAS QGAGHLLFYG GPNPQSVIIQ APVSAGDVTS SFLHSNTTVR
AETGLSLGGN ATLKVDAPGI VGGQFVVQNG TMSINQDVPV SSNSRMVFTG YLFGDTGDSG
LLGSTLGYIH PERDIVVTGI FGSTNNKGVG NCGNDAIITL EQPGNPSTPK VNLDIIEGIP
TWSNMFLSVP FNSVYDLQIV LTQNSGGCAP FSHTTNNPVI SVVYYMK