Gene Acid345_3934 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3934 
Symbol 
ID4071317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4649007 
End bp4652360 
Gene Length3354 bp 
Protein Length1117 aa 
Translation table11 
GC content59% 
IMG OID637985960 
Producthypothetical protein 
Protein accessionYP_593008 
Protein GI94970960 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.515383 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAAGT TCTTCGCGTC CGTTTGGGGC CCTGCGGCGT TTCTTTGTGT GTTCCTTTCC 
GTGAGCTTTG CTCAACAAGC TTCAGGCCCT GATGCGAAGC ACCTGAGTCC TTCGCAGCGG
GAGTTCCTCG AACAGCGATC CGTTCCCGGT AAGGGCATTC CGGCTGGAGC CTATGCAAAA
GCTGTAGAGC AGGCGCGCGC GATCCGCGCC CGGGAATTGG CAGCGGGAAC GAATACTTCG
CTGCCAGCGT GGTCACCGGC GAAGCCGAAT GCGAATGACG ACAGCGCGAA TGGCAACGGA
ATCACGACCG GACGAGTGAC TGCAATCGCG ATCGATCCGA CAACTTCCGG CGCGAGCCTC
ACGGTCTACA TCGGCACCGG TGGCGGAGGC GTGTGGAAGA GCACCGATTC AGGTACGACT
TGGACCCCCC TGACCGATAC GCAGGCAAAC CTGACGATCG GTTCGTTGGC GATTGATCCC
AACAATCACA GCATCATCTA CGCAGGCACC GGGGAACTCG ATTTCGCAGC CGATTCTTAT
TACGGCGGTG GAGTTCTGAA ATCCACGAAT GGCGGAACCA CCTGGAGCAT GGTCGGCCAG
AGTACCTTCG GAGCGGTGGA GGGAGCGTCG TTTACCTACA ATGGTCCGGC GCGTATCGGA
GCGATCGCGG TGCAGCCGAG CGTGCCGAGC GGAACACCCG TGGTTCTGGC CGGGACCGCC
TACGGTTATC AAGACAGCAC GCACAAGAGC GAGTCAGGCA TCTGGCGTTC CACCGATGGC
GGCACCACAT GGAACCGCGT TCTCCCAGAC TCCAGCACAG ACATCCCGTA TGCCTTTGGA
ACCTCGATCT TCTGGTTAAA CAATACGACG GCGTACGCGG CGATCGGAAA TGTTTACGGA
TATGCATCGG TGCCTGGTGG TGTTTACAAG TCCACCGATA GCGGTGCGAC CTGGACGCCA
GTGAATGGCT CGACTGGGCA TGCGTTGCCA GTGGGCACGG ATTTCGGGAC CATCGTGATG
GCGCCCGCCG TGAGCACACC GGGAACGATT TATCTCGCTG CGGAAGAAGT ATCCACCGGC
GGTGGTCTCC AGAACCTCTA CAAGACCACT GACGGTGGCA CAACGTGGAA CCCGATCTCG
ACGCCATTGA ACGCGGGGGG AACGACGAAT GATTTCTGCG GAAGTTTCTG CTGGCACTCC
ATGGTGATTG CCGTCGATCC GGCGAACGCG AACAACGTCG TCGTCGGAGG AACTAACGGC
GACAGCTTAT ACACGGACAC GACGGGAGGT ACGACCGGAT CGAGTGCGTG GAAGTCGCTG
AATACTGGCA CAGCGGGGTT CAAGATTCCC CCAGGGATCC ACGCGTTTGC ATTCATCGCT
GCTGGAGGCG CGTTCGTAGG CGGAGACAAG GGGCTTTGGA AGACGACGAC GCTGTCCGCA
GCTCCGCCCG CGGCGAAGAA CCTGAACGGA CCGACGTGGG AAGAGCTGAA TGACCCAAAC
GACCCCAACT ATTACGGGGG CTACTACGAC GCTTACTACA CGGCATACAA CGGGCGCCGT
AAAAAGGCTG TGCAAGCGCA GGAGTTCCCG ATTGGCTCGG GAATGTATCA GGTCACGCCC
ACAACCATCA AAGTGATCAA TATGCGTTGT CAGGGAACCC CGGCGGTTTT CGATCCGAAC
GCTTCAGTCG CGATGTTCGT GGCGTGTTCT CCGGCCAATG GCGGCCCGCA AGTGTCGCTT
ACGGGAGGTG ATCCGGGGAC GTGGAACCCG ATGACGACCG GAATCAATCT GGCCGACAAC
TCGGCGTTCT ATCCTCCGAT CCTCTTCGTT CCCTATCCTG GGTTGCCGAC GATGCTCTAC
GGTACGACGC ACATCTATCA ATCCACAAAT GCAACGGATC CCTCGGCGCC CACGTGGGAA
GACCTCGGCA GTGCAGCGAT CGCGGAATTC GGAGGAGCGA CTACGACCTT GGACACGGCA
CTTGCGCAGA ATGGCGGACA CCAAGGGACG GTAAGAGCCA GCACCTCGGC AATTACATCG
GTGACGAACG ACACGCTGTT CGCGGGATCC AACGATTCGT CGGTGAACTA CTCGACGAAC
GGCGGCGTGA ACTGGGCGCA CATCCGCAGC ATCCTGCCGT ATCGTCCGGT GACTCGCGTG
ATGGCCGATC CGCTCGATAG TACGAAAGTC TTTGCGGCCT ACGCAGGCTT CAGCGGCTTT
GGCGATAGCG TGGGACATGT GTTCCTGTGT TCGATCACCA GTAACACCTG CACCGACGTC
AGCGGCAATC TGCCGAATGC GCCGGTGAAC GACCTGGCTT ACGATCCCGA TTTTCCGAAC
ACGATGTTTG CGGCAACCGA CGTAGGTGTC TTCACTGCGA CGACTGGCGG AACCGCGTGG
TCAACATCTG GAACGGGTTT GCCGAATGTC GTGTGCAAGA GCTTTACCGT CTCGGAAATC
GATCGCACAT TGCAGGTGAA TACGCACTCG CGCGGAGTCT GGACCCTGAA TCTGCCGCCG
CTTGTGCTGG CGACGATGAC GAACCCGGCG CCAGGATCAA AGTTCTCTGG TGCGAGTGCG
AACTTTACGT GGAATGCCGG ACAGAGCGCC ACGGGCTACT CTCTCTACGT AGGAAGCACC
GCCGGCGCGC ACGACATCGC CTACGTGAAT GCGGGGACAG CGCTCACCAC GCCGGTGTCG
GGACTGCCAA CGAACGGCGA ACGCTTGTAC GTAACGCTGA ACACGCTGAT CTATGGCAAC
TGGCACGCGA ATAGCTATAC GTACGTCGCC AGCGGCACGG GTGCAGCGGC AACGATGACA
TCGCCGGCAA ATGGGTCAAC CTTCAGTGCG GCGAGTGCGA CGTTTAACTG GACGGCGGGG
GCGGGGATTA CGCAGTACTC GCTATACATT GGCACCACCT CCGGCGCGCA CGACATTGCT
TACATCAACG CAGGAGCGGC ACACACTACG ACCTTCAACA CTCTCCCGAC GAACGGCGAG
AAGATCTACA TCGCCCTGTA TTCACTAAAC GGTAATACAT GGCTGGCGAA CTACTACGTG
TACTACGCAC CTGGAACCGG AACTGCTGCC ACGATGACTT CGCCGGCGCC CGGATCGACA
TTTACCGGAT CCTCGGCGAC ATTTAGCTGG GGCGCAGGCA ATGGCATATC GGAATACTCG
CTGTATGTAG GAACAACCGC GGGAGCTCAC GATATCGCCT ATGTGGATAC CGGAAAGGCG
ACCTCAACGA CGGTGAATAC GTTGCCGACG AACGGATCAA AGGTGTACGT GACTCTGTAT
TCGTTGAATG GAAAAACGTG GAGGAAGAAC TCGTACACGT ATACGGCGAA GTGA
 
Protein sequence
MRKFFASVWG PAAFLCVFLS VSFAQQASGP DAKHLSPSQR EFLEQRSVPG KGIPAGAYAK 
AVEQARAIRA RELAAGTNTS LPAWSPAKPN ANDDSANGNG ITTGRVTAIA IDPTTSGASL
TVYIGTGGGG VWKSTDSGTT WTPLTDTQAN LTIGSLAIDP NNHSIIYAGT GELDFAADSY
YGGGVLKSTN GGTTWSMVGQ STFGAVEGAS FTYNGPARIG AIAVQPSVPS GTPVVLAGTA
YGYQDSTHKS ESGIWRSTDG GTTWNRVLPD SSTDIPYAFG TSIFWLNNTT AYAAIGNVYG
YASVPGGVYK STDSGATWTP VNGSTGHALP VGTDFGTIVM APAVSTPGTI YLAAEEVSTG
GGLQNLYKTT DGGTTWNPIS TPLNAGGTTN DFCGSFCWHS MVIAVDPANA NNVVVGGTNG
DSLYTDTTGG TTGSSAWKSL NTGTAGFKIP PGIHAFAFIA AGGAFVGGDK GLWKTTTLSA
APPAAKNLNG PTWEELNDPN DPNYYGGYYD AYYTAYNGRR KKAVQAQEFP IGSGMYQVTP
TTIKVINMRC QGTPAVFDPN ASVAMFVACS PANGGPQVSL TGGDPGTWNP MTTGINLADN
SAFYPPILFV PYPGLPTMLY GTTHIYQSTN ATDPSAPTWE DLGSAAIAEF GGATTTLDTA
LAQNGGHQGT VRASTSAITS VTNDTLFAGS NDSSVNYSTN GGVNWAHIRS ILPYRPVTRV
MADPLDSTKV FAAYAGFSGF GDSVGHVFLC SITSNTCTDV SGNLPNAPVN DLAYDPDFPN
TMFAATDVGV FTATTGGTAW STSGTGLPNV VCKSFTVSEI DRTLQVNTHS RGVWTLNLPP
LVLATMTNPA PGSKFSGASA NFTWNAGQSA TGYSLYVGST AGAHDIAYVN AGTALTTPVS
GLPTNGERLY VTLNTLIYGN WHANSYTYVA SGTGAAATMT SPANGSTFSA ASATFNWTAG
AGITQYSLYI GTTSGAHDIA YINAGAAHTT TFNTLPTNGE KIYIALYSLN GNTWLANYYV
YYAPGTGTAA TMTSPAPGST FTGSSATFSW GAGNGISEYS LYVGTTAGAH DIAYVDTGKA
TSTTVNTLPT NGSKVYVTLY SLNGKTWRKN SYTYTAK