Gene Acid345_0941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0941 
Symbol 
ID4070823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1197457 
End bp1199832 
Gene Length2376 bp 
Protein Length791 aa 
Translation table11 
GC content59% 
IMG OID637982948 
Producthypothetical protein 
Protein accessionYP_590018 
Protein GI94967970 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.10709 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTCGCT CGACCATTCA GTCTCAGTCC ATCGTTTCCC TGGCTCCTGC TCCGAGACCA 
CAAGGATCAA GCCTTGCCCG CCAGGCGACT GACGCAACGT TCCAGAAGGC TCCGGCCAAC
TTTCACGTTT TTGTCGCTGC CGGGGTAGGA GAGAACGCGG GCGCGGAAAC ACTGCTCTTC
AATTTCTCCG CGGCAACCAC CCTCACGCAC ATCAAGATGG CGAACAAAAA CTTTGTCATC
GTGCCGGGCG GCTCTTGTTA TGAGGGGAAC CACTACAACC GCGGTGAGAG TTGTTCGCTG
CTGGTGCGGT TCACCCCACA GGGACCGGGC CACCGACTTG GCTTTGTGCA GATGACGAAT
TCATCCGAAA CCGCGGTCAC GAGCTTCGGC CTCGTCGGCA ACGGGTATGC CCCGATTCTG
AGCTTTACGC CTGCGCAGAT CACCACGGTG ACTGGGACCG CAACCGCCGG CACCGGGACG
ATCAATAACG CAACCAACAT GACGGTCGAT GGCGGGGACG TTCTATACAT CGCTGACGTT
GGGAATAACC GCATCAAAGA GATTGATTCC AGCGGTGCAA TTGGCACCAT CTCGCCAGCC
TTTGCCACTC CCCAGTCTCT GGCCGTGGAC AGTCTCGGTA TCATTTACTC CGCTAACACC
GCGGGTACCT ACTACTTCAG CTACTTCGCG CCTTGGGCGA CCCAGACTGC GTATGGGACC
ACTTACGCTC CGGGGAGCTG TACTCCGAAT ACTCCCTGCC CATTGACCAC AGTTGGCATG
TCTAAGCCGG CCAATATGAG CATTGATGCT TACGACAACC TTTTCTTTGA GGAGGGAACA
AAGGGCGCAG CCGAGATGCC TGTGGCCAAC GTTTCAGGCG GCGCCGGATC CTTCAACTTG
TGGTATCTCA CGAACCAGTT CGTCTACTCC AACGGGACTC CGGCATCGTT CGCAGTGGAT
GCCAGCGGAA ACCTCTACAA CAGCTATATC TATAACACCG CCAATACCTG CTTCCTGCTG
GAGGAGCCAC TCTACAATGC CGAGTACTCA CCCACGGCCA ACCGCGTGGC GGGGGGAGTG
AAATGCGGGT TCTCCGGGGA TGGCGGCAAG GCGAGCGGCG CCGAGATCAG CAACAAAATC
GGGCAGATTG CTTTCGACAT CGCCGGCAAC CTCTATTTCG CGGATGCCGG TAATCAGCGC
ATTCGCCGGA TTGATGCTGT CACTGGGATC ATAAGCACAA TTGCCGGAAA CGGAACGGCG
GGCAACGCCG GCGATGGCGG CGCGGCGGTT TTCGCGACGA TCAGCAATCC TACCGGTCTG
GCTGTCGATT CCCAAGGACA GGTGTATTTC CTGACGAACG CGCCGACGGC CGGGCCGACG
CAATCGATTC GAAAGCTGGG GACCTTCGGC TATTGGAAGT GGACTTATCA AGCCAAGGGA
ACTATTAGTG CTCCGAAAAT ATTCACCGTC TCGAATACCG GGAACCAGCC CCTCGCGCTG
AGTGCGAATG CATTCTTCGG AGGGACCAAT CCGACTGATT TTGCGACTGA TCCGAGTACC
ACGAGTTGTG GACTGACTGC GGGCGCGACC CTGGCGGCAG GCCACACCTG CCAGGTCGGC
ATCGTGTTCA AGCCGTCGAC CGTAGGTACC CGTACCGCGA GCCTTCTGTT CCCCGGCAAT
ACCGTTGCCG GCATCAACAG CATTCAACTG GTTGGCATCG GCTACCTGGG GACACCGACG
ATCACCATCA CATCACCGAG CGGCACCGCA ACGAAGGGAA CGACGATCAC CTTCACTGCC
AGTGTCACAT CAACGCCAAC GACGAAGCCG ACCGGAACCG TTACCTTCAA AGTGAACGGA
ACTCAGATCG GTAGCCCGAT TACGCTATCG TCGGGCACGG CATCTACAAC CTTCACCGAG
GCAACCGCCA ACACGTACAC GCTTACGGCC ACTTATAACG GCGATGTGAA TTACCCTGCC
GCGACAGCGA CGAAAAGCCT CGTCGTTAGC GGCACGATCA AGGCTGGAGT GCAGGTGAAC
CTTGCTCCTG CCGTCGAGCC CGCATCCGCG TGCGGTCCGG CAAGCTTTGC GGTTCGGGTA
TTGTCCGCGT CGGATGGTAC TCCGACCGGT ACGGTCGAAC TGATGAGCGG ATCCACCAGC
CTGGGGTCGG CGGTTCTCCG CGACGGTATC GCGACCCTCA GCACAACCGC CAACACGCGG
GGTACGCAGT CATTCTTCGC GAAGTATGGC GGTGATGACT TGCACCAGGC AGGAGTCTCA
GCTTCGGTGA CGAAGAGGAT TCCGGCGGCT GGCGTTTGCA ATGGTGAAAC TCGGGCGAGA
GGCTTCTCGG CGGTGTCGGC AATCCGAATT CGCTGA
 
Protein sequence
MVRSTIQSQS IVSLAPAPRP QGSSLARQAT DATFQKAPAN FHVFVAAGVG ENAGAETLLF 
NFSAATTLTH IKMANKNFVI VPGGSCYEGN HYNRGESCSL LVRFTPQGPG HRLGFVQMTN
SSETAVTSFG LVGNGYAPIL SFTPAQITTV TGTATAGTGT INNATNMTVD GGDVLYIADV
GNNRIKEIDS SGAIGTISPA FATPQSLAVD SLGIIYSANT AGTYYFSYFA PWATQTAYGT
TYAPGSCTPN TPCPLTTVGM SKPANMSIDA YDNLFFEEGT KGAAEMPVAN VSGGAGSFNL
WYLTNQFVYS NGTPASFAVD ASGNLYNSYI YNTANTCFLL EEPLYNAEYS PTANRVAGGV
KCGFSGDGGK ASGAEISNKI GQIAFDIAGN LYFADAGNQR IRRIDAVTGI ISTIAGNGTA
GNAGDGGAAV FATISNPTGL AVDSQGQVYF LTNAPTAGPT QSIRKLGTFG YWKWTYQAKG
TISAPKIFTV SNTGNQPLAL SANAFFGGTN PTDFATDPST TSCGLTAGAT LAAGHTCQVG
IVFKPSTVGT RTASLLFPGN TVAGINSIQL VGIGYLGTPT ITITSPSGTA TKGTTITFTA
SVTSTPTTKP TGTVTFKVNG TQIGSPITLS SGTASTTFTE ATANTYTLTA TYNGDVNYPA
ATATKSLVVS GTIKAGVQVN LAPAVEPASA CGPASFAVRV LSASDGTPTG TVELMSGSTS
LGSAVLRDGI ATLSTTANTR GTQSFFAKYG GDDLHQAGVS ASVTKRIPAA GVCNGETRAR
GFSAVSAIRI R