Gene Acid345_0687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0687 
Symbol 
ID4068777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp844985 
End bp846493 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content62% 
IMG OID637982693 
Producthypothetical protein 
Protein accessionYP_589766 
Protein GI94967718 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAGACT TTCATGGCAG CTTCGAGTAC GCCGGTCCGG ACCGCGCCGT CCGGCAACAA 
GGCAATTGCC GCATTCAAAT CGATCCCGAG CTATTCACTT GCTCGCCCGA CCTCGGCACG
CCCATCGTCT TCGATCTCGG CGACATTGAC GCCATTCACA CCGAACGTTT CACCATTCGC
ATTGCGCTCT ACACCGGGTC AACTCTCACG CTGAATCAGC TCGGCCGCGA CATGCAACCG
CTGATCAGCG ATCTCATGCC AAAGTTCCGC GACCGCATGG CGAAGTGCCT GCTTGTCGGC
GATCTCGAAG AAGTAGATCG CTTCGAAGGT TCATTCCAAC TGGAAAGCGT GGTTCAGAAG
TGCCCGTCCT GCGGCGCCGC GACTACCTCA TCGAAGTTCT GTCCGTCGTG CGGCAAGCCG
ACGCATCCGG AAGAGACTGC GCCAATGCAA TGCCCAGGTT GCGGCGCAAG TGTGCAGGGC
AAGAACTTCT GCCCTGAGTG CGGCGGGGCG CTGCGAGCTA ACACCGCCGT CGCGCCGCAC
GGAGGCCCCG CGCAGATCCG TCTCTACAAG AGCAACGTCG GCGTGCTCGC CACGGAATCC
CAGAGCTTTC AATGGCGTCT CGCAGATCTT GATGCGGTCA AGGCCGATCC GAAGACCTAC
CAGGCCGTCC TCGAGATCGG CTGCACCGCA CTACGCATCA ACGAACTAGG CAAGCGTACT
GAAGATTTCG CGCGAAAAGT CCGCGAGACT ACCAGCGAAC TCCTCGCCAA TGGGTCGAAA
GCGCTGCACA CCGCGTTCCC GTTCCTCAAT CCCGACCAGC TTCAGTCGGT AGCACAACTC
CTGCGCGAAG GCAGTTCCGC CCCGGTCACG AAGCTCGACG CGATCCATCC GCAGATGAGC
GCCGCGCTGG AGAAGAATGC TGTTGACGCT ACCCTCAAGC CCTACTACGA CGACCTCGTA
AAGCGCACCG CCCCCGGCTT GCTCTACGCC GGCTTCAAAA TCATTCGTCC CGAAGACGAA
GACCTCGCCG CCCCGAAAGA AGAAGCGGCG CCCGAAGATC AGGCTCCCGG CGATGCCGAT
GGTGCTCCCG CTGCCGACGC CGCTGGCCCG GCTACTCTCT ACTGGTTCTT TTTCCCGCTC
TCCACAAAGC CCGGCAGCGC TGATCTTGCC AATGCCGTCG CGTGGGAAGC CAGCTCTGCC
TCCGGCCGTG CCACCTACTT CTTCCGCCTC TTCGATCGCG CCGAGAAAGC CAGGTTGCAA
GATCCTCTAG CTGTCGCCGA TTCCATCCGT CGCCTCAACT CGGTCCTAGG CACGCTCAAC
TTCCGCCGCC GACCGATCTA CCTCAGCGAC GCCGAACTCA ACACCGACCC TCGCTTCCAT
CGCTATGCCA TCGCTGCGCG TCGCATCCCG GAGTTGCGCC AGGTGCGCGC CAGCCTGCTC
GGGCGCGCCA TCCACAGCAG CTTCGACGCA TGGCAGACCC AGGTCGCCAA CCTCCTCGAT
AAGGCATAA
 
Protein sequence
MADFHGSFEY AGPDRAVRQQ GNCRIQIDPE LFTCSPDLGT PIVFDLGDID AIHTERFTIR 
IALYTGSTLT LNQLGRDMQP LISDLMPKFR DRMAKCLLVG DLEEVDRFEG SFQLESVVQK
CPSCGAATTS SKFCPSCGKP THPEETAPMQ CPGCGASVQG KNFCPECGGA LRANTAVAPH
GGPAQIRLYK SNVGVLATES QSFQWRLADL DAVKADPKTY QAVLEIGCTA LRINELGKRT
EDFARKVRET TSELLANGSK ALHTAFPFLN PDQLQSVAQL LREGSSAPVT KLDAIHPQMS
AALEKNAVDA TLKPYYDDLV KRTAPGLLYA GFKIIRPEDE DLAAPKEEAA PEDQAPGDAD
GAPAADAAGP ATLYWFFFPL STKPGSADLA NAVAWEASSA SGRATYFFRL FDRAEKARLQ
DPLAVADSIR RLNSVLGTLN FRRRPIYLSD AELNTDPRFH RYAIAARRIP ELRQVRASLL
GRAIHSSFDA WQTQVANLLD KA