Gene Acid345_1858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1858 
Symbol 
ID4069200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2234570 
End bp2235526 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content62% 
IMG OID637983867 
Producthypothetical protein 
Protein accessionYP_590933 
Protein GI94968885 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5653] Protein involved in cellulose biosynthesis (CelD) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.293436 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGATCA CGATCGCGAC ATCCCCAGCC GAGATCGAGC GCCTACGCCC TGCGTGGGAG 
CGGTTGCACG ACCGCGAACG CCAGAGCATC TTCCAGGACT ACACCCTCAA CTGTCTTGCC
GCGACCCATT TCGCGGAGCG CGAATCGCCT TACATCATGA TGGCGGAGAG CGAATCGTCG
GTGGCGATCG TACCGGCGGC GGTGCGTAAG CACGATGGGT CAATCACGTT GCTCGGCGAG
ACGTTGTTCG ACTATCGCGA TGTGCTTTGC GGCGGTACCA ATGAGGCGCT GGAAGCGGCG
TGGGCGGAGA TCGCCAAGCT GCAGCGTCGG CTCTCGTTCT TCGCGGTGCA TCCTGATGCG
CAAGCGCGCT GGCAGATGAT TCCGATGCAT GACTTCGCCA ATGCTCCGCG GGTGCGCGCG
GCGGATTGCG ACTCCGATGA GTTCCGTGCC TCGCATAACA AGCTCGGCAT GTTTTACCGG
CGGATGATCA AGCGCGGCGC GCACCTGTTC ACGCACGCGG GAGACAATTG TGCGCTCATC
CGAACGATTT ATGAGCGCAA GGCTTCACAG TTCCCGCACG AGACGAACAA CATTTTTCTC
GATCCGCATC GGCGCGAATT CATGGAGGTT GCGTGCGCTG CACTTGGGTC GCGTTGCGAG
ATTTTTTCTC TCGAGATCGG GACCGAGTTG ATCGCAGCGC TGGTGACGCT ACGCGATCAC
ACGGTGCGGC GCTTCTACAC CGTCTACTTC CACCCGGCCT GGTCGAAATT TTCGCCGGGC
GTGGTGCTCA TCTACGAGGT CACCGCGCGT TCCCTTACCG AGGGACTCGA CTGCGATTAC
CTGACCGGCG AGTATGGCTA CAAGAACCGG CTGGCCACGG CGATGGTGCC GCTGCGGCGA
GTAGAAGCGT CGGCCGAAGA ACTGGCGGCA ATCGCGGCGC GGAAGAAGGC GGCTTAA
 
Protein sequence
MRITIATSPA EIERLRPAWE RLHDRERQSI FQDYTLNCLA ATHFAERESP YIMMAESESS 
VAIVPAAVRK HDGSITLLGE TLFDYRDVLC GGTNEALEAA WAEIAKLQRR LSFFAVHPDA
QARWQMIPMH DFANAPRVRA ADCDSDEFRA SHNKLGMFYR RMIKRGAHLF THAGDNCALI
RTIYERKASQ FPHETNNIFL DPHRREFMEV ACAALGSRCE IFSLEIGTEL IAALVTLRDH
TVRRFYTVYF HPAWSKFSPG VVLIYEVTAR SLTEGLDCDY LTGEYGYKNR LATAMVPLRR
VEASAEELAA IAARKKAA