Gene Acid345_0206 details


Gene Information

Locus tag: Acid345_0206
Symbol:
ID: 4069675
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 218675
End bp: 220552
Gene Length: 1878 bp
Protein Length: 625 aa
Translation table: 11
GC content: 60%
IMG OID: 637982206
Product: peptidoglycan binding domain-containing protein
Protein accession: YP_589285
Protein GI: 94967237
COG category: [S] Function unknown
COG ID: [COG2989] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
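
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function name `cds_lengths` is hypothetical, and it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon, neither of which is stated explicitly in the record.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is counted in the gene length but not in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    protein_length = codons - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record for Acid345_0206.
gene_bp, protein_aa = cds_lengths(218675, 220552)
print(gene_bp, protein_aa)
```

Under those assumptions the result agrees with the Gene Length and Protein Length fields in the record.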


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.350389
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGCGCG ACTCGCGGTC TATAATCTCT CACGTTTTGG GAACGGCGCA GCACGCAAGA 
GCGCGCACGA GCAAGGCCAA CATCGTCTAC GTTGACTGCC TACTGGCCAC GGATTCACAA
CGCAAGCTTG CCGCAAAAAG AACGGACGAT ACCGCGATGA TGAACCTTGG CTTTGCCCGA
TGTGCTTTTG TGCGCGGGTG TACGTTCGGC TTGTGTGTGG TTGGGATCCT CGCGACGAGT
GCATGCGCGA CGGGCAAGGC CCTCCTTCCA GGCGGCGAGA CGTTTCAGGC CGCAACCAGC
GGACCGACTG TTGCCGATAG CTCGCTGCGT GAGATTGTTG CCGCGGGGCA GCTTTCCGAC
CTTCGGTGGC CGGATTTTTC GGATTACCGC GCTTATGTGC AGACCTTTTA CGAGTCCTCT
GGGTACAACC TAGCTTGGAC TCGTGGCGGC CAGACCACAC CCCAGGCGCT GGCGATCATC
GAGATTTTGA AACAGGCGGA CGGCAAAGGC TTGAACGCGG AAGACTACGA CGCTTCCCGA
TGGGCTGATC GGACAAAACA GCTGAGCCAG CCGGCTGCGG CAGCACGGTT CGACACGGCA
CTCACCGTCT GCGTGATGCG CTACATCTCC GACCTGCATA TCGGGAGGGT CAATCCGACG
CACGTCAAAT TCGCACTCAC CGGAAGAAGC GCGAAATACG ATCTCCCGCA ATTCCTGACC
CAACGCCTGG TGAACGGCCA GAACGTTGAA GCGGAGCTTG CGGCGGTGCA ACCTCAATTC
GCCGGCTACA AAGCGACGCA GGCCTGGCTG CAACGCTACA TAGAGCTGGC GCGCCAGGAC
AACGGTGAGC AGTTGCCCGT TCCGACGAAG GCCCTTGATC CAGGAAAGCC CTATGCGGGG
ATACCGCGCC TTACGAGTTT GCTGCACCTG CTCGGTGACC TGCCGGCCGA TGCGGTTGTC
CCGGCCGGTG ACGTTTACCA GGCGCCATTG GTGGATGCGG TGAAGCGTTA CCAGTCCCGC
CATGGTCTCA CAGCTGATGG TCGCCTGGGA GCCCAAACCG TGAAGGAACT CAATACGCCG
CTAAGCACCC GCGTGGAACA GTTGCGCCTG ACCCTGGAGC GCTGGCGGTG GCTGCCGCAG
GAGTTCCCGC AACCTCCGGT GGTGGTGAAT ATTCCGGAGT TCCGCCTGCG AGCCTATGAC
GCGAACCACA AGGTTGTGTT GAGTATGAAT GTGGTGGTGG GCAAAGCGCT CCGCCACGAG
ACGCCGGTTT TCGACGACGA AATGAAGTAC GTTGTTTTCC GTCCGTACTG GAATGTACCG
CCGAGCATTC AACGTTCCGA GATTGTGCCC GCCATTCAGC GCGATCGCGA CTATATATCG
AAGAAGAACT ACGAAGTGAC CACGCAGGCT GGGCAGGTCG TGACCTCAGG CACCATCAGC
GATGAGGTGC TGCAGCAGTT GCGTGCGGGG AAGCTCGCGG TGCGGCAGAA GCCGGGGCCC
ACCAATGCAC TGGGTCTGGT GAAGCTGATC TTCCCCAACC AATACAACGT CTACCTGCAC
AGCACGCCCT CGCAGCAGCT GTTTTCGCAA GCGCGGCGGG ACTTCAGTCA CGGTTGCATT
CGCGTAGAAA AGCCGGCCGA GTTGAGCGCC TGGGCGTTAC AGGACAAACC GGAATGGACG
GTGGAAAGAG TCCGCGCCGC GATGCAAAAG GGACCGGACA ACGTCCAGGT CAACCTGTCG
AAGCCAGTGC CAGTGCTCAT TCTCTATGGC ACTGCGGTCG CCGAGGAAGA TGGATCCGTC
CACTTTTTCG ACGATCTCTA TGGGTACGAT GCGGACCTTG AGAAGGCCTT GGCAAGGGGA
TATCCGTACC CCTTGTAA
 
Protein sequence
MSRDSRSIIS HVLGTAQHAR ARTSKANIVY VDCLLATDSQ RKLAAKRTDD TAMMNLGFAR 
CAFVRGCTFG LCVVGILATS ACATGKALLP GGETFQAATS GPTVADSSLR EIVAAGQLSD
LRWPDFSDYR AYVQTFYESS GYNLAWTRGG QTTPQALAII EILKQADGKG LNAEDYDASR
WADRTKQLSQ PAAAARFDTA LTVCVMRYIS DLHIGRVNPT HVKFALTGRS AKYDLPQFLT
QRLVNGQNVE AELAAVQPQF AGYKATQAWL QRYIELARQD NGEQLPVPTK ALDPGKPYAG
IPRLTSLLHL LGDLPADAVV PAGDVYQAPL VDAVKRYQSR HGLTADGRLG AQTVKELNTP
LSTRVEQLRL TLERWRWLPQ EFPQPPVVVN IPEFRLRAYD ANHKVVLSMN VVVGKALRHE
TPVFDDEMKY VVFRPYWNVP PSIQRSEIVP AIQRDRDYIS KKNYEVTTQA GQVVTSGTIS
DEVLQQLRAG KLAVRQKPGP TNALGLVKLI FPNQYNVYLH STPSQQLFSQ ARRDFSHGCI
RVEKPAELSA WALQDKPEWT VERVRAAMQK GPDNVQVNLS KPVPVLILYG TAVAEEDGSV
HFFDDLYGYD ADLEKALARG YPYPL
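
The protein sequence above corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is a minimal illustration, not the pipeline that produced this record: the names `translate`, `CODON_TABLE`, and `START_CODONS` are hypothetical, the start-codon set is a deliberately partial assumption (GTG, the first codon of this gene, is among the alternative initiation codons of table 11 and is read as Met at the start), and only the first and last lines of the gene sequence are used to keep the example short.

```python
# Codon table in the conventional T, C, A, G ordering.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a partial set of initiation codons, enough for this example.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, is_cds_start: bool = True) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (the 10-base blocks used in the record) is ignored.
    If is_cds_start is True, an initiation codon in first position is
    translated as Met (M) regardless of its usual meaning.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and is_cds_start and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "GTGTCGCGCG ACTCGCGGTC TATAATCTCT CACGTTTTGG GAACGGCGCA GCACGCAAGA"
last_line = "TATCCGTACC CCTTGTAA"

print(translate(first_line))                     # start of the protein
print(translate(last_line, is_cds_start=False))  # end of the protein
```

The two printed fragments match the first 20 and last 5 residues of the protein sequence listed above (MSRDSRSIIS HVLGTAQHAR and YPYPL).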