Gene Acid345_4188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4188 
Symbol 
ID4072147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4954751 
End bp4955764 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content58% 
IMG OID637986219 
Productprotein of unknown function DUF900, hydrolase-like 
Protein accessionYP_593262 
Protein GI94971214 
COG category[S] Function unknown 
COG ID[COG4782] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.90269 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.255036 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATTGGC TCATCACCAA TCGCAATATC GAAGCCAATG GTTTCGGCAC GACATTCGCA 
GACGTCACGT ACTGGACCGC TCCGGCAGAC GCCGATCCCA AGCAAAAAGC GTCGTGGAGC
CAGGAAACGA AGGACAGTTT CCGCGCCAAG CTCGTCGCGG TTGCGGATAC TTTTCCTCCT
CCGACCACCA CTCAGCCCGC GGACCAGAAG CACATTACGT TTCTCGTGCA CGGCTACAAC
AACTCGTGGT CGGATGCGAT GGGCCTCTAC CAGCGCGTTG CAACCAGCAT GTATTCCGGC
GCGAACGGCC TCGGCGAGTG CATTTCGTTC GACTGGCCTT CCAAGGGCGA CCTGATGGGC
TATCTGCCCG ACCGGAGCCA AGCGCGGCAA TCTGCCGAGG ATTTCGCCGA CGTGCTCAGC
GATCTCTACG ATTGGTTGCT GATCAAGGAA AACGCCGCCG CCAAGGATGT AAACAACGCC
TGCAAGGCGA AGACCTCGCT CATCGCCCAC AGCATGGGCA ATTACGTCTT CCAGTGCGCG
ATGAATTACA CCTGGACGAA GAAGAACCGT CCGCTGCTGG TAAGCCTGGT ACATGAAGCG
TTAATGGTCG CGGCCGATGT GGACAACGAT CTCTTCCGAA GCGGCGAAGT CGTCGAATCG
GGCGACGGTG AGGGCATCGC CAATCTCACG TACCGGATCA CCGCCCTCTA CAGCGGCCGA
GACGCCGTGC TGGGCGTCTC ATCGGGACTA AAGCATTTCG GCAAACGACG CCTCGGACGC
AGTGGACTCG ATCAGACTAC CGCGCTTCCG GACAACGTTT GGGACATCGA TTGCACCAAC
CTCATCCATC CGGATGTCAG CGGTATTTCC GTGCACGGAT CTTACTTTTT CCCCGACGAA
TCGGATTGCT ATCCCCTGAT GAGAGAGTTG CTTCGCGGCA TTGATCGCAG CGTGCTAATC
GCCAAAGGTA TGGTTCCCTC GGCGCTCTCG AAGACACAGA CCGTCGGATC GTAA
 
Protein sequence
MYWLITNRNI EANGFGTTFA DVTYWTAPAD ADPKQKASWS QETKDSFRAK LVAVADTFPP 
PTTTQPADQK HITFLVHGYN NSWSDAMGLY QRVATSMYSG ANGLGECISF DWPSKGDLMG
YLPDRSQARQ SAEDFADVLS DLYDWLLIKE NAAAKDVNNA CKAKTSLIAH SMGNYVFQCA
MNYTWTKKNR PLLVSLVHEA LMVAADVDND LFRSGEVVES GDGEGIANLT YRITALYSGR
DAVLGVSSGL KHFGKRRLGR SGLDQTTALP DNVWDIDCTN LIHPDVSGIS VHGSYFFPDE
SDCYPLMREL LRGIDRSVLI AKGMVPSALS KTQTVGS