Gene Acid345_4612 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4612 
Symbol 
ID4070769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5464902 
End bp5466215 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content60% 
IMG OID637986652 
Producthypothetical protein 
Protein accessionYP_593686 
Protein GI94971638 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.439374 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACAA GCCCTGAGAC CTTGCCCGGC CTCCCAGCCC GCCAATCCCG CGGGCTGGCG 
CAGTCTCTGC GCATTGCGCC GGCGCACTCG CGGCTTGTGG GCGGCAGTCT CATCATGCTC
GGCGGCATGG TGCTCGTCAG CCTTCTGAAC TTCGGTTACA ACATCGCCGT TGCCCGCATG
CTCGGCGCCG CCGAATTCAG CCAGGCAGCA GCGGCGGTCA CCCTGCTGAT GATTGTTTCC
TGTCTCACAC TGGCTTTCCA GATGGTCTGC GCCAAGTTCG TGGCCAGGAA CGCAACCAAC
TCGGAGAAAT CGCACGTCTA TCGCGCGCTG TTGCGCCGTG CCTGGACTGC CGGCCTCAGC
ATTGGCATTG TCCTTACGAT CTTCAACCGC CAGGTCGCTG CGTGGCTCAA CATGCCCTCC
GCGACGCTCG TTATCGTCCT CGCGCTCGGC ATGGCTTTCT ACGTTCCTCT CGGCGTGCGA
CGCGGCGGCA TGCAGGGTGT TTATCAATTC CGCCGGCTGA GCCTCAATTT CATCATCGAG
ACCAGCGTCA AGCTCGTCTC CGCAATCGTC TTAGTCCACT TGGGTTACGG AATTCTCGGC
GCCGTCGCCG CCATCTCCAT CTCAGTGGTG GCTGCCTACT TCCTTCCTCC CACTCCAATT
GCCTTACGTG AGCAGCCGAA AGCAGGGCTG CCGGCATCTT TTGGGGAAGG CATACAGGCG
ATCATCTTCT TCATTGGACA GGTGATCATC AACAACATCG ACATCCTGAT GGTGAAGCAT
TTCTTCCGAC CCGATGTCGC CGGTCTGTAC GCCGCAGTTG CTTTGGTCGG ACGCGTTCTT
TACATCGCGT CCTGGCAAGT GATCAGCGCT ATGTTTCCGA TTGCCGCCGC AGGCCGCTCC
GAATCCGAAG GCCGTGAAAG CCGAATGGTC GTGCTCATTC CATTCGGCTT CGTCACCGCG
ATGACCGTGG TCTTCATGGC GATTCTCGGT CTCTTCCCGC AAACGATCCT GCACTTGCTC
TTCGGCGCGA AGTTCAACAC TGACTCCAGC AACCTGCTTC TTCTCTACGC CGCCGCTACC
GGCGGTTACG CACTCAGCGT GGTTCTGATG GCCTACGAGA TGTCGCGCCG CATCGCCAAC
ACCGGCTGGT TCCAGCTCGT CATCAGCGGA CTCGTCGTCC TCGGCATCAC CATGTTCCAT
AACACGCTCC GCGACGTCAT CGTGGTGCAG CAGGTCCTGA TGGTCGTCCT ATTTACCGCC
GTAGCCGTGC CGTTTGTTCT CGCGCGGCGC TTCCGAACCC GGGGGGCAGC ATGA
 
Protein sequence
MSTSPETLPG LPARQSRGLA QSLRIAPAHS RLVGGSLIML GGMVLVSLLN FGYNIAVARM 
LGAAEFSQAA AAVTLLMIVS CLTLAFQMVC AKFVARNATN SEKSHVYRAL LRRAWTAGLS
IGIVLTIFNR QVAAWLNMPS ATLVIVLALG MAFYVPLGVR RGGMQGVYQF RRLSLNFIIE
TSVKLVSAIV LVHLGYGILG AVAAISISVV AAYFLPPTPI ALREQPKAGL PASFGEGIQA
IIFFIGQVII NNIDILMVKH FFRPDVAGLY AAVALVGRVL YIASWQVISA MFPIAAAGRS
ESEGRESRMV VLIPFGFVTA MTVVFMAILG LFPQTILHLL FGAKFNTDSS NLLLLYAAAT
GGYALSVVLM AYEMSRRIAN TGWFQLVISG LVVLGITMFH NTLRDVIVVQ QVLMVVLFTA
VAVPFVLARR FRTRGAA