Gene Acid345_2586 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2586 
Symbol 
ID4070549 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3054737 
End bp3055972 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content60% 
IMG OID637984603 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_591661 
Protein GI94969613 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGGGT CGCGCACCGC CCGCACGCTG GTTACAGCGC TTTATTTCAT TCTGCTCGCG 
CGCTCCCTTG GTTCCGACGG CTACGGTGCA TTCACCGCCG CGTGCGCCAT CATCGGTATC
GTCTCCCCAT TCGCAAGTCT CGGAACAGGC AACGTTCTAG TGCAGTATGT TGCCCGCAAT
CGCGAAGAGT TTCCGGTCCG CTGGGGCCAT TGCCTGCTGG TCACACTGAT CTCCGGCGTG
CTTCTTACCG CTGTCGTTCT CGTCATCGTT CCTCTTGCGC TAACTCACGG TGTGCCGCTG
CCGCTTATCG CCTGCATCGC CATTGCCGAA CTGATCTTCG CCCGCCTTCT GGACGCTGCC
GCGATGGCGT TTCAAGCCTT CGAGCGCCTC GCTGTCACAG GCTGGATCAT CCTCGCTCTC
AGTGTCAGCC GACTTCTTGC AGCGTCGCTT CTTCTCCATA CACGTCACGC GACCGCCGTA
CATTGGTCTG TCCTGTATCT CGCGAGCACC GTCGTTCCAG CAGTCATTGC GCTCTGTTAC
GTTGCCCGCG AACTCGGCGC ACCAAGATTT GGACGATGGA CAACGCCGCG CGACCTCCTC
ACCGGCCTTT ACTTTTCAGT CTCACAAGCC GCGCAAACCG TCTACAACGA CATCGACAAA
GCCATGCTGG CGCGACTCTC TGGCCTCGCG GCCGCCGGCA TCTACGCCGC TGCATATCGC
ATTATCGACG CCGCGTTCTC GCCCGTTCTT TCTGTTCTCG CCGCGACGTA CGCAGGATTC
TTCCGCCACG GCCAGCAAGG GCTGCAAAGT TCCTCGGCGT TCGCTCGCCG GATCATGCCG
CGCACGTTCC TTTACTCTTT GTTCGCCGCG TCACTCCTGT GGGTCGCCGC TCCCTATCTG
CCGTTCATCA TTGGTCCGCA GTTCAACGAC TCCGTGTGGG TTCTCCGCTG GCTCTCACCG
CTTCTGGTCC TGCGCAATCT GCACTATCTC GCAGCTGACT CTCTCACCGG CGCCGGTTAT
CAATCCACAC GCACGTTATT GCAGCTCCTT GTCGCAGCAC TCAATATCGG CTTGAATATC
GCCCTTTTGC CGCGCTACTC ATGGCGCGGC GCCGTCTGGA CCAGCCTGGC CTCCGATGGC
GTTATAGCAC TCGCCATGTG GATCGCCCTC TGGTACTTGC GCGCCCGAGA AGTTAAACGC
TGCAACGTCC TCAGCCCGGA GTGGGTCGAA GCATGA
 
Protein sequence
MIGSRTARTL VTALYFILLA RSLGSDGYGA FTAACAIIGI VSPFASLGTG NVLVQYVARN 
REEFPVRWGH CLLVTLISGV LLTAVVLVIV PLALTHGVPL PLIACIAIAE LIFARLLDAA
AMAFQAFERL AVTGWIILAL SVSRLLAASL LLHTRHATAV HWSVLYLAST VVPAVIALCY
VARELGAPRF GRWTTPRDLL TGLYFSVSQA AQTVYNDIDK AMLARLSGLA AAGIYAAAYR
IIDAAFSPVL SVLAATYAGF FRHGQQGLQS SSAFARRIMP RTFLYSLFAA SLLWVAAPYL
PFIIGPQFND SVWVLRWLSP LLVLRNLHYL AADSLTGAGY QSTRTLLQLL VAALNIGLNI
ALLPRYSWRG AVWTSLASDG VIALAMWIAL WYLRAREVKR CNVLSPEWVE A