Gene Acid345_0723 details

Gene Information

Locus tag: Acid345_0723
Symbol:
ID: 4069795
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 885627
End bp: 886784
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 59%
IMG OID: 637982729
Product: ceramide glucosyltransferase, putative
Protein accession: YP_589802
Protein GI: 94967754
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID: [TIGR03472] hopanoid biosynthesis associated glycosyl transferase protein HpnI
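
The start, end, gene length and protein length listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates, an unspliced CDS, and a terminal stop codon counted in the gene length but not in the protein length; the function name cds_lengths is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon that is
    part of the gene but not of the protein.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


gene_len, protein_len = cds_lengths(885627, 886784)
print(gene_len, protein_len)  # 1158 385
```

Under these assumptions the result matches the listed Gene Length (1158 bp) and Protein Length (385 aa).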


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCACTACG CGCACCTCAT CCTGGAAGCA CTGACCGTGA TTGGTGCGGT CAGCGGCACA 
GCGTATTACG CGCTGTGTTT ATGGGGTGCT GCGCGATTTA TCCGCGAGCG GCGCGCGGCA
CAAAGCGAGG CGTTTACGCC GCCGGTGAGC ATATTGAAGC CGCTGAAGGG CGCGGACCCG
AGCATGTACG AGGCGTTCCG CAGCCACTGC CTGCAAGATT ATCCCGAGTA CGAAATCGTC
TTCGGTGTCG CGGACTTGCA CGATCCGGCG GCACAGGCTG TCGAGCGATT GCAGCAAGAA
TTTCCGGAAC TCACGATCAA GTTGGTGCAG TGCTCTCCTT CGGGCGGCAC CAATCGAAAA
GTTGCAACCT TGCAGGAGAT GCTACCGCAC GCGCGGTACC CGTACCTCCT GATCAACGAC
AGTGACATTC GCGTAGGAAC TAATTACTTG CATGAAGTCA TGGGTCCGAT GCTGGACTCG
AAGGTCGGCA TGGTGACGGC CCTGTATCGC GCGGCTCCCG GGAAGACACT CGGATCGAAG
CTGGAAGCAA TTGGCATTGG AACCGACTTC ATGGGAGGGG TGCTGTCAGC CCGCGAGATT
GAAGGTGGGC TTCACTTCGC GCTCGGCTCG ACACTGACTT TTCCACGCGA AGCCCTCGAA
AAGATCGGCG GCTTCGCCCC TCTTCTTGAC TATCTCGCCG ACGACTACGA ACTGGGCGCG
CGAATTTCGC AGGCCGGATA TCAAGTCGCG CTGGCACGTA CGATCGTCGA AACCCACCTA
CCGGACTATT CGTGGCCAGC TTTCTGGAAG CACCAGTTGC GCTGGAACCG CACCATCCGC
GACAAGCGCA AAGGCGGATA CTTCGGCGTG CTGTTGACCT TCGGCCTCCC GTGGGCATTG
CTCACCGTGA TCGCGTCGCT GGGTGCGGGG TGGGCTTGGA TGCTCTTCCT TGCTGTCGTG
GTGGCACGTT ATGCGCTGGC TTTGACGCTG ATGGGGCCGA TTCTTCACGA CCGTAGAGGT
ACGGGCAATC TCTCGCTCGT GCCGCTCCGC GACTGCGTTG CGATGGTCCT ATGGTTCTGG
ACGTATTTAG GCGACGAGAT CGAATGGCGC GGCGAAACCT TTCGCCTGCG CGATGGAAAA
CTCATTCGAA TCGAATAG
 
Protein sequence
MHYAHLILEA LTVIGAVSGT AYYALCLWGA ARFIRERRAA QSEAFTPPVS ILKPLKGADP 
SMYEAFRSHC LQDYPEYEIV FGVADLHDPA AQAVERLQQE FPELTIKLVQ CSPSGGTNRK
VATLQEMLPH ARYPYLLIND SDIRVGTNYL HEVMGPMLDS KVGMVTALYR AAPGKTLGSK
LEAIGIGTDF MGGVLSAREI EGGLHFALGS TLTFPREALE KIGGFAPLLD YLADDYELGA
RISQAGYQVA LARTIVETHL PDYSWPAFWK HQLRWNRTIR DKRKGGYFGV LLTFGLPWAL
LTVIASLGAG WAWMLFLAVV VARYALALTL MGPILHDRRG TGNLSLVPLR DCVAMVLWFW
TYLGDEIEWR GETFRLRDGK LIRIE
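
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own method: it assumes the sequence is pasted with the 10-character block spacing used above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The helper names clean, translate and gc_fraction are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence shown above.
first_line = "ATGCACTACG CGCACCTCAT CCTGGAAGCA CTGACCGTGA TTGGTGCGGT CAGCGGCACA"
last_line = "CTCATTCGAA TCGAATAG"
print(translate(first_line))  # MHYAHLILEALTVIGAVSGT
print(translate(last_line))   # LIRIE
```

Both outputs agree with the start and end of the protein sequence above. Passing the full gene sequence to gc_fraction would give the value to compare against the listed GC content.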