Gene Acid345_4721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4721 
Symbol 
ID4070659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5584412 
End bp5585368 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content61% 
IMG OID637986765 
Productglycosyl transferase family protein 
Protein accessionYP_593794 
Protein GI94971746 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0859] ADP-heptose:LPS heptosyltransferase 
TIGRFAM ID[TIGR02193] lipopolysaccharide heptosyltransferase I 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGGCT GGGCGATTGA GGAGCGCTGG GCTCCCCTGC TTTCGAGTCC GGAGGCGCGT 
TTTGCCGCGC GCGGTGAAGG CAAACCGCTG GTGGACATCC TGCACATTGT GAATACCCGT
GCGTGGCGGA ATGCCTGGCT TTCCGGTGAG ACGTGGCGCG AGATCAAAGA CACCGTCACA
GGATTGCGCG CCGCGCAGTA CGATGTAGCC ATCGACATCC AAGGCGCCAT GAAGTCGGCC
GTGCTTGGAC GCCTTGCGCG TCCTCGAAGG CGCTTCGGGT TTGCGCAGCC GTGGGAAGGT
GCGGCCACCA TGTTTTACAG CCACCAGGTC CAGCCAACCG GAACGCATAT TGCCGATCGA
AACTGGTCAT TGGCAACGGC AGCTGGAGCT ACGGCAAGAC CCGAACATTT ATTTCCGATC
CCGATTGATC CGGTAGCGGA AGCGTGGGCT GATGCCAAGC TGCGCGAATG TGGCTTCAAG
GAATTCGCGA TTGTGAATCC GGGGGCAGGC TGGGGATCGA AGTGCTGGCC GGCAGAGCGC
TTTGGAGAAG TGGCGAAGCG CTTGGCGACA AACGGAATTG CATCCGTGAT TAACGCGGGG
CCAGGCGAAG AAGAGTTGGG GAAAGCGGTG GAAAATGCGA GTGGAGGAGC GGCGATGCAG
CTTTCCACTA CGCTCTCAGA GTTGATTGCG CTGACGCGGC GGGCAACGCT TTTTATCGGC
GGCGATACCG GGCCTTTGCA TCTCGCCGTC GCACTGAATA CACCGAGCGT GGCGCTGTTT
GGGCCCACGG ATCCCGCGCG CAACGGGCCG TATGGCGGGA GAGCGGTTGT CGTGCGTAGC
CCGCAAAGTG TGACCACGTA CAAGCGGTCC GCAGCGATGG AAGGCGGGCT GTCGTCCATC
ACGGCGGAAG AAGTGCTTGC GGCCGCCGGA GAGATATTGG GGAAGACGAT TGCCTAG
 
Protein sequence
MIGWAIEERW APLLSSPEAR FAARGEGKPL VDILHIVNTR AWRNAWLSGE TWREIKDTVT 
GLRAAQYDVA IDIQGAMKSA VLGRLARPRR RFGFAQPWEG AATMFYSHQV QPTGTHIADR
NWSLATAAGA TARPEHLFPI PIDPVAEAWA DAKLRECGFK EFAIVNPGAG WGSKCWPAER
FGEVAKRLAT NGIASVINAG PGEEELGKAV ENASGGAAMQ LSTTLSELIA LTRRATLFIG
GDTGPLHLAV ALNTPSVALF GPTDPARNGP YGGRAVVVRS PQSVTTYKRS AAMEGGLSSI
TAEEVLAAAG EILGKTIA