Gene Acid345_2494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2494 
Symbol 
ID4069863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2947083 
End bp2948774 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content60% 
IMG OID637984511 
ProductO-antigen polymerase 
Protein accessionYP_591569 
Protein GI94969521 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.754605 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCAC CGGCCGCAGG ACAAGGCGAT TCTCTTCCGG CCACACTTCC CTCTCGGATT 
CTCGCGTTTT TTGCCGCTTC GATTCCGATC CTTCTCGCGT GTGTGTATTG CCCCGCGCTG
CAATCGGCAT TTTTCCCTCC GAAGAACGCG ATATTGGTAT GCGCGACGAT GGTGCTCGCG
ATCGGGGTGT TCTTTACTCG GAGCTCAATC CCCTGCGCTG TGAGCGCAAT CGAACGTAAT
TTTCTCTACG CTGTCGCGGC TTTTTTGTTC GGGGGTCTTC TCTCCGCTAG TTTTTCAGCC
CACAAAGAAT TGGCGATTCA ACCGCTTCTG ATCCTCATTG CCGGATGTCT TCTGATGCCG
CTGGCTGCAT CGGCCCTGCG CGGCCGCTCC GATTGGTTGA TCCATGGCAT TGCGATTTCC
GGCTACATCG TCGCAGCAGT CGCAATTGCG AACCGCTTCG GTTTCGACGT CTTCACCCCC
TTCGGGTTAC ACCCGAGCTA CTCCGGCGGA CGTATGCAGA TTTCGTCTAT GCTCGGCAAT
CCGAACTTCG TCGCCAGCTA TCTCGCGACG AGCGCGCCAG CTCTGCTGTA TCTCGCGCTG
CGCCCGAGCA TCAAAGCTTG GGTTTGGCGC CTGGGATTGG CTGGATCATT CGCCTCCATC
TGGTGGACGC AATCGCGTAT TGGCTTGCTC TGCTTCTTCG TTGCACTGGC GCTGCCGCTC
CTCGCACGAT CGCGCAAGCA ACGATGGATC GCAATCGGCG CCGTCCTGGT GCTCTTCGCT
GCCTCAGCCT CGCTCGTGAA CCGCACCAAC CCTCGCTCAC TGACGACTGC GTCCACAGGC
CGTACTTTCT TGTGGCGAGT TTCTCTCGCG GACGGCGTCC ACTTGCTCGG CGACGGTCCT
GGCACGTTCT ACTACACCTA TCCCGAACGC ATGGGGCGCT GGTTCGCCGC TCATCCCGAT
GGGAGCCTTC TTCCCTTCGC CGATATGCAG GAGCATGCAC ACAACGATTT TCTTGAATTT
CTGGTCTCCA CCGGCGTGCT TGGAGCCGCG GCGTTATTGG CAACGCTCGG CATTGGCGTC
GGCAGCCTGC TGCAACGCGT TACTTCTGAT CCGAGAGCAC CCTTTGCCTT GGCAGGTATC
GTGGCGCTCC TCCTCGGCGC CTGTTTCGAC TTCCCGCTCC AGCGCGCAGA GACCTGGGCT
CTGCTGTGGT TATGGTTCGC CTTCGCCTTT CTCGACTTAA ACCGTCGGGT AATTCGGTTT
CGAGAACGTG CAACTGTGCT CGCGCCAGCC TCTGCTCTCG GAATCGTCCT GCTTGTCGTC
GTCATGAGGC CAGCGATCGC CAGCTACCAC GTGCATGAAG GTCTCGCGTG GGAGGCGCAA
TCCAGCGACC AGCGAGCTGT TGAGGAATAC GCCGCTGCTC TACGCTGGGA CCGCACCAAC
GCCGACGCCG AGTTCTACCT GGCGCGTGCG CTGGCGAATT CAGGACGCAC GCAAGACGCG
CTGGAACAAG CCCGCATCGC GCAATACTGG CTCGACGAGC CCGATCTCTG GGAACTGAGG
GCTCGCATCC TCGTCCAAAT GGGACACAAG CAGGCGGCCT TGGCGGAATT GGACGACGGC
CTGCGCCGTT TCCCCTATTC CAGCCTCCTC GCATCGGCAC GTTCCGAAAT CTCCGCCGAA
CCCGAAAAGT GA
 
Protein sequence
MTPPAAGQGD SLPATLPSRI LAFFAASIPI LLACVYCPAL QSAFFPPKNA ILVCATMVLA 
IGVFFTRSSI PCAVSAIERN FLYAVAAFLF GGLLSASFSA HKELAIQPLL ILIAGCLLMP
LAASALRGRS DWLIHGIAIS GYIVAAVAIA NRFGFDVFTP FGLHPSYSGG RMQISSMLGN
PNFVASYLAT SAPALLYLAL RPSIKAWVWR LGLAGSFASI WWTQSRIGLL CFFVALALPL
LARSRKQRWI AIGAVLVLFA ASASLVNRTN PRSLTTASTG RTFLWRVSLA DGVHLLGDGP
GTFYYTYPER MGRWFAAHPD GSLLPFADMQ EHAHNDFLEF LVSTGVLGAA ALLATLGIGV
GSLLQRVTSD PRAPFALAGI VALLLGACFD FPLQRAETWA LLWLWFAFAF LDLNRRVIRF
RERATVLAPA SALGIVLLVV VMRPAIASYH VHEGLAWEAQ SSDQRAVEEY AAALRWDRTN
ADAEFYLARA LANSGRTQDA LEQARIAQYW LDEPDLWELR ARILVQMGHK QAALAELDDG
LRRFPYSSLL ASARSEISAE PEK