Gene Information |
Locus tag | Acid345_2494 |
Symbol | |
ID | 4069863 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2947083 |
End bp | 2948774 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637984511 |
Product | O-antigen polymerase |
Protein accession | YP_591569 |
Protein GI | 94969521 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3307] Lipid A core - O-antigen ligase and related enzymes |
TIGRFAM ID | |
|
|
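The coordinate and length fields above are related by simple arithmetic. The sketch below shows that relationship under two assumptions not stated in the record itself: that Start bp and End bp are 1-based and inclusive, and that Gene Length includes the terminal stop codon. The function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    last codon is a stop codon that is not part of the protein.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record for Acid345_2494
length_bp = gene_length(2947083, 2948774)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1692 563
```

Under these assumptions the computed values agree with the Gene Length (1692 bp) and Protein Length (563 aa) fields in the record.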
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.754605 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCCAC CGGCCGCAGG ACAAGGCGAT TCTCTTCCGG CCACACTTCC CTCTCGGATT CTCGCGTTTT TTGCCGCTTC GATTCCGATC CTTCTCGCGT GTGTGTATTG CCCCGCGCTG CAATCGGCAT TTTTCCCTCC GAAGAACGCG ATATTGGTAT GCGCGACGAT GGTGCTCGCG ATCGGGGTGT TCTTTACTCG GAGCTCAATC CCCTGCGCTG TGAGCGCAAT CGAACGTAAT TTTCTCTACG CTGTCGCGGC TTTTTTGTTC GGGGGTCTTC TCTCCGCTAG TTTTTCAGCC CACAAAGAAT TGGCGATTCA ACCGCTTCTG ATCCTCATTG CCGGATGTCT TCTGATGCCG CTGGCTGCAT CGGCCCTGCG CGGCCGCTCC GATTGGTTGA TCCATGGCAT TGCGATTTCC GGCTACATCG TCGCAGCAGT CGCAATTGCG AACCGCTTCG GTTTCGACGT CTTCACCCCC TTCGGGTTAC ACCCGAGCTA CTCCGGCGGA CGTATGCAGA TTTCGTCTAT GCTCGGCAAT CCGAACTTCG TCGCCAGCTA TCTCGCGACG AGCGCGCCAG CTCTGCTGTA TCTCGCGCTG CGCCCGAGCA TCAAAGCTTG GGTTTGGCGC CTGGGATTGG CTGGATCATT CGCCTCCATC TGGTGGACGC AATCGCGTAT TGGCTTGCTC TGCTTCTTCG TTGCACTGGC GCTGCCGCTC CTCGCACGAT CGCGCAAGCA ACGATGGATC GCAATCGGCG CCGTCCTGGT GCTCTTCGCT GCCTCAGCCT CGCTCGTGAA CCGCACCAAC CCTCGCTCAC TGACGACTGC GTCCACAGGC CGTACTTTCT TGTGGCGAGT TTCTCTCGCG GACGGCGTCC ACTTGCTCGG CGACGGTCCT GGCACGTTCT ACTACACCTA TCCCGAACGC ATGGGGCGCT GGTTCGCCGC TCATCCCGAT GGGAGCCTTC TTCCCTTCGC CGATATGCAG GAGCATGCAC ACAACGATTT TCTTGAATTT CTGGTCTCCA CCGGCGTGCT TGGAGCCGCG GCGTTATTGG CAACGCTCGG CATTGGCGTC GGCAGCCTGC TGCAACGCGT TACTTCTGAT CCGAGAGCAC CCTTTGCCTT GGCAGGTATC GTGGCGCTCC TCCTCGGCGC CTGTTTCGAC TTCCCGCTCC AGCGCGCAGA GACCTGGGCT CTGCTGTGGT TATGGTTCGC CTTCGCCTTT CTCGACTTAA ACCGTCGGGT AATTCGGTTT CGAGAACGTG CAACTGTGCT CGCGCCAGCC TCTGCTCTCG GAATCGTCCT GCTTGTCGTC GTCATGAGGC CAGCGATCGC CAGCTACCAC GTGCATGAAG GTCTCGCGTG GGAGGCGCAA TCCAGCGACC AGCGAGCTGT TGAGGAATAC GCCGCTGCTC TACGCTGGGA CCGCACCAAC GCCGACGCCG AGTTCTACCT GGCGCGTGCG CTGGCGAATT CAGGACGCAC GCAAGACGCG CTGGAACAAG CCCGCATCGC GCAATACTGG CTCGACGAGC CCGATCTCTG GGAACTGAGG GCTCGCATCC TCGTCCAAAT GGGACACAAG CAGGCGGCCT TGGCGGAATT GGACGACGGC CTGCGCCGTT TCCCCTATTC CAGCCTCCTC GCATCGGCAC GTTCCGAAAT CTCCGCCGAA CCCGAAAAGT GA
|
Protein sequence | MTPPAAGQGD SLPATLPSRI LAFFAASIPI LLACVYCPAL QSAFFPPKNA ILVCATMVLA IGVFFTRSSI PCAVSAIERN FLYAVAAFLF GGLLSASFSA HKELAIQPLL ILIAGCLLMP LAASALRGRS DWLIHGIAIS GYIVAAVAIA NRFGFDVFTP FGLHPSYSGG RMQISSMLGN PNFVASYLAT SAPALLYLAL RPSIKAWVWR LGLAGSFASI WWTQSRIGLL CFFVALALPL LARSRKQRWI AIGAVLVLFA ASASLVNRTN PRSLTTASTG RTFLWRVSLA DGVHLLGDGP GTFYYTYPER MGRWFAAHPD GSLLPFADMQ EHAHNDFLEF LVSTGVLGAA ALLATLGIGV GSLLQRVTSD PRAPFALAGI VALLLGACFD FPLQRAETWA LLWLWFAFAF LDLNRRVIRF RERATVLAPA SALGIVLLVV VMRPAIASYH VHEGLAWEAQ SSDQRAVEEY AAALRWDRTN ADAEFYLARA LANSGRTQDA LEQARIAQYW LDEPDLWELR ARILVQMGHK QAALAELDDG LRRFPYSSLL ASARSEISAE PEK
|
| |
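The sequences above are printed in space-separated blocks of ten, so they need the whitespace stripped before any analysis. The following is a minimal sketch, not the database's own method, of how the GC content and the protein translation could be recomputed from the gene sequence. It assumes the codon assignments of the standard genetic code, which translation table 11 shares; table 11 also permits alternative start codons, which this sketch does not handle (the gene here starts with ATG). The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers.

```python
# Compact encoding of the standard codon assignments (shared by table 11):
# codons are ordered with bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record
print(translate("ATGACGCCAC CGGCCGCAGG ACAAGGCGAT"))  # MTPPAAGQGD
```

Passing the full Gene sequence field to `translate` and `gc_fraction` would allow a comparison against the Protein sequence and GC content fields of the record.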