Gene Information |
Locus tag | Athe_0462 |
Symbol | |
ID | 7407540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 528197 |
End bp | 529543 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 643714850 |
Product | exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
Protein accession | YP_002572367 |
Protein GI | 222528485 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
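
The length fields in this record are consistent with the listed coordinates. The minimal sketch below shows that arithmetic. It assumes 1-based inclusive coordinates and that the gene span includes the terminating stop codon. Both assumptions agree with the numbers above but are not stated by the record itself. The function names are hypothetical, chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, includes_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    last codon is a stop codon that is not counted as an amino acid.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if includes_stop_codon else codons


# Values taken from the record for Athe_0462.
length = gene_length_bp(528197, 529543)
print(length, protein_length_aa(length))
```

With the record's Start bp and End bp this gives 1347 bp and 448 aa, matching the Gene Length and Protein Length rows.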
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00445069 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATCAT ATATTAAAAG CTATCACAAG CTGAGCAAAT TTTTTGTTGT TCTTATGGAC ATTTTACTTG TACATGCAGG TTATATAATT GCCTATATTA TAAAATTCAA CTTTACATTT CCAGAGAAAA ATTTCATGCC TTATTATACA TTAATTCCTC AAATAACTCT TTTTGCTCTT GTTCTTTTAA ATATTTATGG ACTTTATACT ATTACCATGA AAACCATTAG CGAAATAGCA TTTTCGCTGG GTCTGGCTCT AATGCTTCTA CAATTTTTAA CAGTTGTATC AACATTTTTC TACAGACACT TTGCTTTCCC ACGCAGTATA TTTATAATTG CCTTTTTTGT ACAGTTTTTA TTGCTTTTGG GATGGAGAGG ACTTGTCCTA TATGTTTTCA AAAGAGTTCA AGGTGTCAAG CATGTACTTG TGATTGGAGA GATGTCAAAG GCGCAGGAGT TTGCTCAAAA GCTTCAAAAT ATTTCGAAAG GTTGGATAGA TGTAAAGTAT GTTCTTGAGC CAAAAGCTAT AGAAGAATTG ATACTATATA TAAAACTTGT TGATACAATT TATATTTATT CAAAAATGGA TGAAAACTTA AAAAGTGAGA TTGTAAGAAA GGCGATAGAA TTCAAAAAAC ATATTTTCAT AGCACCAGAT TTTAGAGATA TATTGGTATC ACGTGCGAGG GTGATTCAGT TTGATGATGT TGCAACACTT TCAATTGAAC AGCCAGAACT TACTTCCGAG CAAAAGCTTA TAAAAAGATT TTGTGACATT CTTCTTGCAT CAGTTGCGCT GGTTATTTCT TTTCCTATAA TGATCTTGAT TGCAATTGCC ATAAAAATTG ACTCAGAGGG GCCAGTAATT TACAAGCAAA AAAGGGTCAC AGAAGGAGAG AGAGAGTTTT ATGTTTTAAA GTTCAGAACA ATGGTAAAAG ATGCAGAAAA GATGACAGGT CCTGTTCTGG CAACCGAAAA CGACCCCAGA ATAACAAGGG TTGGAAGGTT TTTGCGCGCA ACAAGGCTTG ATGAGCTTCC GCAGCTGATA AATATTTTAA AAGGTGAAAT GAGTTTTATA GGACCAAGAC CAGAGCGTCC TTATTTTGTT GAGCAGTTTA AAAAACTCTA TCCTGAGTAT TCGCTTCGTC ATAATGTAAA GGCAGGGCTC ACAGGACTTG CCCAGGTTTA TGGCAAATAT GCAACAAGCC CTGAAGACAA GCTCAGGCTT GATTTGATAT ATATAAAGAA TTACTCTGTA TTTTTAGACA TCAAAATTTT ACTGTTGACC TTAAAAACAA TTTTTACCAA AGAGGCAGCT GAGGGGGTAA AAAACCAAAA AGGATAG
|
Protein sequence | MKSYIKSYHK LSKFFVVLMD ILLVHAGYII AYIIKFNFTF PEKNFMPYYT LIPQITLFAL VLLNIYGLYT ITMKTISEIA FSLGLALMLL QFLTVVSTFF YRHFAFPRSI FIIAFFVQFL LLLGWRGLVL YVFKRVQGVK HVLVIGEMSK AQEFAQKLQN ISKGWIDVKY VLEPKAIEEL ILYIKLVDTI YIYSKMDENL KSEIVRKAIE FKKHIFIAPD FRDILVSRAR VIQFDDVATL SIEQPELTSE QKLIKRFCDI LLASVALVIS FPIMILIAIA IKIDSEGPVI YKQKRVTEGE REFYVLKFRT MVKDAEKMTG PVLATENDPR ITRVGRFLRA TRLDELPQLI NILKGEMSFI GPRPERPYFV EQFKKLYPEY SLRHNVKAGL TGLAQVYGKY ATSPEDKLRL DLIYIKNYSV FLDIKILLLT LKTIFTKEAA EGVKNQKG
|
| |