Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0835 |
Symbol | |
ID | 8413701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 924919 |
End bp | 926226 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 645022418 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003179855 |
Protein GI | 257784638 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.445477 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00049802 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCATTGC AATTTAACAG ACTCGGTATC ACACCAATCG TTGTTTTTAA TTTTATCATC TGGCTTTTCT TTACGCTCGC ATACTTCTAT CAGATTGTGT ATATCCTTCG CGTTATGTTT AAAGGCGAGG TAAAGCTTCC CGAGGCAAAA AAGCAGCATC GCTATGCTTT CTTTATTGCT GCCCACAATG AAGAGCCTGT TATTGGCAAT CTTGTCAGAT CTATTCTTTC TCAAGATTAT CCTCGGGAGC TGATGGATGT CTTTGTTGTT GCAGATGCCT GTACTGATAA AACAGCAGAA GAGGCAAGAA AAGCTGGAGC AATTACTTGG GAGCGTAATG ACCTTGCTCG TAAGGGTAAG AGCTGGGTTA TGGATTACGG CTTTGATCGC ATTCTTAATG AGTACGGTGA CAAATACGAG GCTTTCATTG TTATGGACGC CGATAACCTG GTTTCTCCAA GCTATCTTAA AATTATGAAT CAAGCTTTTG ATGCAGGGTA TCTTGTGTGC ACCAGCTATA GAAACTCAAA GAATTTTGAT TCTAGTTGGG TTAGTTCTGC CTATGCTACA TGGTTTATGC GTGAAGCAAA GTTTTTGAAT AATGCTCGTA TGATGATGGG TACAAGCTGT GCGGTTTCTG GTTCGGGTTG GATGGTTTCT TCTCGCATTA TTAAAGGCAT GCATGGATGG GATTTTCATA CATTGACTGA AGATATTCAG TTTTCTACGT TTTGCTGTGC TCACAACATT CAAATTGGTT ATGCTCCAGC AGAATTTTTT GATGAACAGC CTTTGACATT TAAAGCTTCA TGGACTCAAA GAATGCGCTG GACAAAAGGA TTTTATCAGG TATTTTTCTC GTATGGCTTT GACCTACTTA AAGGTATTTT CAAGGGTCAG TTTGCTTCAT ACGATATGCT TATGACAATT GCACCAGGTA TGATTTTGTC GTTGCTTTCT GCATTTATTA ATGGAACTTA TCTTTTGGTT GGTTACTTGA GCCACGGCTT TGTTGCAACT GATGCCGAGA TTGCTATGAG TGTGGGTTCT TTGGTTATGA CGGTTTTCTC GATGTATGTT GTCTTCTTTA TTCTGGCGCT CATCACTACT ATTTCAGAGT ACAAGCATTT CCATGTAAAG AAAAAGTGGC GTATTTTTAC CAATCTCTTT ACGTTTCCTA TTTTTATGAT GACGTATATT CCTATTACCG TTGCAGCTTT GTTCAAAAAA GTTGAGTGGG TTCCTACTAA ACATGACATT GCTGTTAACT TTGAGGATGT TATTGCTTCA AGTGGGAGTT CAAATTAA
|
Protein sequence | MPLQFNRLGI TPIVVFNFII WLFFTLAYFY QIVYILRVMF KGEVKLPEAK KQHRYAFFIA AHNEEPVIGN LVRSILSQDY PRELMDVFVV ADACTDKTAE EARKAGAITW ERNDLARKGK SWVMDYGFDR ILNEYGDKYE AFIVMDADNL VSPSYLKIMN QAFDAGYLVC TSYRNSKNFD SSWVSSAYAT WFMREAKFLN NARMMMGTSC AVSGSGWMVS SRIIKGMHGW DFHTLTEDIQ FSTFCCAHNI QIGYAPAEFF DEQPLTFKAS WTQRMRWTKG FYQVFFSYGF DLLKGIFKGQ FASYDMLMTI APGMILSLLS AFINGTYLLV GYLSHGFVAT DAEIAMSVGS LVMTVFSMYV VFFILALITT ISEYKHFHVK KKWRIFTNLF TFPIFMMTYI PITVAALFKK VEWVPTKHDI AVNFEDVIAS SGSSN
|
| |