Gene Information |
Locus tag | Apar_1112 |
Symbol | |
ID | 8413985 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 1257901 |
End bp | 1260945 |
Gene Length | 3045 bp |
Protein Length | 1014 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 645022701 |
Product | glycosyl transferase family 8 |
Protein accession | YP_003180131 |
Protein GI | 257784914 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases |
TIGRFAM ID | |
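The coordinates and lengths in this record agree with each other, and the arithmetic can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the protein length excludes the stop codon. The function name `lengths_from_coords` is hypothetical and is not part of any database API.

```python
# Values are copied from the record above; lengths_from_coords is a
# hypothetical helper, not part of any database API.
def lengths_from_coords(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    gene_len = end_bp - start_bp + 1      # assumes inclusive coordinates
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1       # assumes the last codon is a stop codon
    return gene_len, protein_len


gene_len, protein_len = lengths_from_coords(1257901, 1260945)
print(gene_len, protein_len)  # 3045 1014
```

Under these assumptions the result matches the listed Gene Length (3045 bp) and Protein Length (1014 aa).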
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCAAGG TTTCTTTTGT AATTCCTGCA TACAACATTG AATCGTACAT TGGGCGTTGT ATTCAAAGTG TAAAGAATCA GACGTTTGGT GATTTTGAAG CAATTATTGT TGACGACGCC TCAACAGATT CCACTCCAGA GAAAATTGTT ACTGCAGTAG GGGATGACAA AAGGTTCAAA GTTGTCACTC ATGCAACTAA CCAGGGACTT CATCTTGCAC GTAAGACTGC TGCAGCGTAT ACAAAAGGAG AGTGGGTCTT TTGTTTAGAC GGTGATGATG AGGTTACTCC TGATTTTCTT GAGCAGGTGG TTGGTCGTAT TGAACAGAAT CCTGTTGATA TTCTCCACTT GGGTATTACC GTCATTCCAG AAAACGGTGT AAGTGAGGCT GAGGCAGAAG GTTTTGGTAG CTTCATCAAC CAGCAGTCTC ACTTTACTCA AGGTGATGAA GTTCTCCGCA CCATTTTTGA TGAGAGCTAT GGACAGAAGA TTGATTGGCG TACTACGCAG CGTCTGTATC GCGGAGAACT CTTTCGTTCT GCCTTTGCAG AGATGACTTC GGAGCGCTTG GTAAGAGCAG AAGATGCCTA TGAGGTATTT GTGCTTTCTG ATAAGGCTCA GACTGCTGAT GGATTTGAGT CTTGTAGAGG TCTTCTTTAC CACTTTGGTA TTGGTGTAAC AGGAGTTTCT CGCATTTCTT TAGATAAGTT TGGAGAGTTC TGCTATCAGT TCTTAGACAA TATTGAACAG ACAGAGTTCT ATATTGGCAA AACAGACAAT GTTGTTTACC TTAGGTCTTT TGAGGGCATG AAGCACAAGT TGATGGAGCT TTTGATTGGT GACTGGAAGT CTCGTCTTGC TCCTGAAGAT CAAGAAGCAG CTCTTGAGCC GTTTGCTATT TTATTTGGAC CATCAGTGGC TGCTCGAGAG CTTTATCGCT TTGTAAGAGA TGACGCTTAC GAGGCCATTA AAACTTGTAC TGAGCTTCCA GAAGACAGCA ATGCATATCT CTATAGATCG TATGCAAAGA AATATGCTGC TCTCATGCAT CAGGATGAGG GACTGTCGTT TGATAGGGCA GTTCGCATGA AACAGATTGC CGATGAACAT ATGGAATATC TAGAAAGAAA GCACATGGTA AAGATGTTTG AGCAGCAGCC TATTCGCATT TTTATTACCT CTCATAAAGA TGTTGATGTT CCAGAAAGTA ATTATCTTCA GCCTATTCAA GTAGGACCAG GTCAGAAGAC AAATCGCTTT TCGTATATGC TTCATGATGA TGAGGGCGAT AACATTACCG AAAAAAACCC AATGTACTGT GAGATGACTA CACAATACTG GGCATGGAAG AATATTACTA ATGAGCGCTA TGTTGGCTTT GGTCATTATC GTCGTTACTT CAACTTTACC GATACGATTT ATCCAGAAAA TCCTTTTGGT GAGGTTATGG ATGATTTTAT CGATGAGGAT GCCATCAAGA AATATGGTCT TGATGATCAG ACCATTGCTC AGTGTATTGA AGGATATGAT CTCATTACCA CTGGGGTAAA AGATATTCGT AAATTCCCTG GAAGCGCCAA TACACCACTC GAGCAGTACC ATGCTGCTCC ATTGCTGCAT CCAAAGGATA TGGATACTAT GGCGGCGCTT ATTGTTGAGC GTCATCCAGA GTATGCAGAG GATGTAAACG CTTTTCTTAA TGGTTATGAA CAGTGTTTCT GTAACATGTA TATCATGCGC AGGGAGCTTT TTGATCGCTA TGCAGCATGG GTATTCCCAC TTGTTGACGA GTGGACTGCT 
CGTACTGATA TGTCAACCTA CAGTAAAGAG GCTCTAAGAA CTCCAGGTCA CCTAACTGAG CGTCTCTTTA ATATCTGGCG TATGCACATG CTGCGCACAG AGGGTAAAAA CTGGAAGGTA AAAGAGCTAC AGTGTGTTCA CTTTACTAAT CCAGAGCCTC GTCAGAAGTT TATTCCTCTC TTTGAAGAGA AGCCTGAGAT TGCAAGTCAG AACGTTGTTC CTGTTGTTTT TGCAGCAGAT AACAACTACG TTCCAATTCT TACTTGTGCA ATGGGTTCAA TGCTTGAGAA TGCAGATCCT AACCGGTATT ACGACGTAGT TGTCCTTAAT ACCAATATTG GCGGATCAAA GCAGGAATTG GTTAAGAAGT TCTTCTCACG CTATAAGAAT GCTCGCATCA CGTTCTATAA CGTGTGGCGT ATGGTTAAAG ACTATAAATT AGATACCAAT AACGCGCATA TTAGCGTTGA GACATACTTC CGTTTCTTGG CCCAAGATAT CCTTTCTGCT TACGATAAGG TTGTCTATCT TGACTCTGAC CTTGTGGTTA ATGGCAATGT TGCTGAACTT TACGATGTAA GAATAGGCAA CAATCTTATT GCTGCAACGC TTGATATTGA CTATCTAGCA AACCTCAATA TTCGCGGTGG AGACCGCATG AAGTACAGCC TTGACGTGCT TAACCTCAAA AATCCTTATG CTTATTTCCA GGCGGGAGTT ATGGTTTTTA ATACCGCTGA ACTGCGCCGT TACCACACTG TTCCAGAGTG GTTGCGTATT GCATCTAATC CAATCTTTAT TTATAACGAT CAAGATATTC TGAATAGCGA GTGTCAAGGT CGAGTGCTAT ATCTTCCTGC CGATTGGAAC GTTACGCATA ATATTTTTGG TCGTGCAGAG GAACTCTATC CAATGGCACC AAACAGTGTT TTTGATGATT ATCAAGCAGC ACGTCGAGCA CCAAAGATTG TTCACTTTGC TGGCGCCATT AAACCTTGGC AGAATGCCAG CTGTGATATG GCTTCCTACT TCTGGAAGTA TGCACGCAAT ACCCCGTTCT ATGAGGTCAT TATTCAGGAT ATGGTTCCAA GCGCCAGAAA TGACGCGGAC GTTACAGAGT TCCATGAGCG TGCACTTTCT GATGCAAGTC CTCTGCGTAA GATTATTGAC CCTATTGCAC CGTATGGCAG CGCAAGGCGA GAAGCTCTTA AGGCCCTTGG TAGAACCTTA AGAGGTCGCA AATAA
|
Protein sequence | MPKVSFVIPA YNIESYIGRC IQSVKNQTFG DFEAIIVDDA STDSTPEKIV TAVGDDKRFK VVTHATNQGL HLARKTAAAY TKGEWVFCLD GDDEVTPDFL EQVVGRIEQN PVDILHLGIT VIPENGVSEA EAEGFGSFIN QQSHFTQGDE VLRTIFDESY GQKIDWRTTQ RLYRGELFRS AFAEMTSERL VRAEDAYEVF VLSDKAQTAD GFESCRGLLY HFGIGVTGVS RISLDKFGEF CYQFLDNIEQ TEFYIGKTDN VVYLRSFEGM KHKLMELLIG DWKSRLAPED QEAALEPFAI LFGPSVAARE LYRFVRDDAY EAIKTCTELP EDSNAYLYRS YAKKYAALMH QDEGLSFDRA VRMKQIADEH MEYLERKHMV KMFEQQPIRI FITSHKDVDV PESNYLQPIQ VGPGQKTNRF SYMLHDDEGD NITEKNPMYC EMTTQYWAWK NITNERYVGF GHYRRYFNFT DTIYPENPFG EVMDDFIDED AIKKYGLDDQ TIAQCIEGYD LITTGVKDIR KFPGSANTPL EQYHAAPLLH PKDMDTMAAL IVERHPEYAE DVNAFLNGYE QCFCNMYIMR RELFDRYAAW VFPLVDEWTA RTDMSTYSKE ALRTPGHLTE RLFNIWRMHM LRTEGKNWKV KELQCVHFTN PEPRQKFIPL FEEKPEIASQ NVVPVVFAAD NNYVPILTCA MGSMLENADP NRYYDVVVLN TNIGGSKQEL VKKFFSRYKN ARITFYNVWR MVKDYKLDTN NAHISVETYF RFLAQDILSA YDKVVYLDSD LVVNGNVAEL YDVRIGNNLI AATLDIDYLA NLNIRGGDRM KYSLDVLNLK NPYAYFQAGV MVFNTAELRR YHTVPEWLRI ASNPIFIYND QDILNSECQG RVLYLPADWN VTHNIFGRAE ELYPMAPNSV FDDYQAARRA PKIVHFAGAI KPWQNASCDM ASYFWKYARN TPFYEVIIQD MVPSARNDAD VTEFHERALS DASPLRKIID PIAPYGSARR EALKALGRTL RGRK
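The gene and protein sequences above are printed in space-separated blocks of 10 characters. The following is a rough sketch of how they could be cleaned and cross-checked. It assumes the gene sequence is already given on the coding strand (it begins with ATG even though the record lists the minus strand), and it relies on translation table 11 assigning the same amino acids to codons as the standard code. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table built in the conventional TCAG order. Translation table 11
# shares these codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGCCCAAGG TTTCTTTTGT AATTCCTGCA"))  # MPKVSFVIPA
```

Applied to the full gene sequence, `translate` would be expected to reproduce the protein sequence listed above once both are passed through `clean`, and `gc_fraction` could be compared against the GC content field.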