Gene Information |
Locus tag | Apar_1198 |
Symbol | |
ID | 8414076 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 1343908 |
End bp | 1345395 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 645022792 |
Product | phosphotransferase system EIIC |
Protein accession | YP_003180217 |
Protein GI | 257785000 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific |
TIGRFAM ID | [TIGR00826] PTS system, glucose-like IIB component |
|
|
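The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function name is hypothetical and not part of the database.

```python
# Minimal sketch: cross-check the coordinate and length fields of the record.
# Assumptions (not stated by the record itself): coordinates are 1-based and
# inclusive, and the final codon of the CDS is a stop codon.

def gene_and_protein_length(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    gene_len = end_bp - start_bp + 1          # inclusive coordinate span
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1           # drop the terminal stop codon
    return gene_len, protein_len


if __name__ == "__main__":
    # Start bp and End bp copied from the record above.
    gene_len, protein_len = gene_and_protein_length(1343908, 1345395)
    print(f"Gene length: {gene_len} bp, protein length: {protein_len} aa")
```

Under these assumptions the result agrees with the Gene Length (1488 bp) and Protein Length (495 aa) fields. The strand does not affect the length calculation.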
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.742148 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTACA CCAAGCTTTC AGAACAGATT CTTGCTGCCG TCGGTGGTAA AGAAAATGTT CAGAGCAACA TGGTCTGCAT GACCAGGCTC CGCGTTAAGA CCGTAGATCC TTCAAAGGTT GATTCTGAGG CCATCAAGGC TATTGATGGC GTCATGGGTC TGGTCGAGGA TGCAGAGTAC CTTGAGGTTG TTCTTGGTCC TGGCGTTGTC AACAAGGTTA TTGTTGAGTT CTCCAAGCTT ACCGGTGTTG CTGCTGGCGA CGCATCTGAG GATGATGTTG TTTCTGCAGC TAAGGACAAC AAGGCAGCTC AGAAGGCTAA GTATGAGAAC AAGCCTGTTC AGCGTTTCCT AAAGAAGATT GCTAACATCT TTGTTCCACT GCTTCCTGGC ATCATTTCTG CAGGTTTGAT TAACGGTATC ATCAACGTTA TCAACTTCTC CACTGGTAAG GCTTATGCCA ACGAGTGGTG GTTTGCAGCT ATCTGGACTA TGGGTTGGGC GCTCTTTGCT TATCTGCCAA TTCTCGCTGG CGAAAACGCT GCCAAGGAGT TTGGTGGTTC TCGCGTTCTT GGTGCTATGG CTGGTGCACT TTCCATTGCT AACGCTGGTA TGCCTCTTCT TGCTTCCAAG ACTGTTGACG GCGTTGCAAC TCGTTTGGTT CACCTTCCAT TTAGCCTCCC AACTGTTGCT TTCAAGGAGG GCGCATTTAT CGTTGCTTCC TCTGACCTGT TCAACGCAGC AGCAGGCGGC ATGATTGGTG CAATCATCTG CGCAATCTTC TTTGCATTCC TGGAGAAGAA CCTGCACAAG GTTATGCCAA GCGTTCTTGA CACCTTCCTG ACCCCACTTT GCACCGTCAT CATCGGTGTC ATTGGTTCTG TCCTGATTCT TCAGCCTGCA GGTGCATGGC TCACTCAGGC AATCTTCTTT GTTCTTCAGT TCTTCTATGA CAAGCTTGGC GTCTTTGGTG CTTACCTGCT TGGTTCTACC TTCTTGCCAC TGGTTTCTGT TGGTCTTCAC CAGGCACTTA CACCAATCCA CGTTATGCTT AACAACCCAG AAGGCCCAAC TCAGGGTATC AACTACCTGC TTCCTCTGCT GATGATGGCA GGCGGTGGCC AGGTTGGTGC TGGTCTTGCT ATTCTCTTCA AGACTAAGAA CAAGCGCGTT AAGAAGTACC TCACCGAGTC CATCCCAGTT GGTATGCTCG GCATCGGCGA GCCTCTTATG TACGCAGTTA CTCTGCCTCT TGGTAAGCCA TTCATTACCG CATGCCTTGG CTCTGGTGTT GGCTCCGTAA TTGCTTACCT CTTCCACCTT GGCACCGTTT CCCAGGGCGT CTCTGGCCTC TTTGGTCTGC TGATTGTTCA GCCTGGTAAC CAGGTCTTCT ACCTGCTTGC AATGCTTCTT GCTTATGCTG CAGGCTTTGC ACTTACTTGG TTCTTTGGCG TTGACGAGGA TCGTATCAAC GACGTCTTTG GCGAGTAA
|
Protein sequence | MDYTKLSEQI LAAVGGKENV QSNMVCMTRL RVKTVDPSKV DSEAIKAIDG VMGLVEDAEY LEVVLGPGVV NKVIVEFSKL TGVAAGDASE DDVVSAAKDN KAAQKAKYEN KPVQRFLKKI ANIFVPLLPG IISAGLINGI INVINFSTGK AYANEWWFAA IWTMGWALFA YLPILAGENA AKEFGGSRVL GAMAGALSIA NAGMPLLASK TVDGVATRLV HLPFSLPTVA FKEGAFIVAS SDLFNAAAGG MIGAIICAIF FAFLEKNLHK VMPSVLDTFL TPLCTVIIGV IGSVLILQPA GAWLTQAIFF VLQFFYDKLG VFGAYLLGST FLPLVSVGLH QALTPIHVML NNPEGPTQGI NYLLPLLMMA GGGQVGAGLA ILFKTKNKRV KKYLTESIPV GMLGIGEPLM YAVTLPLGKP FITACLGSGV GSVIAYLFHL GTVSQGVSGL FGLLIVQPGN QVFYLLAMLL AYAAGFALTW FFGVDEDRIN DVFGE
|
| |
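The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. This is an illustration, not the database's own pipeline. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TAA, although the gene lies on the minus strand) and that the space-separated 10-base blocks are only display formatting. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which does not matter here because the gene starts with ATG. The function names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); the amino acid assignments
# are the same for translation table 11 and the standard code.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding-orientation sequence, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":                 # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence from the record above.
    prefix = "ATGGATTACA CCAAGCTTTC AGAACAGATT"
    print(translate(prefix))          # compare with the start of the protein sequence
    print(round(gc_content(prefix), 3))
```

Pasting the full Gene sequence value into translate should reproduce the Protein sequence value once the display spacing is removed from both, and gc_content on the full sequence can be compared with the rounded GC content field.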