Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0079 |
Symbol | |
ID | 8412922 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | + |
Start bp | 89282 |
End bp | 90721 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 645021646 |
Product | PTS system, lactose/cellobiose family IIC subunit |
Protein accession | YP_003179106 |
Protein GI | 257783889 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1455] Phosphotransferase system cellobiose-specific component IIC |
TIGRFAM ID | [TIGR00410] PTS system, lactose/cellobiose family IIC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAAAG ATAATGGCCA AGCATCGTTT CTAGATAAAT TTGCAGAAGT TTCTGCAAAA GTAGGCAATC AAGTTCACTT GAGAAGCCTA CGCGATGCAT TCGCAACTGT CAGCCCAATT TACATTTTGG CTGGTATTGC AGTACTGATC AACAACGTTC TGTTCCCACT CATCTTTGCT AATGATCCAG TCACCTTGGC TAATTTCAAG GTTTGGGGTG CAGCGATTGC TCAGGGAACT CTGAGTTTCT CGGCAGTTAT TCTTGCAGGC ATCATTGGTT ATTGCCTTGC TCGTAATAAG CGTTTTGAAA ACGCAATTTC TTGCGTTGTT ATTGGCATTG CTGCTCTTAT TATCATGATG CCTCAGAGCA TTGTTGCCAC CGCAGGCGCT GTTCTTCACG CAACTAACGC TGCTAACGCA GCTGCTGCAG CAGTAACACT TCCTGTGGCT GAGGTTGCTA AGCTTCTGCC AAATGACTAT GCAGTCACTG GCGTCGATGT AACAGGCGCT TTTTCCTCCT CCTTCACTGG TACTAACGGT CTGTTTGGTG CAATCATTAT TGGCCTGCTT TCGACCACAA TTTTCATTAA GCTCTCTAGC GTTAAACAGC TGAAGGTCAA CCTTGGTGAG GGTGTTCCAC CAGCGGTTGC AGATTCTTTT AACACCATGA TTCCTATGAT GTTGACCTTG AGTGTTTTTG GTATTGCTTC TGCTCTTCTT GCAGTTTGTG CAGGTACTGA CCTTATGACC ATCATTGCAA CAAGCATTTC CGCCCCACTG AAGGGCCTTA TGAATGCTGG TCCATTTGCT GTCATCGTCA TCTACACCTT TGCAAACCTT CTCTTCTGCC TTGGTATTCA CCAGTCCACC ATCTCTGGTG TTCTGATTGA GCCAATCCTG ACCATGCTTA TTGTTGACAA CATGGCAACC TTCGCAGCTG GTCAGCCAAT CCCTCAGGAT CACTACATGA ACATGCAGAT CATCAACACC TTTGCGCTGA TTGGCGGTTC TGGTTGTACT CTGATGCTGC TTTTTGACAC CTTTATCTTC TCCAAGAACA AGGCTTCTAA AGACGTTGCG GCACTTTCTC TTCTCCCAGG TATCTTTAAC ATCAACGAGC CAGTTATTTA TGGTTACCCA ATTGTCTTTA ACCTTCCTTT GATGATTCCA TTTGTTCTTG TACCAGATTT GTTCATCGGT CTGACCTACC TGCTCACCAA CCTTGGTTGG ATTAGCCCTT GCGTTGCAAT GGTTCCTTGG ACCACTCCAG TCTTTTTGAG TGGTTGGTTA GCAACCGGTG GCGACGTTCG TGCTGTTATT TGGCAGATCG TCGAGGTTCT TCTCGCAATG GCAATTTACC TGCCATTTAT GAAGATCTCC GAGCGCGCAC AGGTCAAGCA GGCTGAGGCT CTTGCAGAGA AAGCTCAGGA TGCAGAGTAG
|
Protein sequence | MAKDNGQASF LDKFAEVSAK VGNQVHLRSL RDAFATVSPI YILAGIAVLI NNVLFPLIFA NDPVTLANFK VWGAAIAQGT LSFSAVILAG IIGYCLARNK RFENAISCVV IGIAALIIMM PQSIVATAGA VLHATNAANA AAAAVTLPVA EVAKLLPNDY AVTGVDVTGA FSSSFTGTNG LFGAIIIGLL STTIFIKLSS VKQLKVNLGE GVPPAVADSF NTMIPMMLTL SVFGIASALL AVCAGTDLMT IIATSISAPL KGLMNAGPFA VIVIYTFANL LFCLGIHQST ISGVLIEPIL TMLIVDNMAT FAAGQPIPQD HYMNMQIINT FALIGGSGCT LMLLFDTFIF SKNKASKDVA ALSLLPGIFN INEPVIYGYP IVFNLPLMIP FVLVPDLFIG LTYLLTNLGW ISPCVAMVPW TTPVFLSGWL ATGGDVRAVI WQIVEVLLAM AIYLPFMKIS ERAQVKQAEA LAEKAQDAE
|
| |