Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_1335 |
Symbol | |
ID | 8414220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | + |
Start bp | 1501196 |
End bp | 1502689 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 645022932 |
Product | PTS system, trehalose-specific IIBC subunit |
Protein accession | YP_003180350 |
Protein GI | 257785133 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific |
TIGRFAM ID | [TIGR00826] PTS system, glucose-like IIB component [TIGR01992] PTS system, trehalose-specific IIBC component |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.135548 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00345051 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAAAGT TCGACCATGA TGCTCGTGAA CTCCTCGAGC TCGTTGGCGG CAAAGACAAC ATTGCTGCGG CTTCACACTG TATGACACGC ATGCGATTTG CCTTGAAAGA TCCTTCAAAA GCGGATGTGG CTGCCATTGA AAAGCTTGCA TCCGTTAAAG GAAGTTTCAC GCAAGCCGGC CAGTTTCAAG TTATTATCGG CAATGACGTC GCCGACTTTT ACGACACCTT TGTCGGTATT TCTGGCGTAA GTGAGGCTTC AAAACAAGAC GTCAAATCAG CCGCCCTTCA AAACACAAAC ATTTTGCAGC GAGCCATGGG AGCCATTGCC GAGGTCTTTG CTCCCCTGAT TCCCGCAATC ATTACCGGCG GCCTTATTCT TGGCTTCCGA AACGTGCTTG GCGAGATGCC CTACTTTGGA CCTGAAGGCA ACCAGACCCT CGCTTCGCTT TCCGTCTTCT GGACGGGCGT ATATAACTTC TTGTGGCTTA TTGGCGAGGC AGTCTTCCAT GCCGGCATCC CTGTAGGCAT TTGCTGGTCA ATCACAAAGA AAATGGGCGG GACCCCCATG CTCGGCATTG TGCTTGGCCT CACCCTCGTT TCCGGCCAGC TCATGAATGC CTATGCTGTT TCAGGAGCAA CCGCCGCTGA TTGGGCTACC CATACATGGA ATTTTGACTT CGCGCAGGTC CGCATGATTG GATATCAGGC TCAAGTTATC CCCGCTATTC TCGCTGCGAT TACCTTCAAC TACCTCGAGC GATTCTTTAA GAAAATCACG CCATCCGTCA TTCAGATGAT CGTAGTGCCC TTCTGCTCAC TTTTGCTTGC CGTTATGGCT GCTCACTTCG TGCTTGGCCC CATTGGCTGG ACCATCGGAT CTTGGATCGG AAACATTGTT CTCGCAGGCA TCACAAGTCC GTTTGCATGG CTCTTTGGCC TTATCTTTGG CGCCGTATAT GCTCCCCTTG TCATCACTGG CTTGCACCAC ATGTCCAATG CCGTTGACAT GCAGCTTATT GCAAGTTCCA ATAACTATGG CACTCCACTG TGGCCCATGA TCGCACTTTC CAATATTGCT CAGGGTTCAT CGGTTCTCGC CATGTCTGTC CTTCAGAAGC ATGACGAAAA TGCTCAGCAG GTAAACATCC CTTCCATCAT CTCTTGCTAC CTTGGCGTTA CCGAACCCGC TATGTTTGGC GTCAACCTCA AATACGGTTT CCCCTTCGTA TGCGCTATGA TTGGCAGTTC CATTGCAGGA GGTGTCTGTA CGGCCTTTGG TGTCAACGCT CTCTCCATTG GCGTTGGTGG ACTCCCCGGA ATTCTTTCAA TTCGCCCCGA GTTTATCGGC ATCTTCGCCG TTTGCATGGC CATCGCTGTA GTGGTTCCAT TTGTTCTTAC ATTCACCATC GGCAAACGCC AAGGTATTGA TAAAGGTATC GATACCAATG TCGTCACGCT TGATGGCGAG AAAAACAGTG CCCTTTCGGC CTAA
|
Protein sequence | MAKFDHDARE LLELVGGKDN IAAASHCMTR MRFALKDPSK ADVAAIEKLA SVKGSFTQAG QFQVIIGNDV ADFYDTFVGI SGVSEASKQD VKSAALQNTN ILQRAMGAIA EVFAPLIPAI ITGGLILGFR NVLGEMPYFG PEGNQTLASL SVFWTGVYNF LWLIGEAVFH AGIPVGICWS ITKKMGGTPM LGIVLGLTLV SGQLMNAYAV SGATAADWAT HTWNFDFAQV RMIGYQAQVI PAILAAITFN YLERFFKKIT PSVIQMIVVP FCSLLLAVMA AHFVLGPIGW TIGSWIGNIV LAGITSPFAW LFGLIFGAVY APLVITGLHH MSNAVDMQLI ASSNNYGTPL WPMIALSNIA QGSSVLAMSV LQKHDENAQQ VNIPSIISCY LGVTEPAMFG VNLKYGFPFV CAMIGSSIAG GVCTAFGVNA LSIGVGGLPG ILSIRPEFIG IFAVCMAIAV VVPFVLTFTI GKRQGIDKGI DTNVVTLDGE KNSALSA
|
| |