Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0498 |
Symbol | |
ID | 8413347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 568367 |
End bp | 570547 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 645022066 |
Product | YhgE/Pip C-terminal domain protein |
Protein accession | YP_003179520 |
Protein GI | 257784303 |
COG category | [S] Function unknown |
COG ID | [COG1511] Predicted membrane protein |
TIGRFAM ID | [TIGR03061] YhgE/Pip N-terminal domain [TIGR03062] YhgE/Pip C-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.325694 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAA TTTGGACTAT TTTTCTTGCT GATTTTAAGG CTGTTGCACA CAACACCATT GCTCTTGTTG TATGCGTGGG GCTAGTAATA CTGCCATCTT TATATGCATG GCTTAATATT GAGGGTAGTT GGGATCCATA CGGACACACC AATGAAATAA AAATTGCTGT TGCTAACAAT GACGCAGGAT ATCAAAGCGA TCTTATTCCT GTACGCGTCA ACATTGGTGA ACGTATGGTA TCTAAACTTA GTGAAAGCAC CACTATTCAC TATGTTATTA CTTCAAAAGA AGACGCTGAA CACGGTGTGC AGTCCGGCGC ATATTACGCA GCGCTCCTCA TTCCAGAGAA TTTTTCGCAA GAGTTACTTG CTTCACTTTC TGGCCAAGAA AATGATGCCA GTATTACCTA TCTAAGCAAT CAAAAACTGG GAGCAATCGC ACCTATTGTT ACCGATAAAG CTGCAGAATC TGCACGTACA GAGATTGAAC GATCTTTTGC AGAGTCAGTT ACTGAAGTAG GCGCAGGTCT TATTGGTGAA CTCACTAATA ACGCGGATGA TTCTAATCTT TCTTCTGTAG TAACCAAGCT TGATGAGACT CTTGTTCATG GATCTAATGA GCTCAGAATT GCTGCTAATC ATATTGGTGT ATATCAGGGA CTTGTTGATT CAGCTCAACA AATTATTGAG AGCACATCTG ATTTATCCTC AAGCTCAAGT AATACACTCT CCTCAGCACA AGAAACACTT CTACAAGCAG CTGATGGAAC GCAACGCATG AATGACGCAG CAACAAGCGC TACTCAAGCA GTAGATAGCG CACTTGCCCA AACTTCTGCA AGCTTTGATA CGCTTAATAC TTCTATTAAT GATGCGCTTG ATAATGCTTC GACAACTGCA GGCCATGCCG CTTCTAATAT TCAAGATGCT GCAAGTAGAA TCGAGCAAGT AAAGAGTGGT TATGAATCGC TTCTTACCTC TCTTCAGTCA GTTAAAGCAG CCCTTCCAAC AAACCTACAA GGACTTCTTG ATGCTCCAAT TGCCAGGCTT CAGGGGAGCA TTGCCACCCT CCAAAATCTT CAAGATGAGC TAACAAATGC TGCCCAAAGT ATTAATGATG GTATATCCGT CTCTGCAACA CAAAGAGCTC AAATCAATGC GCTCCTATCA CAAGCATCTT CTGACTTGAG TACTACACGT TCAGACCTCC AAACAGCTCT TAAAACAAGC CTCACATCAA TCTCTACATC TCTAACTGAT GTTTCTAATA CCACTTCTAC TCTTTCTGAA TCTTTAACTC AAACACTTCA AGACATTCAA AACATCGCAA ATTCAACAGC AAACAACCTC TCAGCATTCT CCAGCACTCT TTCTAAGTCT AAAAATCAGC TTACTGATCT TGCCGACAAG CTTGATTTAA TACACGCTCA ACTCTCTGAG GCACTTTCCA GCAAAGATAT AAAAACTGTC AGGCAGATTC TATCTGCAAG TTCATCTGAT CTTGCAGAAT TTATTGCTGC GCCTACAACA CTCAAGCGAA CTGCCGTATA CCCTATTGAA AACAACGGAT CGGCCATGGC GGGTTTCTAC ACCACACTTT CTTTGTGGGT AGGTGGCATC ATTTTAGTTG CCATGCTTAA CACAGGGATT ACTGAGAGTC TCCTTAAGAA AACTGACGCA AAACCACGTC ATGCATATCT TGGAAGATTG CTTACCTTTA GCGTACTAGG CTTCTTACAG TCCACGTTAG TTGGACTGGG CGATCTTTTC TATCTGCAGA TTCAATGTGT TGACCCTGTA CGTTTTATGA TAAGTGTTTG GTTTACCTCA CTTGTATTTG TCAACATCAT GTACGCATTT ACCTATGCGT TTGGTGATAT TGGAAAAGCA ATTTGTGTCT TCTTACTTGT TATCCAAGTT GCAGGTGCAG GCGGAAGCTT CCCCGTCCAA ATGCTTCCTG AATCTTTCCA AGTTATAAAT CCACTTCTTC CCTTTGTTCA TGCCATTGCA GCAATGCACG AAAACATAGC AGGATACTAT AGCAACGTAT GGATTTCTGA ACTTGGTGCA TTATCTATCT TCCTTGCAGC CTCCCTTATG CTTGGCCTTG TTCTTAGAAA GCCAACTGCA AAACTTAATG CCTGGATTAT AGAAAAACTT GAGGATACTA AGATTATGTA A
|
Protein sequence | MKKIWTIFLA DFKAVAHNTI ALVVCVGLVI LPSLYAWLNI EGSWDPYGHT NEIKIAVANN DAGYQSDLIP VRVNIGERMV SKLSESTTIH YVITSKEDAE HGVQSGAYYA ALLIPENFSQ ELLASLSGQE NDASITYLSN QKLGAIAPIV TDKAAESART EIERSFAESV TEVGAGLIGE LTNNADDSNL SSVVTKLDET LVHGSNELRI AANHIGVYQG LVDSAQQIIE STSDLSSSSS NTLSSAQETL LQAADGTQRM NDAATSATQA VDSALAQTSA SFDTLNTSIN DALDNASTTA GHAASNIQDA ASRIEQVKSG YESLLTSLQS VKAALPTNLQ GLLDAPIARL QGSIATLQNL QDELTNAAQS INDGISVSAT QRAQINALLS QASSDLSTTR SDLQTALKTS LTSISTSLTD VSNTTSTLSE SLTQTLQDIQ NIANSTANNL SAFSSTLSKS KNQLTDLADK LDLIHAQLSE ALSSKDIKTV RQILSASSSD LAEFIAAPTT LKRTAVYPIE NNGSAMAGFY TTLSLWVGGI ILVAMLNTGI TESLLKKTDA KPRHAYLGRL LTFSVLGFLQ STLVGLGDLF YLQIQCVDPV RFMISVWFTS LVFVNIMYAF TYAFGDIGKA ICVFLLVIQV AGAGGSFPVQ MLPESFQVIN PLLPFVHAIA AMHENIAGYY SNVWISELGA LSIFLAASLM LGLVLRKPTA KLNAWIIEKL EDTKIM
|
| |