Gene Apar_0498 details


Gene Information

Locus tag: Apar_0498
Symbol:
ID: 8413347
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 568367
End bp: 570547
Gene Length: 2181 bp
Protein Length: 726 aa
Translation table: 11
GC content: 41%
IMG OID: 645022066
Product: YhgE/Pip C-terminal domain protein
Protein accession: YP_003179520
Protein GI: 257784303
COG category: [S] Function unknown
COG ID: [COG1511] Predicted membrane protein
TIGRFAM ID: [TIGR03061] YhgE/Pip N-terminal domain
            [TIGR03062] YhgE/Pip C-terminal domain
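
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 568367
end_bp = 570547

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be a stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2181
print(protein_length)  # 726
```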


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.325694
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAA TTTGGACTAT TTTTCTTGCT GATTTTAAGG CTGTTGCACA CAACACCATT 
GCTCTTGTTG TATGCGTGGG GCTAGTAATA CTGCCATCTT TATATGCATG GCTTAATATT
GAGGGTAGTT GGGATCCATA CGGACACACC AATGAAATAA AAATTGCTGT TGCTAACAAT
GACGCAGGAT ATCAAAGCGA TCTTATTCCT GTACGCGTCA ACATTGGTGA ACGTATGGTA
TCTAAACTTA GTGAAAGCAC CACTATTCAC TATGTTATTA CTTCAAAAGA AGACGCTGAA
CACGGTGTGC AGTCCGGCGC ATATTACGCA GCGCTCCTCA TTCCAGAGAA TTTTTCGCAA
GAGTTACTTG CTTCACTTTC TGGCCAAGAA AATGATGCCA GTATTACCTA TCTAAGCAAT
CAAAAACTGG GAGCAATCGC ACCTATTGTT ACCGATAAAG CTGCAGAATC TGCACGTACA
GAGATTGAAC GATCTTTTGC AGAGTCAGTT ACTGAAGTAG GCGCAGGTCT TATTGGTGAA
CTCACTAATA ACGCGGATGA TTCTAATCTT TCTTCTGTAG TAACCAAGCT TGATGAGACT
CTTGTTCATG GATCTAATGA GCTCAGAATT GCTGCTAATC ATATTGGTGT ATATCAGGGA
CTTGTTGATT CAGCTCAACA AATTATTGAG AGCACATCTG ATTTATCCTC AAGCTCAAGT
AATACACTCT CCTCAGCACA AGAAACACTT CTACAAGCAG CTGATGGAAC GCAACGCATG
AATGACGCAG CAACAAGCGC TACTCAAGCA GTAGATAGCG CACTTGCCCA AACTTCTGCA
AGCTTTGATA CGCTTAATAC TTCTATTAAT GATGCGCTTG ATAATGCTTC GACAACTGCA
GGCCATGCCG CTTCTAATAT TCAAGATGCT GCAAGTAGAA TCGAGCAAGT AAAGAGTGGT
TATGAATCGC TTCTTACCTC TCTTCAGTCA GTTAAAGCAG CCCTTCCAAC AAACCTACAA
GGACTTCTTG ATGCTCCAAT TGCCAGGCTT CAGGGGAGCA TTGCCACCCT CCAAAATCTT
CAAGATGAGC TAACAAATGC TGCCCAAAGT ATTAATGATG GTATATCCGT CTCTGCAACA
CAAAGAGCTC AAATCAATGC GCTCCTATCA CAAGCATCTT CTGACTTGAG TACTACACGT
TCAGACCTCC AAACAGCTCT TAAAACAAGC CTCACATCAA TCTCTACATC TCTAACTGAT
GTTTCTAATA CCACTTCTAC TCTTTCTGAA TCTTTAACTC AAACACTTCA AGACATTCAA
AACATCGCAA ATTCAACAGC AAACAACCTC TCAGCATTCT CCAGCACTCT TTCTAAGTCT
AAAAATCAGC TTACTGATCT TGCCGACAAG CTTGATTTAA TACACGCTCA ACTCTCTGAG
GCACTTTCCA GCAAAGATAT AAAAACTGTC AGGCAGATTC TATCTGCAAG TTCATCTGAT
CTTGCAGAAT TTATTGCTGC GCCTACAACA CTCAAGCGAA CTGCCGTATA CCCTATTGAA
AACAACGGAT CGGCCATGGC GGGTTTCTAC ACCACACTTT CTTTGTGGGT AGGTGGCATC
ATTTTAGTTG CCATGCTTAA CACAGGGATT ACTGAGAGTC TCCTTAAGAA AACTGACGCA
AAACCACGTC ATGCATATCT TGGAAGATTG CTTACCTTTA GCGTACTAGG CTTCTTACAG
TCCACGTTAG TTGGACTGGG CGATCTTTTC TATCTGCAGA TTCAATGTGT TGACCCTGTA
CGTTTTATGA TAAGTGTTTG GTTTACCTCA CTTGTATTTG TCAACATCAT GTACGCATTT
ACCTATGCGT TTGGTGATAT TGGAAAAGCA ATTTGTGTCT TCTTACTTGT TATCCAAGTT
GCAGGTGCAG GCGGAAGCTT CCCCGTCCAA ATGCTTCCTG AATCTTTCCA AGTTATAAAT
CCACTTCTTC CCTTTGTTCA TGCCATTGCA GCAATGCACG AAAACATAGC AGGATACTAT
AGCAACGTAT GGATTTCTGA ACTTGGTGCA TTATCTATCT TCCTTGCAGC CTCCCTTATG
CTTGGCCTTG TTCTTAGAAA GCCAACTGCA AAACTTAATG CCTGGATTAT AGAAAAACTT
GAGGATACTA AGATTATGTA A
 
Protein sequence
MKKIWTIFLA DFKAVAHNTI ALVVCVGLVI LPSLYAWLNI EGSWDPYGHT NEIKIAVANN 
DAGYQSDLIP VRVNIGERMV SKLSESTTIH YVITSKEDAE HGVQSGAYYA ALLIPENFSQ
ELLASLSGQE NDASITYLSN QKLGAIAPIV TDKAAESART EIERSFAESV TEVGAGLIGE
LTNNADDSNL SSVVTKLDET LVHGSNELRI AANHIGVYQG LVDSAQQIIE STSDLSSSSS
NTLSSAQETL LQAADGTQRM NDAATSATQA VDSALAQTSA SFDTLNTSIN DALDNASTTA
GHAASNIQDA ASRIEQVKSG YESLLTSLQS VKAALPTNLQ GLLDAPIARL QGSIATLQNL
QDELTNAAQS INDGISVSAT QRAQINALLS QASSDLSTTR SDLQTALKTS LTSISTSLTD
VSNTTSTLSE SLTQTLQDIQ NIANSTANNL SAFSSTLSKS KNQLTDLADK LDLIHAQLSE
ALSSKDIKTV RQILSASSSD LAEFIAAPTT LKRTAVYPIE NNGSAMAGFY TTLSLWVGGI
ILVAMLNTGI TESLLKKTDA KPRHAYLGRL LTFSVLGFLQ STLVGLGDLF YLQIQCVDPV
RFMISVWFTS LVFVNIMYAF TYAFGDIGKA ICVFLLVIQV AGAGGSFPVQ MLPESFQVIN
PLLPFVHAIA AMHENIAGYY SNVWISELGA LSIFLAASLM LGLVLRKPTA KLNAWIIEKL
EDTKIM
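
The gene and protein sequences above can be cross-checked against each other and against the listed GC content. The following is a minimal sketch, not the method used to produce this record: it builds the codon assignments for translation table 11 (which are the same as in the standard code), strips the spaces that group residues in blocks of 10, and translates up to the first stop codon. The function names `clean`, `gc_percent`, and `translate` are hypothetical helpers introduced here for illustration.

```python
# Codon table in NCBI's TCAG ordering. Table 11 uses the same amino-acid
# assignments as the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a (non-empty) DNA sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon.

    Note: this sketch does not handle alternative start codons (e.g. GTG, TTG),
    which table 11 reads as M at the initiation position. This gene starts
    with ATG, so that case does not arise here.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
first_codons = "ATGAAAAAAA TTTGGACTAT TTTTCTTGCT"
print(translate(first_codons))  # MKKIWTIFLA
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the protein sequence and the GC content listed in the record.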