Gene Apar_0935 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0935 
Symbol 
ID8413806 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1049159 
End bp1050760 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content49% 
IMG OID645022523 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003179955 
Protein GI257784738 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGT TTATTAACCG TCCACTTTCG CGTCGTAGCT TCCTTGGTGG AGCAACTGTA 
GCAGCTGGCC TTGGTCTCAC CGCTTGTGGT GGTGGACAGC AGAAGGCACC TGAAGCTCCA
TCTGGAAAGG AAGGCGGCAG CACAATTACT GCTGGTACCG CTTATTCAAC CCAGAATTAT
GATCCATCTA CCACTTCATC AGCACTTGCT CTTGGCGTTA ACTGGCAAGT AGTTGAGGGC
CTTTATGGTC TTAACTTCCA CGACTACTCT ACGTTCAATG AGCTTGCTAC TGCTGATCCA
AAGAAGGTTG ACGATAACAC CTTTGAGATT ACCATCCGTG ACGGCGCAAA GTTCTCTGAT
GGCAACGAGG TAAAGGCAGA TGACGTTGTT GAGTCCTTCA ACCGTTCTTC TGCAAAGGGC
AGCGTTTACG TCCCAATGCT TGCTCCAATT GCTTCCGTAG AGAAGAAGGA TGACAAGACT
GTTACCGTTA AGACTACTAT TGCCAACTTC TCCCTCCTCA AGGAGCGTCT GTCCATCGTT
CGCGTTGTTC CAGCAAGCAG CTCCCAGGAT GAGATGAAGA AGCAGCCAAT TGGCTCTGGT
CCATGGAAGT TTGAGTCCAT CTCTGACAAC ACTCTTGACC TTGTTCCAAA CACTAACTAC
AACGGTGCAA CTCCTGCTAA GGACGAGAAG CTCCATTATG ATGTTCTTAA GGATCCAACT
GCTCGTCTGA CCGCACAGCA GGACGGAACC ACCCTTGCTA TGGAAATGGT TCCTGCAGAC
GCTGTAGACC AGCTCAAGGG TGCAGGCTGC ACTGTCGACG CTGTTCAGGG CTTTGGTGTT
CGTTTTGCAA TGTTTAACCT TGGCAAGGCT CCTTGGGACA ACGTTAAGGT TCGCCAGGCA
TGCCTCTATG CACTTGATAC AGACAAGATG CTCTCTAATG CATTCTCTGG TCAGGCTGAG
GTTGCTACCA GCTATCTTCC TTCTTCTTTT GCTAACTATC ACAAGGCATC CACTGTTTAT
GCACATGATG TTGATAAGGC TAAGAAGCTT ATTTCTGAGT CTGGTATCAC TCCAGGTGAT
GTTGTTCTTC GTACCACTGA TAACGAGCAG GTTAAGTCCA TGGCAACTCA GGTCAAGAAC
GACCTCGACG CACTTGGATT TAACGTAAAC ATCGTTTCCG ATACTTCTCC AGCAACGTAT
GCTGCAATCG ATGCAGCCGA CGGCTCTTGG GACATCTTGC TTGCCCCTGG TGATCCTTCC
TGCTTCGGCG CAGACGTTGA TTTGCTGCTC AACTGGTGGT ATGGCGATAA CGTTTGGATG
ACCAAGCGTT GCCCATGGAA AGAGTCTGCT GAGTGGAACA AGCTTCACGA GCTCATGAAC
AAGGCTCTTG GTCAGTCTGG TGCTGATCAG CAGTCCACTT GGAATGAGTG CTTTGATATC
ATCGCTGAGA ACGTGCCTCT GTACCCAGTT CTCTTTGCTA AGACCTTCAC CGCTTCTTGG
AGCGAGAAGC CAAATGCAAA TGGTGTTGCA CTTAAGGGCT TTAAGGGCAT TGGTACTACA
GGTATGTCCT TCATTGATGT CAACACAGTT ACTGCTAGCT AA
 
Protein sequence
MSEFINRPLS RRSFLGGATV AAGLGLTACG GGQQKAPEAP SGKEGGSTIT AGTAYSTQNY 
DPSTTSSALA LGVNWQVVEG LYGLNFHDYS TFNELATADP KKVDDNTFEI TIRDGAKFSD
GNEVKADDVV ESFNRSSAKG SVYVPMLAPI ASVEKKDDKT VTVKTTIANF SLLKERLSIV
RVVPASSSQD EMKKQPIGSG PWKFESISDN TLDLVPNTNY NGATPAKDEK LHYDVLKDPT
ARLTAQQDGT TLAMEMVPAD AVDQLKGAGC TVDAVQGFGV RFAMFNLGKA PWDNVKVRQA
CLYALDTDKM LSNAFSGQAE VATSYLPSSF ANYHKASTVY AHDVDKAKKL ISESGITPGD
VVLRTTDNEQ VKSMATQVKN DLDALGFNVN IVSDTSPATY AAIDAADGSW DILLAPGDPS
CFGADVDLLL NWWYGDNVWM TKRCPWKESA EWNKLHELMN KALGQSGADQ QSTWNECFDI
IAENVPLYPV LFAKTFTASW SEKPNANGVA LKGFKGIGTT GMSFIDVNTV TAS