Gene Apar_0917 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0917 
Symbol 
ID8413788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1028299 
End bp1030077 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content43% 
IMG OID645022505 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003179937 
Protein GI257784720 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.340529 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGATC ATATGGCTGA CAGTTTTTCC CGTCGTTCGT TCTTTGCTCT GACTGGTATC 
ACCGCTGCAG GCCTTGGCCT TGCTGGCTGT GCTCCATCAA ACTCTCAGGG TGTCGGCTCT
TCCAACACTG GAACTGAGCC TCAAGACGGA TCTCCTGCAA ACACTCCTCT GGATCAGCTT
CCTCTTCCAG AGAAGGGCAA GAAGTACAAC AATCCTAAAT CTCGTGATGA GGTTCAGGAT
GGCGGTACTT TAACTCAACC AATTACTGAA GTTGGACCTC AGTGGAATTA CTACAATCTT
GCTGGTAATA CCACTTACAT GAATATTCTT CATGGTCTTA TGAATCCACG TGATTTGTTT
TATGGCAATA TTGACGGTAG TAAGTTTGAG CCTAATAAAG ATTATATTAA GGATTATAAG
GTAGAGGAGA AGGACGGCAA GCAGGTTGCT ACCCTTACTT TTACTGATCA GGCTAAGTTT
AATGATGGTA CCGATATTGA CTGGACCGCA ATTCAAACAG CATATGTTTG TCTGAGCGGT
ACAAATGAAA AATTTGAGGT TTCAAATACT GACGGCTATG ACAAGATTGA GTCTGTTGCT
CAAGGTGATA CTGCTAAAAC CGCTGTTGTT ACTATGAAAG AAGCAGTCTA TCCAATTGAA
ATGGTTTTGT CTTGGGCACT TCATCCAAAG CTTCAAGATC CAGATTTCTT TAATAATGGC
TACAACAATG AGCCAAATAA TGATCTTGGT GCTGGCCCTT ATATTGTTGA CTCACACGAT
GATTCTTCTG CTACCTTTAA GCCCAATCCT AAGTGGTGGG GAGACGCTCC TAAGCTTGAT
ACGCTTGTGT ACAAGCAGAT GGATACTCAA GCTACTATTA ATGCCTTCAA GAATGAAGAA
GTAGACACTG CTGGTGGCTC TACCTCAGGT TCTGCAGAGC TTCTCTCTAA CTTCTCTGGC
CTTGGTGATA AAGCACAGAT TCGTCGTGGC CTAGGTTTAT CTATTGCTGT TATTGAGGTT
AACTCTACTC GTGGTGTACT CCAGGATGTA GCGGTTCGTA AGGCATTCTG TCAAGTTGTT
GATCCAGCAA CTCTTGTATC CATTGTGTTC CAGGGCGTTA ACTGGAAAGA AGATACTCCT
GGTTCTATGC TTTGGCCGTA TTGGGCTGCG GGCTATGAGA ATAATCTTCC TGATGATGTC
AAGAACCTTA AGTCTGCTGA AGAGCGCACC AAGGCCGCTA AGAAGACCCT TGAGGATGCA
GGCTATTCTC TCAATGGTGA TTTCTATGAG AAGGATGGTC AGCAGGTTAC CTTTGCTTAC
ACGATGTTTG GCGATGGTAC TAATGTAAAG AACCGTGCTG CAGCAATTCA GAAGATGTGT
AAAGATGCTG GTATTAATAT GACGCTTGAC TCTCACCCAT CTTCAGAGTT CTCTAAGGTT
CTTACCTCTG GTTCTTGGGA TGTCTGCCTC TTTGGTTGGA ATGGAACCTC AACTTCTTAC
AACAATGGCG TTCAGCTTTA TGGTTCCACC TCTGCATCCA ACTTTGGTAA GCAGGGTACT
GCAGACACCG ATGCAATGTT TGCAAAGGTT GTCTCTACCA AAGACTTTGA TGAGCGTATA
AAACTTATGA ATCAGGCAGA GAAAAAGATG ATGGAAACTT ACTGTTATCT TCCAGTTTAC
ACAGGCTCTG ATTGCTATGT TGTTAAGAAG GGTCTTGCCA ACTTTGGTCC TAAGCTTTTT
GAGGATGTTC CATATACCAT TGTTGGTTGG CAGAAGTAA
 
Protein sequence
MGDHMADSFS RRSFFALTGI TAAGLGLAGC APSNSQGVGS SNTGTEPQDG SPANTPLDQL 
PLPEKGKKYN NPKSRDEVQD GGTLTQPITE VGPQWNYYNL AGNTTYMNIL HGLMNPRDLF
YGNIDGSKFE PNKDYIKDYK VEEKDGKQVA TLTFTDQAKF NDGTDIDWTA IQTAYVCLSG
TNEKFEVSNT DGYDKIESVA QGDTAKTAVV TMKEAVYPIE MVLSWALHPK LQDPDFFNNG
YNNEPNNDLG AGPYIVDSHD DSSATFKPNP KWWGDAPKLD TLVYKQMDTQ ATINAFKNEE
VDTAGGSTSG SAELLSNFSG LGDKAQIRRG LGLSIAVIEV NSTRGVLQDV AVRKAFCQVV
DPATLVSIVF QGVNWKEDTP GSMLWPYWAA GYENNLPDDV KNLKSAEERT KAAKKTLEDA
GYSLNGDFYE KDGQQVTFAY TMFGDGTNVK NRAAAIQKMC KDAGINMTLD SHPSSEFSKV
LTSGSWDVCL FGWNGTSTSY NNGVQLYGST SASNFGKQGT ADTDAMFAKV VSTKDFDERI
KLMNQAEKKM METYCYLPVY TGSDCYVVKK GLANFGPKLF EDVPYTIVGW QK