Gene Apar_0823 details

Gene Information

Locus tag: Apar_0823
Symbol:
ID: 8413688
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 907438
End bp: 908739
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 47%
IMG OID: 645022405
Product: ABC-type sugar transport system periplasmic component-like protein
Protein accession: YP_003179843
Protein GI: 257784626
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
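
The start, end, gene length and protein length fields above can be checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; neither is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 907438
END_BP = 908739
GENE_LENGTH_BP = 1302
PROTEIN_LENGTH_AA = 433


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the 1302 bp span corresponds to 434 codons, that is, 433 residues plus the stop codon.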


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.596738
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000464353
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGAAAGG TCACTACTAT GGATGTGTCT CGTCGCAGTT TTCTAAAGTT CTGTGGTCTG 
GGCGTTCTTA GCGTTGGTGG ATCTAGCGTT CTTGCAGCTT GCGCACCTAG CGAGAAAAAA
GATGATACCG CTAAAAACAC AAAGGTAGAG AACAAGGATA AGCCTGTTGT CTTCTTTAAC
CGTCAGCCTT CCAACAGCTC AACTGGTGAA CTTGATACCA ACACCCTTAA CTTCAACAAG
GACACCTATT ACGTTGGCTT TGATGCAGTT CAGGGTGCAG AGCTCCAGGG CCAGATGGTT
CTTGATTACA TCAAGGCTCA TGCAGCTGAG CTTGATCGCA ACAAGGACGG CATCATTGGT
TACGTTCTTG CAATTGGTGA CATTGGTCAC AATGACTCTA TCGCTCGTAC TCGTGGCGTT
CGTAAGGCAC TTGGTACTGG CATTGAGAAG GATGGCAAGA TTATTTCTGA TCCAGTAGGT
ACTAATACTG ATGGCTCCGC TTCTGTTGTC CAGGACGGTA AGCTTGAGGT TGGTGGCAAG
TCCTACACCA TTCGTGAGCT TGCTTCTCAG GAGATGAAGA ATACCGCTGG TGCAACATGG
GATGCTGCAA CCGCTGGTAA CGCAATTGCT GCTTGGTCTT CTTCCTTCGG TGATCAGATT
GATGTTGTTG TTTCCAATAA CGACGGTATG GGTATGTCTA TCTTCAACGC ATGGTCCAAG
GCTCAGAAGG TCCCAACCTT TGGTTACGAT GCTAACTCTG ACGCTGTTGC AGCTATCGCT
GATGGCTATG CAGGCACCAT TTCTCAGCAC CCAGACGTTC AGGCTTACCT GACACTTCGT
CTGCTCCGTA ACGCTCTTGA CGGCGCTGAT ATCAATACTG GTATTGAGTC TGCAGACGAT
GCTGGCAATA AGATTGACTC CAAGGATTAC AAGTATGTTG CAGAGCAGCG TTCTTACTAT
GCTCTGAACC TTGCAGTTAC TGCTGAGAAC TACAAGGACA ATCTTGATGC AACTACTACT
TACAAGGATG CTTCTGCTCA GCTTAGCGCA GACAAGCACC CAGAGAAGAA GGTTTGGCTC
AACACCTATA ACTCTGGTGA CAACTTCCTT GGTTCAACCT ATGTTCCACT GCTCAAGAAG
TATGCTCCAC TTCTCAACCT CAACGTTGAG TTTATCGCAG GCGATGGCCA GACTGAGTCC
AACATTACCA ACCGTCTTGG CAACCCTGAT GAGTACGATG CATTTGCATT CAACATGGTT
AAGACCGACA ACGGTTCTTC CTATACCCAG CTGCTTAAGT AA
 
Protein sequence
MRKVTTMDVS RRSFLKFCGL GVLSVGGSSV LAACAPSEKK DDTAKNTKVE NKDKPVVFFN 
RQPSNSSTGE LDTNTLNFNK DTYYVGFDAV QGAELQGQMV LDYIKAHAAE LDRNKDGIIG
YVLAIGDIGH NDSIARTRGV RKALGTGIEK DGKIISDPVG TNTDGSASVV QDGKLEVGGK
SYTIRELASQ EMKNTAGATW DAATAGNAIA AWSSSFGDQI DVVVSNNDGM GMSIFNAWSK
AQKVPTFGYD ANSDAVAAIA DGYAGTISQH PDVQAYLTLR LLRNALDGAD INTGIESADD
AGNKIDSKDY KYVAEQRSYY ALNLAVTAEN YKDNLDATTT YKDASAQLSA DKHPEKKVWL
NTYNSGDNFL GSTYVPLLKK YAPLLNLNVE FIAGDGQTES NITNRLGNPD EYDAFAFNMV
KTDNGSSYTQ LLK
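
The relationship between the two sequences above can be illustrated by translating the gene sequence codon by codon. This is a minimal sketch, not the database's own method: the helper names are hypothetical, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which codons may act as start codons, which does not matter here because this gene begins with ATG).

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = clean_sequence(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence listed above.
print(translate("ATGAGAAAGG TCACTACTAT GGATGTGTCT"))  # MRKVTTMDVS
print(translate("CTGCTTAAGT AA"))  # LLK
```

The two fragments reproduce the first ten and the last three residues of the listed protein sequence; passing the full gene sequence to `translate` would perform the same check over the whole record.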