Gene Apar_0448 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0448 
Symbol 
ID8413297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp516230 
End bp517126 
Gene Length897 bp 
Protein Length298 aa 
Translation table11 
GC content46% 
IMG OID645022016 
Productextracellular solute-binding protein family 3 
Protein accessionYP_003179470 
Protein GI257784253 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.917749 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTTCT TTGAAAATAC TTCCACTTCT CTTTCTCGTC GTGCTTTCCT TGGTGCAACC 
GCACTTTCTG CTGCAACTCT TTTGGCAGCT TGCAAGAAAA AGGATTCCGA TGGAGCTCAG
GGCGGATCTT CTGATACAGA CTCAAAGTTT AGAACTCTTG ATGATATCAA GAAGGCAGGC
ACAGTAAATA TCGGCGTCTT CAGCGATAAG GCTCCTTTTG GTTACGTTGA TGCAAACGGT
AATTACGCTG GTTACGACAT TGTTTTTGCT GAGCGTCTTG CAAAGGATAT GGGCGTAAAG
ATTAACTACA TTGCCACTGA CGGACAAAAT CGCGTGCCTT TCTTGCAGTC AAATAAGGCA
GATATCATGC TGGCAAACTT TACGGTAACT GATGACCGTA AAGAGAAGGT TGATTTCTCC
CTGCCATACA TGAAGATCAA GCTGGGTGTT GTTTCTCCAG CTTCTGCGCC AATCACAGAC
GTTTCTCAGC TTGATGGCAA GAAGCTTATT GTTTCTAAGG GAACCACAGC TGAGGTTTGG
TTTGCTAAGA ACGCACCTAA GGTTGAGCTG GTGAAATTTG ATAGCTACGC TGATGCGTAT
AACGCGCTGC TCGACGGCCG CGGAGACGCG TTCTCAACTG ACAATACCGA GGTACTTGCA
TGGGTTAAGT CTAACCCTGG CTTTATTGTT GGCATCGATG ATCTGGGCGA CTCCGATACT
ATTGCTGCAG CAGTTCACAA AGGCAATTCT TCGCTATTGG AGTTTATCAA CAATGAGATT
ACCGGTCCTC TTGCAGAGGA GAACTTCTTC CACAAGGCCT ATGAAGAGAC GCTTGCTCCA
ATTTATGGCG ACGAGGTAGA TCCAAATACC ATGGTTGTCG AGGGCGGCAA AATTTAA
 
Protein sequence
MNFFENTSTS LSRRAFLGAT ALSAATLLAA CKKKDSDGAQ GGSSDTDSKF RTLDDIKKAG 
TVNIGVFSDK APFGYVDANG NYAGYDIVFA ERLAKDMGVK INYIATDGQN RVPFLQSNKA
DIMLANFTVT DDRKEKVDFS LPYMKIKLGV VSPASAPITD VSQLDGKKLI VSKGTTAEVW
FAKNAPKVEL VKFDSYADAY NALLDGRGDA FSTDNTEVLA WVKSNPGFIV GIDDLGDSDT
IAAAVHKGNS SLLEFINNEI TGPLAEENFF HKAYEETLAP IYGDEVDPNT MVVEGGKI