Gene Apar_0604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0604 
Symbol 
ID8413461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp673442 
End bp674752 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content44% 
IMG OID645022179 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003179625 
Protein GI257784408 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.293813 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGGTA TCAATCGTAG GCAGTTTGTC ACCTTAAGTG CATCAGCTCT TTGCTCACTG 
GGTCTTGTTG CTTGTGGAGG AAATAACAAC CAAAAGCAGG GAGGATCTGA TTCAGCAAAA
GGTTCAGTTT ATTTTCTAAA CTTCAAGCCT GAGGCTGATC AGCAGTGGAA AGATCTTGCA
GCTAAGTATA CTGAGGAAAC TAAGGTTCCA GTAAAGGTAG TAACTGCTGC TTCTGATACC
TATGCACAGA CATTCCAGTC TGAGATTAAC AAGGACGCTT CTGTAGCACC AACACTGTTC
CAGACAAATG GACCTTCCGG TCTTATTGGT GTAAAGGATT ACTGTATTGA TCTTAAGGGC
GCCAAGATTC TAGATGAGCT CACAAATGAT GCTCTTAAGC TTCAGGAGGA TGGCGTTGTC
TATGGCGTTG ACTATGTAGA GGAAGACTAC GGCATTATCT ACAACAAGAA GTTGCTTGAG
AAGGCAGGCT ATAAGGGAGA TGACATCAAG GACTTTGCTT CTCTGAAGAA GGTTGTTGAG
GATATTCAGT CTCGCAAAGC AGAACTTGGT GTAAAGGGAG CATTTACCAG TGCTGGTCTT
GATTCAAGTT CCGACTGGCG CTTTACCACT CACTTGGCTA ACCTCCCTCT TTACTATGAG
TTCAAAGACA CTGGAAAGCC AGATGCTAAA GAGATTAAAG GAACCTATTT GCCTAACTAC
AAGCAAATTT TTGATCTCTA TATTAACAAT GCTACTTGTA GTCCTACTGA GCTTGCTGGT
AAAACGGGCG ATGACGCTGT TGCAGAGTTT GTTAACGGCG AGGCTGTCTT CTATCAGAAT
GGTACTTGGG CATACAATGA CATCAAGGGA CTTGGTGACG ATGCTCTTGG CATGATTCCA
ATTTATATTG GTGTTAAGGG TGAAGAGAAG CAGGGTATGT GCTCTGGCGG TGAGAATTAT
TGGTGCGTTA GCTCAAAGGC GGACGACGCT TCCCAGAAGG CAACACTTGA TTTTATGTAC
TGGTGTGTAA CTTCTGATAC TGCTACTACA GCAATTGCAG AAGATATGGG CCTTACTATT
CCATTCAAGA ATGCAAAAGC AACTAAGAAT GCTCTTGCAA ACATTGCAGC TGAGTACGCA
AAGAAGGGCA ATGAGTCCGT AGCTTGGGAC TTCATTTACA TTCCTTCACA GGAGTGGAAG
AACAACGTTT CCAGCGCACT TAAGGGTTAT GCAGCAGGAA CAGAGGACTG GGATGCCGTA
AAGTCTGCAT TCGTTGATGG TTGGAAGACT GAAAAAGAGG CCAACGCATA G
 
Protein sequence
MSGINRRQFV TLSASALCSL GLVACGGNNN QKQGGSDSAK GSVYFLNFKP EADQQWKDLA 
AKYTEETKVP VKVVTAASDT YAQTFQSEIN KDASVAPTLF QTNGPSGLIG VKDYCIDLKG
AKILDELTND ALKLQEDGVV YGVDYVEEDY GIIYNKKLLE KAGYKGDDIK DFASLKKVVE
DIQSRKAELG VKGAFTSAGL DSSSDWRFTT HLANLPLYYE FKDTGKPDAK EIKGTYLPNY
KQIFDLYINN ATCSPTELAG KTGDDAVAEF VNGEAVFYQN GTWAYNDIKG LGDDALGMIP
IYIGVKGEEK QGMCSGGENY WCVSSKADDA SQKATLDFMY WCVTSDTATT AIAEDMGLTI
PFKNAKATKN ALANIAAEYA KKGNESVAWD FIYIPSQEWK NNVSSALKGY AAGTEDWDAV
KSAFVDGWKT EKEANA