Gene Apar_0093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0093 
Symbol 
ID8412936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp101349 
End bp102767 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content32% 
IMG OID645021660 
Producthypothetical protein 
Protein accessionYP_003179120 
Protein GI257783903 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.891852 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGATT TACTTTCACT ACAAAGCAAA ATACAAGATG TATACGTAAG ACAGATTTAT 
TGTGAGGCGG TTGCTAGCTA TAATGCTCAA GCTTATAGAT CGGCAATTTT GACTGCATGG
CTTGCTGTGT ATGTTGATTT AATGAAAAAA ATAGAATCAA TAGGAGGTAA TAAAATTGCA
GAGGAATTTC AAACAAAAGT AAATAAAATG CGACTTGAGA AAAACGACTC AGCTAGGATT
AAATCTGCAT TAGATATAGA AAAAGGAATA ATTTCGACCG CTAAGGATTT ATCACTTATT
GATGAAGCTG AGGAAAAATT TTTAAGAGAA TTACATGAGT GTAGACACAA GTGCGCCCAT
CCAACTACAG ATGATACTGT ATATATTTTT GAGCCTACCG AAGAGCAAGT AAGATATCTT
TTAAGTGGTG TAATAGATAA CTGTCTATCA TTTAGCGCAC TTCCAAAAAA TAATCAAATA
ATACAAATTT TGATGAATGA TTTATCGAAA GATTTCCCAT TAGAGCAGGA TTTATTTGAA
TTTTACAAAT CAAAGTATAT TGATAAAATT CCACAGAATA CGCAAAGACA GCTTATTAAG
ATAATTGCCA TAGAAGCGGT ATGTCCGTCG TCAAAAGAAA AATGGGCGGA GTGCGGACTT
GAAATATCAA GTCCAGATTT GATTGCAAAA AGGTGTATGC AAATATTGAA ATGCATAAAT
ACGTTTAGCA AAGATCTTTT AATAGAAGTT TTCACCAATC AATCTAAAAA ATTATCCAAC
GGTGACTCTT CCTATAGATT TGTTGGTGTA TTTTCTTCAT TTGATTTCTT TAGGGATCAC
TTAGACCGAG ATCTATACTT TATATGCAAA GCAAAGTTTA ACAAAGCAAT TGAAAGTGAG
TACGATAAGC CTTGGGAGTT GCTTCTGAAT GGATTTCCGT ATGATCAAGA ATTACGAGAA
GAATCTGAGA AGTTATTTAA TTCTGATTAT TTTTTAAGCC ATGAGAAAAA TTTGACTGAA
TTATCAAAAA ATGGAGATTT GGACAACGAT GAGCTAAAGA AGCTTGTTGA TTGTTGTATA
GATAAATTGG AAAAATCAAG TTCATATAGT GAAGCAGATT ATCTTGCAAG ATTAATAGTT
GAACTCGCTC CAGTACTGGA GGGTAATGAT ATTTTAAAAA TCTCTGCTAT TTTGTTTAAA
AACAATCAAG TATTCGAATC TTTTAGTATG GATAGATTAA TCAAAAATAT TGCTTTAAAT
TCTATGAAGA AGGAGACTGC GAATTATTGG AAAGAGTTTG CTGAAAACGG TATGGAAAAG
AAAAAACCAG AGTTATTGAA TCCAGATTTA TCTCCAAGTT ATGATTCAGT AATGAAGTGG
ATTTATAACC GTGCTATCGA AGAGTTAAAA AAGTCTTAG
 
Protein sequence
MRDLLSLQSK IQDVYVRQIY CEAVASYNAQ AYRSAILTAW LAVYVDLMKK IESIGGNKIA 
EEFQTKVNKM RLEKNDSARI KSALDIEKGI ISTAKDLSLI DEAEEKFLRE LHECRHKCAH
PTTDDTVYIF EPTEEQVRYL LSGVIDNCLS FSALPKNNQI IQILMNDLSK DFPLEQDLFE
FYKSKYIDKI PQNTQRQLIK IIAIEAVCPS SKEKWAECGL EISSPDLIAK RCMQILKCIN
TFSKDLLIEV FTNQSKKLSN GDSSYRFVGV FSSFDFFRDH LDRDLYFICK AKFNKAIESE
YDKPWELLLN GFPYDQELRE ESEKLFNSDY FLSHEKNLTE LSKNGDLDND ELKKLVDCCI
DKLEKSSSYS EADYLARLIV ELAPVLEGND ILKISAILFK NNQVFESFSM DRLIKNIALN
SMKKETANYW KEFAENGMEK KKPELLNPDL SPSYDSVMKW IYNRAIEELK KS