Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0219 |
Symbol | |
ID | 8413067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 257917 |
End bp | 259119 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 645021787 |
Product | hypothetical protein |
Protein accession | YP_003179242 |
Protein GI | 257784025 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.431687 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAGA ACGAAGTCAT TTTAAAAAGA CAGTCTACTC GGATATATTT TAAAAATGGT GATACGGATT TCTTCTTTAA TTGGTTGTTG GGAATTGGTG AAGTTTTTGG CTTTTCTCAC GGAGAACTTT ACTTTCTTAC TCAAAAGCTA GGAAAATCAC CAAAACCTGA TGACTGGAAA AATATCTTCT TATCACATGG AAACTATCTT AAACAAAAAG CAAGCAATTC AGATTTAAGT GAACAAACAA AAGCTCAGTA TTATCTAGCA CAAACCTATT CCCTTCGTTC GGCAATCCAG TTTATAAATC CATTTTCTGA TGAATACTTG TCTACCGTTC ACCAAATGGA GCAAGCGTTT TCTAACGCAA TTCATTCGCT AGGTGCACCA ATTGAAAAAC TAACCATTAC TTACCAAGAT TCCTATTTGC CTGGTTACTA TCTTCACACC GGCGACGATT GCCCAACACT GATTATGATT GGCGGCGGTG ATACTTATCG TGAAGACTTA TTTTACTTTG CAGGATATCC TGGATGGATA CGAAAATATA ATGTTCTAAT GGTTGACCTT CCTGGTCAAG GGAGCAACCC TAGTAGAGAG CTAGTCTTTG ATGTGGACGC CTCTGCTCCA ATTTCGCTAT GCATAGACTG GTTGGAAAAT AGAAATTCTA AACTGAATTA CCTAGCTATT TATGGTGTCA GTGGAGGAGG GTATTTTACT GCGCAAGCCG TTGAAAAAGA TCCAAGAATT CATGCTTGGA TTGCTAGTAC ACCCATTTAC GACGTTGCAG AAGTGTTCAG AAAAGAATTT GGATCAAGCT TAAAAACTCC CAGTTGGTTA ATGAATACTA TTTTAAAGTT AGCTGGAAAT TTAAACGAAT CTGCAAATTT AAACCTTAAA AAATATTCTT GGCAGTTTGG CACCTCTGAT TTTAAGAGCG CTATCGATGA GGTGTTCGAC CGTGCAAAGA TTGTAGATTA TCAAAAGATT CAATGTCCTT GCCTGTTTAT TATGGGAGAA GGCGAAAGTG CTGAATTACA ACACCAAACT AAGGTAATCT ATGAAGCGCT TAGATTCAAA AATCCGCAAA CGAAAATTCA AGTATTTGAA GCGGAAAGTG GTGCAGACGC TCATTGTCAA GTTAACAATT TGAGACTTGC CCATAATGTC GTTTTTGATT GGTTGGACAC TTTATTTAAA TGA
|
Protein sequence | MKKNEVILKR QSTRIYFKNG DTDFFFNWLL GIGEVFGFSH GELYFLTQKL GKSPKPDDWK NIFLSHGNYL KQKASNSDLS EQTKAQYYLA QTYSLRSAIQ FINPFSDEYL STVHQMEQAF SNAIHSLGAP IEKLTITYQD SYLPGYYLHT GDDCPTLIMI GGGDTYREDL FYFAGYPGWI RKYNVLMVDL PGQGSNPSRE LVFDVDASAP ISLCIDWLEN RNSKLNYLAI YGVSGGGYFT AQAVEKDPRI HAWIASTPIY DVAEVFRKEF GSSLKTPSWL MNTILKLAGN LNESANLNLK KYSWQFGTSD FKSAIDEVFD RAKIVDYQKI QCPCLFIMGE GESAELQHQT KVIYEALRFK NPQTKIQVFE AESGADAHCQ VNNLRLAHNV VFDWLDTLFK
|
| |