Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0955 |
Symbol | |
ID | 8413826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | + |
Start bp | 1075121 |
End bp | 1076725 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 645022543 |
Product | 4-phytase |
Protein accession | YP_003179975 |
Protein GI | 257784758 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00228685 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGAGC ACGTATTTAA TAATCTTTCC CGTAAATCAT TTTTACGTGG ATCTTTGGCG GCTGCCGTTG CCGCCGGTTC TGCTTCACTA CTTTCTGCTT GTGGTGGATC TGCTAGCGGA GATGGTGAAA AAAAGGTATT ACGCTTTGGC GTAAACAATC CAAAAGTTAC CTTTGATACT CAAAAAACCT CTGGCTCTGT TGGTGTATCT GAAGCAGTTG CAGAATCTTT ACTGGTGCTT AACCCAGACA CAAAAGAGAT TGAGCCAAAT CTGGTAACTG GTCTTCCTAC TGTTTCTGAT GATGGTCTTA CTTACTCATT TGAGCTTAAA GATGGCGTCA AATTCCACAA CGGAGAAACC CTTAAATCAT CTGACGTTAA GTACACCTTA ACGCGCATGT TCTTGCCTGC TACCAAGGCA ACCTCAATTG ACTCTTATGC TTACATTGAG GGTGCTAAAG ACATTATTGC TGGTAAAACT GAAGAGCTTT CTGGCGTTGT TATTAAAGAC GACCGTCACT TTGACATTAA GCTAACCCAA CCGTATTCAA CCTTCAATGC AATCATGGCT CAATTCTATG CTGTCATCTA CCCAGAGAAG GCCTGCAAAG AAGCTGGTGA AGCGTGGGGT ACTGAGACCA ATTTCATTGG CACTGGTGCA TACAAGCTTG TCTCAAACGA TAGCGCTACA GAAGTTGTTC TTGAGGGATT TGCTGACTAC CATGAAGGTA AACCTGGCCT GGATGAGCTT AGATTTGTTT ACATTGACGA CGCTAACACC CGTGTTCTTA ACTACAAGAA CAACGATGTT GACCTGGTCT TTATTTCTCA GTCTCTTATT CAGCAGTATC AAAACGATGA ATCTATCTCC AAAGAAATTG TCAATTACAC ACCTGCATCC ACGCAGTTTG TAAACTTGAA TCTTCAGAGT CAGAACCTCA AAGATGTTCG TGTTCGTCAG GCGCTCTCAA TGGCAATTGA TAGAGACACC ATTTGTAGCA CCATTCTTTC TAATGTTGCT AAACCTGCAA AGTCATTTAT TCCATCTTCT GAGACTGGTT ACAATGAGTC CGCTCCAGAG TTTGAGTACA ACGTAGACAA AGCAAAACAG CTTCTTGCAC AAGCAGGTGT TTCTAACCTT ACTCTTAACG CACAGGTTAG AAGCCAAGAC CAGAACCTTA TGGTTGCTAT ACAGGATGCA TGGTCCAAGA TTGGCGTTAC TTGTAACGTA AGCGTTATTG ACTCTGGTGT TTGGAGCGAC GCACGCGCCA ACGGAGAGCT TGAGGTTACC TTGGTTACCT GGTCTACCCT CTCCTTCCAA GGCATTGAGC ACATGGGCTC CTACTTCCGC TCTGACCGCG CTGCAAAGAA GTCTTCCTTC TACAACAGTT CCGAGTTTGA CAACCTTGTT GACCAAGCTC GCCTCACAGT CAATGACAAT GACAAAGTTC TTGAGCTTAC CCGTCAGGCA GATAGTCTCA TGACACACAC AGACTACGCT TGCCTGCCTA TTGACTGGCC ACAGATGCCC TACGTCCTAA AGCCAGAATT TACGGGCCTC AGCGTTCTGG TCAATCCTCA CTTTGATAAG GTTAAGAAGA AGTAA
|
Protein sequence | MKEHVFNNLS RKSFLRGSLA AAVAAGSASL LSACGGSASG DGEKKVLRFG VNNPKVTFDT QKTSGSVGVS EAVAESLLVL NPDTKEIEPN LVTGLPTVSD DGLTYSFELK DGVKFHNGET LKSSDVKYTL TRMFLPATKA TSIDSYAYIE GAKDIIAGKT EELSGVVIKD DRHFDIKLTQ PYSTFNAIMA QFYAVIYPEK ACKEAGEAWG TETNFIGTGA YKLVSNDSAT EVVLEGFADY HEGKPGLDEL RFVYIDDANT RVLNYKNNDV DLVFISQSLI QQYQNDESIS KEIVNYTPAS TQFVNLNLQS QNLKDVRVRQ ALSMAIDRDT ICSTILSNVA KPAKSFIPSS ETGYNESAPE FEYNVDKAKQ LLAQAGVSNL TLNAQVRSQD QNLMVAIQDA WSKIGVTCNV SVIDSGVWSD ARANGELEVT LVTWSTLSFQ GIEHMGSYFR SDRAAKKSSF YNSSEFDNLV DQARLTVNDN DKVLELTRQA DSLMTHTDYA CLPIDWPQMP YVLKPEFTGL SVLVNPHFDK VKKK
|
| |