Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0977 |
Symbol | |
ID | 8413849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 1104358 |
End bp | 1105548 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 645022566 |
Product | protein of unknown function DUF523 |
Protein accession | YP_003179997 |
Protein GI | 257784780 |
COG category | [S] Function unknown |
COG ID | [COG1683] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01697] inosine guanosine and xanthosine phosphorylase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTTG CTATTAGCGC TTGTTTACTT GGCCTCCCCG TCAGATATGA CGGAGGGGCC AAGTCTGTTT CTGCAGTACA AAAACTTGCT CAAAAAGTAA ACGTTACAAA GATTTGCCCC GAGATTTCCG CAGGTCTTCC TGTGCCTCGT CCGCCCGCAG AGCAGCGAGA AGGACGCGTT TGGCTCAAGG ATGGCTCAGA TGTTACAGAC GATTTTGAGC GTGGAGCTAA GATTGTGCTT AGTGCCGTTA AGCCTTCTGA CATTACGCTC GCCGTTTTGA AGGCAAAAAG TCCGTCTTGC GGCGTACATG AGATTTATGA TGGTACGTAC TCAGGAAAGC TTGTTTCTGG CGAGGGAATA CTTGCTCGTC ACCTCTTAGA GGAGGGGATT TGTGTGGTTA CCGAGAAAAC CATTGAGAAT GTAAGCCCAT CTGTCGAGCA TCCTGTTGCT CTCATTTTAG GAACTGGACT TGGTCATCTT GCTGGTTTAG TAAAGCCAGT TCGTCGCATT GATTATCGTG ATATTCCTGG TTTTCCTGTT AACGCTTCTC CTATGACAGG ACACACCTTT GAGGCAACCA TTGGCACTAT CGACGGTGTG CCTGTAGTTG TTTATCCTGG TCGCGTTCAC CTTTATCAAG GCTATTCTGC AGCTGAGGTA ACCTCTCTTG TTCAGCATGC ACATCATCTA GGCTGCAGAG ATATTATTTT TGCAGGTGCA ACGGGTGCTG TTTCTGGTAA TGCAAAGACT GGCCTTGGTG TCATTACTGA CCAGATTAAC CTTACTGGAA CTAATCCACT TGCAGAGTGG GCAGGTCTTC GCGATGTTGA GACTCCATTT GTAGATATGA ATGATGCTTT CTCACCTTAT CTTAGAACGC TTGCACGTGG GGTAGCAGAC GACCTCAAAA TTGAGCTTAA TGAAGGTGTT TTTGCCGGTC TTTTGGGTCC AAACTTTGAG ACTCCTGCAG AGGTTGCTAT GCTCCGCAGT TTTGGTGTCT CATACGTTGG TGTTTCTACA GCTCTTGAGG TCATTATGGC TCGAGCACTT GATATGAATG TATTGGCTTT GACGCTTGCA GCAAATCCAG CCGGCGCTCA CGGAACAACG CATAAGAGTG TTCAAGAGGC ATCCGAGAAG TATGCAGATG ATCTTGAACG TCTTGTTCGT GGTGTTCTCG GGCTTCTTTA G
|
Protein sequence | MKVAISACLL GLPVRYDGGA KSVSAVQKLA QKVNVTKICP EISAGLPVPR PPAEQREGRV WLKDGSDVTD DFERGAKIVL SAVKPSDITL AVLKAKSPSC GVHEIYDGTY SGKLVSGEGI LARHLLEEGI CVVTEKTIEN VSPSVEHPVA LILGTGLGHL AGLVKPVRRI DYRDIPGFPV NASPMTGHTF EATIGTIDGV PVVVYPGRVH LYQGYSAAEV TSLVQHAHHL GCRDIIFAGA TGAVSGNAKT GLGVITDQIN LTGTNPLAEW AGLRDVETPF VDMNDAFSPY LRTLARGVAD DLKIELNEGV FAGLLGPNFE TPAEVAMLRS FGVSYVGVST ALEVIMARAL DMNVLALTLA ANPAGAHGTT HKSVQEASEK YADDLERLVR GVLGLL
|
| |