Gene Information |
Locus tag | Apar_0756 |
Symbol | |
ID | 8413621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | + |
Start bp | 837579 |
End bp | 838775 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 645022338 |
Product | NLP/P60 protein |
Protein accession | YP_003179776 |
Protein GI | 257784559 |
COG category | [M] Cell wall/membrane/envelope biogenesis; [S] Function unknown |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins); [COG3883] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
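The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span includes the terminal stop codon; the record does not state either convention explicitly, but both agree with the listed values. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the span ends with a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(837579, 838775)
print(length)                  # 1197
print(protein_length(length))  # 398
```

Both results match the Gene Length (1197 bp) and Protein Length (398 aa) fields of this record.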
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.640011 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACAGC GCCGATCTTT CACAAGACGA GACGCTCTCT TATTTGCAGG TCTTGGTGTA GCTGCTACGC TGCTTACTCC AGCTAGGCTA TTCGCAGACC CACAAAGCGA TCTTGAGGCC GCATCTGCCC AGTTAGACTC ACTTGGTGCA GCTCTTGCAG AAGCCATGGA TAACCTTAAC GAGAAAACCT ATGCACTTGA CGCTACCAAC AACAAGATTG GTGAAGTTCA AGAGCAGATT GCAGAAACCA CAAACCAGCT CAATCAGCAG CGCCTGGTCC TTTCTGCAGC AATGAGGAGT GCTTATAAAG CAGGCCCACA GGAAACTTTG GACTTCCTCC TTGGAGCATC AAGCCCAGAG GACTTTGTCA GCCGTGTCTA TTATATGGAC CGCACTAGCA AGCAAGAGGC TGACTCCATT AATACGGTTA AGACTCTTGG CGATCAACTT CAGGCTCAGC AGCTTGAGCT TCAAGCCGAG CAAGAGAATC TCCAAGCTCA GGTCGCAGAA ATGAAAACAA CCGCAGATGG TCTTCAAAGC CAAGTTGCGG AAGCTAAGGC TTACTATGAC TCTCTTGATG CAGAGGTAAA AGCACAGCTT GCCGCTCAAG AGGCTGCATC AGCAAATAAC AACGTTGCTT ATGCAATTGA GACTGTTACT CGTGAGAACC CCTCTAACAG CTCTGAGTCC AACGATAGCC CCTCTAACAG TTCAAATTCA AACTCCAGCT CTTCGAGCTC TAGCAACTCC AGTAGCAGTT CTAACTCTGG TAGTGGCTCT AACTCTAGTA GTGGGTCTGG TTCAGGCAGC GGATCACACT CCAGCGGTGG TGGCGGTGGT TATCCAGCCG CAGGCGGCGG CGTTGCCACT GCTTATGCTT GCATTGGCTA TCCATACGTT TGGGGTGGAG CTTCTCCTGC TTCTGGCTTT GACTGCGGTG GTCTGGTCTA TTACTGCTTC CTTGGTTATC GCAAAGGTAC TGCAGGAACC ATTGGACGCG CCATTCGTGC TGCTGGTAAC TGGCACGATT CCATGGATGA GCTCAATTAT GGTGACATCA TCTTTACCCG TGCAGGCTAC GAGCATGTTG GAATCTATAT CGGCGGTGGT CGCATGATTC ATGCAGCCAA CGAGTCTGTC GGCGTCATCG AGGGTCCTGT TTACGCTTGC TATGGTGGAG GACCATTCTC TGGCTAA
|
Protein sequence | MSQRRSFTRR DALLFAGLGV AATLLTPARL FADPQSDLEA ASAQLDSLGA ALAEAMDNLN EKTYALDATN NKIGEVQEQI AETTNQLNQQ RLVLSAAMRS AYKAGPQETL DFLLGASSPE DFVSRVYYMD RTSKQEADSI NTVKTLGDQL QAQQLELQAE QENLQAQVAE MKTTADGLQS QVAEAKAYYD SLDAEVKAQL AAQEAASANN NVAYAIETVT RENPSNSSES NDSPSNSSNS NSSSSSSSNS SSSSNSGSGS NSSSGSGSGS GSHSSGGGGG YPAAGGGVAT AYACIGYPYV WGGASPASGF DCGGLVYYCF LGYRKGTAGT IGRAIRAAGN WHDSMDELNY GDIIFTRAGY EHVGIYIGGG RMIHAANESV GVIEGPVYAC YGGGPFSG
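The gene and protein sequences above can be related to each other, and to the GC content field, with a short script. This is a minimal sketch, not the database's own method: the helper names (clean, gc_fraction, translate) are hypothetical, and the codon table is written out by hand. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it assumes the sequence is given on the coding strand, starting at the first codon.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (sequence must be non-empty)."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed in this record.
prefix = "ATGTCACAGC GCCGATCTTT CACAAGACGA"
print(translate(prefix))  # MSQRRSFTRR
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to translate and gc_fraction is a way to check the Protein sequence and GC content fields of this record.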