Gene Information |
Locus tag | Apar_0239 |
Symbol | |
ID | 8413087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | + |
Start bp | 280803 |
End bp | 282029 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 645021807 |
Product | diaminopropionate ammonia-lyase |
Protein accession | YP_003179262 |
Protein GI | 257784045 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1171] Threonine dehydratase |
TIGRFAM ID | [TIGR01747] diaminopropionate ammonia-lyase family; [TIGR03528] diaminopropionate ammonia-lyase |
|
|
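The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in Protein Length; both assumptions are consistent with the values in this record, but the record does not state them. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is assumed to be a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(280803, 282029)
print(length, protein_length_aa(length))  # prints "1227 408"
```

The result agrees with the reported Gene Length (1227 bp) and Protein Length (408 aa).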
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000784643 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.351019 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCCAG AAATGAAGTG GGTACTTAAT AAGATGCCTC AGAGTGAAGA CAGAAATCTT CAGGTCATGT CACTTGAAAA TGTTAAGAAG GCTCGCGCAT TTCATAAAAG CTTTCCTCAA TATGCCGTGA CACCTTTGGC AAATCTTGAG GGTATGGCCT CTAATTTAGG TCTTGGTGGA TTATACGTAA AAGATGAGTC GTATCGCTTT GGACTTAACG CATTTAAAGT CCTGGGTGGT TCATTTGCTA TGGCTCGCTA TATTGCTGAT GAAACAGGAA AAGATGTTTC TGATTGCGAC TTTGAGTATC TGACTTCAGA GCAATTGCAG AAAGACTTTG GACAGGCAAC TTTCTTTACT GCAACCGATG GTAACCATGG CCGCGGTGTG GCATGGGCAG CAAATCGTCT TGGTCAAAAA GCTGTTGTTC ATATGCCTAA AGGCTCTACA AAAACTCGTT TTGACAATAT TGCAAAAGAG GGCGCACAAG TAACCATTGA AGAGCTCAAT TACGATGACT GTGTTCGTCT CGCAGCAAAA GAAGCAGAAG AGACTGAGCA TGCAGTAATC GTACAGGACA CTGCTTGGGA TGGTTATGAG AAGATTCCAT CTTGGATTAT GCAGGGTTAC GGTACTATGG CCTATGAGGC AGCAGAACAG CTACGTGAGC TTGAGGTAAA CCGTCCAACA CATGTATTTA TACAAGCAGG CGTTGGATCT TTGGCATCTG CCATGGTAGG CTACTTTACT AATTTATTCC CTTCAAATCC TCCTAAGTTT GTCATTATGG AAGCAGGAGC TGCAGATTGT CTGTATAAGG GAGCACTTGC AGCAGATGGC GAGCCTCGTA TTGTTGGTGG AGATTTGATT ACTATTATGG CTGGTCTTGC CTGTGGTGAG CCAAATACCA TTGGTTGGGA TATTTTACGC AATCATGCGA CAGCCTTTAT TTCTTGTCCT GATTGGGTTT CGGCAAAGGG CATGCGCATG CTCGCTGCTC CTGTTAAGGG AGACCCTTCT GTTGTTTCCG GTGAGTCTGG CGCTGTTGGT ATGGGTGTGA TTTCTACTCT TATGACAGAC CCTGCATACA AAGAGCTTCG TGATGCGCTT GATCTTACAA CAGATTCAAA GGTTCTTTTG TTCTCTACAG AAGGAGATAC TGATCCTGTG CGTTATGAAG AGATTGTATG GGACGGTGCA TGGCAGTCAA CAGATGACGT CAAGTAA
|
Protein sequence | MQPEMKWVLN KMPQSEDRNL QVMSLENVKK ARAFHKSFPQ YAVTPLANLE GMASNLGLGG LYVKDESYRF GLNAFKVLGG SFAMARYIAD ETGKDVSDCD FEYLTSEQLQ KDFGQATFFT ATDGNHGRGV AWAANRLGQK AVVHMPKGST KTRFDNIAKE GAQVTIEELN YDDCVRLAAK EAEETEHAVI VQDTAWDGYE KIPSWIMQGY GTMAYEAAEQ LRELEVNRPT HVFIQAGVGS LASAMVGYFT NLFPSNPPKF VIMEAGAADC LYKGALAADG EPRIVGGDLI TIMAGLACGE PNTIGWDILR NHATAFISCP DWVSAKGMRM LAAPVKGDPS VVSGESGAVG MGVISTLMTD PAYKELRDAL DLTTDSKVLL FSTEGDTDPV RYEEIVWDGA WQSTDDVK
|
| |
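The GC content and the protein sequence can both be derived from the gene sequence. The following is a minimal sketch using only the Python standard library. It assumes the sequence is pasted as shown in this record (uppercase bases in space-separated blocks of ten) and builds the codon table from the standard genetic code; translation table 11 uses the same amino acid assignments as the standard code and differs only in which start codons are permitted, and since this gene starts with ATG the sketch does not handle alternative start codons. The helper names are hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
prefix = "ATGCAGCCAG AAATGAAGTG GGTACTTAAT"
print(translate(prefix))  # prints "MQPEMKWVLN", the first ten residues of the protein
```

Passing the full gene sequence to `gc_percent` and `translate` allows the GC content and Protein sequence fields to be compared with the values computed from the nucleotide sequence.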