Gene Apar_0239 details

Gene Information

Locus tag: Apar_0239
Symbol:
ID: 8413087
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 280803
End bp: 282029
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 44%
IMG OID: 645021807
Product: diaminopropionate ammonia-lyase
Protein accession: YP_003179262
Protein GI: 257784045
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID: [TIGR01747] diaminopropionate ammonia-lyase family
            [TIGR03528] diaminopropionate ammonia-lyase
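
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 280803
end_bp = 282029
gene_length_bp = 1227
protein_length_aa = 408


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1227
print(expected_protein_length(gene_length_bp))  # 408
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.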


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000784643
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.351019
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCCAG AAATGAAGTG GGTACTTAAT AAGATGCCTC AGAGTGAAGA CAGAAATCTT 
CAGGTCATGT CACTTGAAAA TGTTAAGAAG GCTCGCGCAT TTCATAAAAG CTTTCCTCAA
TATGCCGTGA CACCTTTGGC AAATCTTGAG GGTATGGCCT CTAATTTAGG TCTTGGTGGA
TTATACGTAA AAGATGAGTC GTATCGCTTT GGACTTAACG CATTTAAAGT CCTGGGTGGT
TCATTTGCTA TGGCTCGCTA TATTGCTGAT GAAACAGGAA AAGATGTTTC TGATTGCGAC
TTTGAGTATC TGACTTCAGA GCAATTGCAG AAAGACTTTG GACAGGCAAC TTTCTTTACT
GCAACCGATG GTAACCATGG CCGCGGTGTG GCATGGGCAG CAAATCGTCT TGGTCAAAAA
GCTGTTGTTC ATATGCCTAA AGGCTCTACA AAAACTCGTT TTGACAATAT TGCAAAAGAG
GGCGCACAAG TAACCATTGA AGAGCTCAAT TACGATGACT GTGTTCGTCT CGCAGCAAAA
GAAGCAGAAG AGACTGAGCA TGCAGTAATC GTACAGGACA CTGCTTGGGA TGGTTATGAG
AAGATTCCAT CTTGGATTAT GCAGGGTTAC GGTACTATGG CCTATGAGGC AGCAGAACAG
CTACGTGAGC TTGAGGTAAA CCGTCCAACA CATGTATTTA TACAAGCAGG CGTTGGATCT
TTGGCATCTG CCATGGTAGG CTACTTTACT AATTTATTCC CTTCAAATCC TCCTAAGTTT
GTCATTATGG AAGCAGGAGC TGCAGATTGT CTGTATAAGG GAGCACTTGC AGCAGATGGC
GAGCCTCGTA TTGTTGGTGG AGATTTGATT ACTATTATGG CTGGTCTTGC CTGTGGTGAG
CCAAATACCA TTGGTTGGGA TATTTTACGC AATCATGCGA CAGCCTTTAT TTCTTGTCCT
GATTGGGTTT CGGCAAAGGG CATGCGCATG CTCGCTGCTC CTGTTAAGGG AGACCCTTCT
GTTGTTTCCG GTGAGTCTGG CGCTGTTGGT ATGGGTGTGA TTTCTACTCT TATGACAGAC
CCTGCATACA AAGAGCTTCG TGATGCGCTT GATCTTACAA CAGATTCAAA GGTTCTTTTG
TTCTCTACAG AAGGAGATAC TGATCCTGTG CGTTATGAAG AGATTGTATG GGACGGTGCA
TGGCAGTCAA CAGATGACGT CAAGTAA
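
A GC content figure such as the one in the record can be recomputed from a gene sequence printed in space-separated blocks of ten. This is a minimal sketch: the helper name `gc_content` is hypothetical, and how the source database rounds its percentage is not known.

```python
def gc_content(seq):
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Small illustrative input, not the gene itself.
print(gc_content("ATGC GGCC"))  # 75.0
```

Passing the full gene sequence block as one string yields a percentage that can be compared with the GC content field.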
 
Protein sequence
MQPEMKWVLN KMPQSEDRNL QVMSLENVKK ARAFHKSFPQ YAVTPLANLE GMASNLGLGG 
LYVKDESYRF GLNAFKVLGG SFAMARYIAD ETGKDVSDCD FEYLTSEQLQ KDFGQATFFT
ATDGNHGRGV AWAANRLGQK AVVHMPKGST KTRFDNIAKE GAQVTIEELN YDDCVRLAAK
EAEETEHAVI VQDTAWDGYE KIPSWIMQGY GTMAYEAAEQ LRELEVNRPT HVFIQAGVGS
LASAMVGYFT NLFPSNPPKF VIMEAGAADC LYKGALAADG EPRIVGGDLI TIMAGLACGE
PNTIGWDILR NHATAFISCP DWVSAKGMRM LAAPVKGDPS VVSGESGAVG MGVISTLMTD
PAYKELRDAL DLTTDSKVLL FSTEGDTDPV RYEEIVWDGA WQSTDDVK
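
The relationship between the two sequences can be checked by translating the gene sequence codon by codon. The sketch below builds the codon table from the compact NCBI-style ordering (bases in the order T, C, A, G). Translation table 11 assigns the same amino acids to codons as the standard table and differs in its permitted start codons; this gene starts with ATG, so no start-codon handling is included. The function name and the `to_stop` parameter are illustrative choices, not part of any library.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each codon (TTT, TTC, TTA, ...) to its one-letter amino acid code.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq, to_stop=True):
    """Translate a DNA sequence in frame 1; whitespace is ignored.

    Unknown codons become 'X'. With to_stop=True, translation ends
    at the first stop codon, which is not included in the result.
    """
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - len(cleaned) % 3, 3):
        aa = CODON_TABLE.get(cleaned[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGCAGCCAG AAATGAAGTG GGTACTTAAT"))  # MQPEMKWVLN
# Last four codons of the gene sequence, keeping the stop as '*'.
print(translate("GACGTCAAGTAA", to_stop=False))        # DVK*
```

The first ten residues and the final residues match the start and end of the protein sequence listed in the record.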