Gene Apar_0450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0450 
Symbol 
ID8413299 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp518470 
End bp519558 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content45% 
IMG OID645022018 
Productlipolytic protein G-D-S-L family 
Protein accessionYP_003179472 
Protein GI257784255 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2755] Lysophospholipase L1 and related esterases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGTGG GAGAGTGGAG CATCAGTTCT CCAGCATCTA GGTTTTTACT TGGTGCGCCA 
TGGTCAGAAA AACTTGCTCA CGGGTGGGTT CATCCGCTGC GTATTTCTCC CAATCAGCTT
AGAGTTATTG GCTCTTGTTC TTCCTGGCAC CCCGGTCTTT TTAAACAGAT GGCAGCGTGC
ACTTCAGATA TTCAAGTTAC TTTTACCACG GACAGCTCAG AAGTGGTACT TGATCTAAAA
ATTGATGAAC TTCCTAAAGG CTCCTCTTCT GTTTTACAGT TACTTAAAGC TACGTATCTT
AAAAAGCTCA GCTCTGTATT TGTGACGGTT GATGGTAAAC CTTACAAGAA CTTCTCGCTG
GATGATGTAG GTGAACATAC ACTTTCTATT CATCTAGAAA CAGAAACCTC TGAAGATGAC
CTTGCTAGAC TTCCAGGTTT TAACGACACA CATCGCGTAA ATGTTTACTT GCCCTGTTTG
CAGAGCGCTT CTGTTAAAAA TCTCCGTGGC AATGGCACGT TTTTCTCGCC CGGTGAGCCT
AAGAAAAAAC TGGTGGTGTT TGGCGATTCC ATTGCGCAGG GTTTTGTTGT GGAGCGGCCA
GACAAAACTT GGCCGCACTG TCTTGCTAAG CGCATGAAGC TTGATGCGGT TAACCAGGGA
ATCGGTGGAC AAGTTTATCA GCCTGGCTCG TATGCGTTCA TTGAGGATGC CTCTTTAGTT
GTTGTTGCGC TTGGGGCAAA TTATCGATAC GAGAAATGCT CAAAGTCACA GGTAACATAC
GATATTCAAA ATTCACTCTG GCAGATTTCT CAGATGTATG CAGATACGCG CGTGGTGGTT
CTGACGCCCA CCCCTTATTT TGAGGATACG TATCCCACGC ACCCTTATTC TTGTTTTGCT
GAGGTTCCTC AAATAATTGA AGAGGTTGCT GGTAAGTTTG GTTTGCAGTG CATCTCTGGA
GAAAAGCTGT TGCCTCAGGA GAAGAAATAT TTTGCTGACG ATGTGCATCC AAATTCTAAG
GGTGCAGCAC TTTTAGCAAA GAATCTGTTT GAAGCCTTAA AACAGCCAGT TCAGGATAGT
CTTTTTTAG
 
Protein sequence
MEVGEWSISS PASRFLLGAP WSEKLAHGWV HPLRISPNQL RVIGSCSSWH PGLFKQMAAC 
TSDIQVTFTT DSSEVVLDLK IDELPKGSSS VLQLLKATYL KKLSSVFVTV DGKPYKNFSL
DDVGEHTLSI HLETETSEDD LARLPGFNDT HRVNVYLPCL QSASVKNLRG NGTFFSPGEP
KKKLVVFGDS IAQGFVVERP DKTWPHCLAK RMKLDAVNQG IGGQVYQPGS YAFIEDASLV
VVALGANYRY EKCSKSQVTY DIQNSLWQIS QMYADTRVVV LTPTPYFEDT YPTHPYSCFA
EVPQIIEEVA GKFGLQCISG EKLLPQEKKY FADDVHPNSK GAALLAKNLF EALKQPVQDS
LF