Gene Avin_20820 details


Gene Information

Locus tag: Avin_20820
Symbol:
ID: 7761007
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 2074728
End bp: 2076083
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 68%
IMG OID: 643804977
Product: Lipolytic enzyme, G-D-S-L domain protein
Protein accession: YP_002799258
Protein GI: 226944185
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2755] Lysophospholipase L1 and related esterases
TIGRFAM ID:
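
The coordinates and lengths listed in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 2074728, End bp 2076083.
length = gene_length_bp(2074728, 2076083)
print(length, protein_length_aa(length))  # prints: 1356 451
```

Both results match the Gene Length and Protein Length fields of the record.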


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.6463
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATATTG AATTCACTTA TATGAAAAAT CTTTCCAAAA TCCTTCCCAG CGCCTCGACC 
ACCGCGATGC TGGCCCTGGT GCTCGCGACG GGCGTATCCG TGACCGCCGT TCCGGCGGCT
GCCGACCCTC CCGCCTGGGT GGCAACCTGG GGCGCCAGCC CCCAGCCCGT CTGGGGAGCG
GGCTTCCTGT TCCCCAGCAA CGTCCCTTCC GAATTGCACG ATCAGACGGT GCGCCAGGTG
GCGCGCGTCA GCCTGGGTGG ACAACGCCTG CGCATCGTGC TGTCCAACGC CTACGGCGGC
CAGCCTCTCG CCGTGGGGAA GGCTACGGTC GCACGGCCGC GTAGCGACGG TGCCGTTGCC
GCCGACAGCC TGCGCACCGT GACATTCGGT GGCCGGGAGG AGGCGACGAT CCTTCCCGGC
GCATCGCTGG TCAGCGATCC AGTGGCATTG CCCATCCCCG CGCTGGCACA GGTCGCGGTG
AGTCTTTATC TGCCGAAAGC GACACCGGTC GGCACCTTTC ACTGGGACGG CCGCCAGACC
GGCTGGATCG TCCCCGGCGA CCAGACCACG GCCCCGGCAT TCGAGACGGC GGAAGGCTGC
GCACGGAGCA CCACGACACG CCTGCTGCTG GCGGGGATTC AGGTCGAAGC CGAACACGCG
GTGCGAGCGG TCGTGGTGAT CGGCGACTCC ATTACCGACG GGGCCGCTGC CAGCCCGGAC
AAGGACAGCC GCTGGCCGGA CTTCCTGGCT GCGCGTCTGG CTCCGCATGG GGTGGCCGTC
GTCAATGCCG GCATCTCCGG TGCCCGGCTG CTGTCCGACG GCATGGGTGT CAATGCGCTG
GCACGGCTGG ATCGCGACGT ACTGGCGCAA CCGGGGGTGC GGAGCCTCGT CGTGATGCTG
GGCATCAACG ATATCGCCTG GCCGGGCACG GCCCTCGCGC CGGAAAGACC CCGACCGACA
CTGCAGGCAC TGACGGCTGG CTACCGCCAG TTGGCCGAGC AGGCTCGCAG CCGTGGGTTG
CGGGTGATCG GCGCAACGCT CACCCCGTTC GAGGGTGCGT TGCCCGGCAC GCCGCTGGAC
GACTACTACC ACCCCGACAA GGACGCCTTG CGCCAGCGGG TCAACGACTG GATTCGCCAC
GGCGGCGCGT TCGACGCGGT GATCGACCTC GATGCGGCAC TGCGCGATCC CGTCCATCCA
GCCCGGATCG ACGCGCACTT CGACTCCGGC GACCACCTGC ATCCCGGCGA CCAAGGCAAT
CGGGCGATGG CCGAAGCCGT CGATCTCGAT GTCTTGCTGC CGGGCCTCGG TGCCTCGCGG
GACAACACAA CGCCGAATAC ATCTCAGGAG CGCTGA
 
Protein sequence
MDIEFTYMKN LSKILPSAST TAMLALVLAT GVSVTAVPAA ADPPAWVATW GASPQPVWGA 
GFLFPSNVPS ELHDQTVRQV ARVSLGGQRL RIVLSNAYGG QPLAVGKATV ARPRSDGAVA
ADSLRTVTFG GREEATILPG ASLVSDPVAL PIPALAQVAV SLYLPKATPV GTFHWDGRQT
GWIVPGDQTT APAFETAEGC ARSTTTRLLL AGIQVEAEHA VRAVVVIGDS ITDGAAASPD
KDSRWPDFLA ARLAPHGVAV VNAGISGARL LSDGMGVNAL ARLDRDVLAQ PGVRSLVVML
GINDIAWPGT ALAPERPRPT LQALTAGYRQ LAEQARSRGL RVIGATLTPF EGALPGTPLD
DYYHPDKDAL RQRVNDWIRH GGAFDAVIDL DAALRDPVHP ARIDAHFDSG DHLHPGDQGN
RAMAEAVDLD VLLPGLGASR DNTTPNTSQE R
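
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The sketch below, in Python, shows one way to do that and then translate the nucleotide sequence or compute its GC fraction. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in their permitted start codons), and the helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (first base varies slowest, T, C, A, G).
# Assumption: table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Raises KeyError on ambiguity codes such as N; this sketch does not
    handle them.
    """
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGGATATTG AATTCACTTA TATGAAAAAT CTTTCCAAAA TCCTTCCCAG CGCCTCGACC"
print(translate(first_line))  # prints: MDIEFTYMKNLSKILPSAST
```

The translated fragment agrees with the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.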