Gene Avin_25440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_25440 
Symbol 
ID7761457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2551011 
End bp2552282 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content63% 
IMG OID643805428 
ProductPeptidase M19, renal dipeptidase.PvdM-like protein 
Protein accessionYP_002799701 
Protein GI226944628 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCAAGG CCAATGACTT GCAGGAAAGG ATCCTGTCCT TCGACGCCCA TATCGATCTT 
CCCCTGGAAT ACGGAAGCGG GGGTATGGAA GCCGACCGCG ACGGTCGCAC GCAGTTCGAT
CTGGTCAAGG CCGCTCGTGG CCGCCTGAGC GGAGCGGCGC TCACCATCTG GGCCTGGCCG
GAGTTCTGGA TCGGCCCCAA TGCTCCGCAC CGGCCGACAC CCGGTTTCGT CGAGGCGGCG
CGACATGAAC AAGAGGCCCG CTACCGGATC ATCACCGGCA TCGCCCGCGA CTATCCTGAA
CGGGCCGGCA TCGCTTATAG CCCTGCCGAT TTCCGCCGTC TGGCCCACGA GGGCAAGTTC
GCCATCGTCA TCAGCATGCT CAATGCCTAT CCGCTGGGCG ACGAGGTCTC GCGGCTGGAC
GACTGGGCCG CACGTGGCAT GCGCATCTTC GGCTTCAACT ATGTGGCCAA CAACACCTGG
TCGGACTCCT CCCGCCCCAT GCCCTTCTAC GGCGACTCGC CCGACGAGCA TGGCGGCCTC
TCCGAGTTGG GACGACAGGC CGTGAGACGC CTGAACGACC TGGGCGTGGT CATCGATGTC
TCACAGATGT CCAGCAGCGC CCTCGAACAG GTCACCGATC TCAGCCGCGC CCCGATCATC
GCTTCGCACT CAGGCATACG CGGTCTGGTG GACATACAGC GCAATCTGAC CGACCGGGAG
CTGCGCCTGA TCCAGAAGAC CGGCGGTGTG GTTCATATCG CCGGCTTCTC TTCCTACCTG
CGTCCCTTCT CCAAGGAAAC CCTCGCCAAG GTCAACGCCA TGCGCACAGG CTTCGGCCTG
CTGGATGTCG AGAACCTCTC CCAGGCCAGC ATGCCGGCCG ATCCGGTGTT CTCGATCTGG
CCTGAGAAGC GCTTCGGCGA GTACGCCAGC CAGCTCTATG CCATCCTCGA AACCGATCCA
CGGGCGGGCC TGAAGGAATT CGGCGATGCC ATCGACTATG CCGTGAAGAA GATAGGCATC
GACCACGTCG GCATCAGTTC CGACTTCAAC GATGGCGGCG GTGTGATCGG CTGGGAGAAC
GTCGGCGACG CCCGTAACGT CACCGCCGAA CTCATTCAGC GTGGCTACTC GGACGAGGAC
ATCGCCAAGC TGTGGGGTGG CAACTTCCTG CGGGTATGGG AGCAAGTGCA GAACGCCAGC
AGGCAGAGCC ACGACGACAT CACAACATCT CCATCTCAAG AAACGCCCAC TCAAAGGAGT
ACAACGCAGT GA
 
Protein sequence
MRKANDLQER ILSFDAHIDL PLEYGSGGME ADRDGRTQFD LVKAARGRLS GAALTIWAWP 
EFWIGPNAPH RPTPGFVEAA RHEQEARYRI ITGIARDYPE RAGIAYSPAD FRRLAHEGKF
AIVISMLNAY PLGDEVSRLD DWAARGMRIF GFNYVANNTW SDSSRPMPFY GDSPDEHGGL
SELGRQAVRR LNDLGVVIDV SQMSSSALEQ VTDLSRAPII ASHSGIRGLV DIQRNLTDRE
LRLIQKTGGV VHIAGFSSYL RPFSKETLAK VNAMRTGFGL LDVENLSQAS MPADPVFSIW
PEKRFGEYAS QLYAILETDP RAGLKEFGDA IDYAVKKIGI DHVGISSDFN DGGGVIGWEN
VGDARNVTAE LIQRGYSDED IAKLWGGNFL RVWEQVQNAS RQSHDDITTS PSQETPTQRS
TTQ