Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_34980 |
Symbol | |
ID | 7762393 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3570755 |
End bp | 3571972 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643806364 |
Product | Glutamyl aminopeptidase familiy M42 |
Protein accession | YP_002800622 |
Protein GI | 226945549 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | [TIGR03106] hydrolase, peptidase M42 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGAAC ATGCCCTTGG AGTCCCCATG TCCAGCCCAC TCCCAGAACC CGATCTCGAC TACCTGCAGC GCGTGCTGCT GGAAATGCTC GCCATCCCCA GCCCGACCGG CTTCACCGAC ACCATCGTGC GCTATGTCGC CGAACAACTG GAGGAACTCG GCATTCCCTT CGAACTGACC CGCCGCGGCA CCATCCGCGG AACCCTCAAG GGACGCCGCT ACAGTCCGGA CCGGGCGCTC GCGGTACACC TCGACACCAT CGGCGCCATC GTCCGCGAGA TCCACGCCAA CGGCCGCATC GGTCTGGCGC CGGTGGGCTG CTGGTCGAGC CGCTTCGCCG AGGGCAGCCG GGTCAGCCTG TTCAGCGACC GCGGCGTGCT GCGCGGCAGC GTGCTGCCGC TACTGGCCTC GGGACATACC TTCAACACCC AGGTCGACCA GATGCCGATC AGTTGGGACC ATGTCGAGCT GCGCCTCGAC GCCATGACCG CCAGCCTGGC CGAAACCCAG GCCCTGGGGG TGGCGGTAGG CGATTTCGTC GCCTTCGATC CGCTGCCGGA GTTCACCGAG AGCGGCCACA TCAGCGCCCG CCACCTGGAC GACAAGGCCG GTGCCGCCGC CCTGCTCGCC GCGCTGAAGA GCGTGCTCGA CAGTGGCCAG GAGCCGCCGA TCGACTGCCA TCCGCTGTTC ACCATCACCG AGGAAACCGG CTCCGGCGCG GCGGCGGCAC TGCCCTGGGA CGTCAGCGAA TTCGTCGGCA TCGACATCGC GCCGGTCGCT CCCGGCCAGC AGTCCTGCGA ACGGGCGGCG ACCGTCGCCA TGCAGGACTC CGGCGGCCCC TACGACTACC ATCTGACGCG CCACCTGCTG CGCCTGGCGG AACATCACGC GATCCCCGTA CGCCGCGACC TGTTCCGCTA CTACCACAGC GACGCCCAGT CGGCGGTGAC CGCTGGCCAC GACATCCGCA CCGCCCTGCT GGCTTTCGGC TGCGACGCCA CCCATGGCTA CGAGCGTACC CATATCGACG GGCTGGCCGC GCTGAGCCGC CTGATCGGCG CCTATCTGCT CAGCCCGCCG GTGTTCGCCA GCGATGCCAA ACCGCAGAGC GGCTCCCTGA AACGCTTCAG CCGCCAGCTC GAACATGCGG CCCAGATGGA AAGCGAAACG CGGGTTCCCG CCGTGGACAG CCTGCTCAAG CACGAACCGG ATGCTTGA
|
Protein sequence | MREHALGVPM SSPLPEPDLD YLQRVLLEML AIPSPTGFTD TIVRYVAEQL EELGIPFELT RRGTIRGTLK GRRYSPDRAL AVHLDTIGAI VREIHANGRI GLAPVGCWSS RFAEGSRVSL FSDRGVLRGS VLPLLASGHT FNTQVDQMPI SWDHVELRLD AMTASLAETQ ALGVAVGDFV AFDPLPEFTE SGHISARHLD DKAGAAALLA ALKSVLDSGQ EPPIDCHPLF TITEETGSGA AAALPWDVSE FVGIDIAPVA PGQQSCERAA TVAMQDSGGP YDYHLTRHLL RLAEHHAIPV RRDLFRYYHS DAQSAVTAGH DIRTALLAFG CDATHGYERT HIDGLAALSR LIGAYLLSPP VFASDAKPQS GSLKRFSRQL EHAAQMESET RVPAVDSLLK HEPDA
|
| |