Gene Avin_34980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_34980 
Symbol 
ID7762393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3570755 
End bp3571972 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content68% 
IMG OID643806364 
ProductGlutamyl aminopeptidase familiy M42 
Protein accessionYP_002800622 
Protein GI226945549 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID[TIGR03106] hydrolase, peptidase M42 family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGAAC ATGCCCTTGG AGTCCCCATG TCCAGCCCAC TCCCAGAACC CGATCTCGAC 
TACCTGCAGC GCGTGCTGCT GGAAATGCTC GCCATCCCCA GCCCGACCGG CTTCACCGAC
ACCATCGTGC GCTATGTCGC CGAACAACTG GAGGAACTCG GCATTCCCTT CGAACTGACC
CGCCGCGGCA CCATCCGCGG AACCCTCAAG GGACGCCGCT ACAGTCCGGA CCGGGCGCTC
GCGGTACACC TCGACACCAT CGGCGCCATC GTCCGCGAGA TCCACGCCAA CGGCCGCATC
GGTCTGGCGC CGGTGGGCTG CTGGTCGAGC CGCTTCGCCG AGGGCAGCCG GGTCAGCCTG
TTCAGCGACC GCGGCGTGCT GCGCGGCAGC GTGCTGCCGC TACTGGCCTC GGGACATACC
TTCAACACCC AGGTCGACCA GATGCCGATC AGTTGGGACC ATGTCGAGCT GCGCCTCGAC
GCCATGACCG CCAGCCTGGC CGAAACCCAG GCCCTGGGGG TGGCGGTAGG CGATTTCGTC
GCCTTCGATC CGCTGCCGGA GTTCACCGAG AGCGGCCACA TCAGCGCCCG CCACCTGGAC
GACAAGGCCG GTGCCGCCGC CCTGCTCGCC GCGCTGAAGA GCGTGCTCGA CAGTGGCCAG
GAGCCGCCGA TCGACTGCCA TCCGCTGTTC ACCATCACCG AGGAAACCGG CTCCGGCGCG
GCGGCGGCAC TGCCCTGGGA CGTCAGCGAA TTCGTCGGCA TCGACATCGC GCCGGTCGCT
CCCGGCCAGC AGTCCTGCGA ACGGGCGGCG ACCGTCGCCA TGCAGGACTC CGGCGGCCCC
TACGACTACC ATCTGACGCG CCACCTGCTG CGCCTGGCGG AACATCACGC GATCCCCGTA
CGCCGCGACC TGTTCCGCTA CTACCACAGC GACGCCCAGT CGGCGGTGAC CGCTGGCCAC
GACATCCGCA CCGCCCTGCT GGCTTTCGGC TGCGACGCCA CCCATGGCTA CGAGCGTACC
CATATCGACG GGCTGGCCGC GCTGAGCCGC CTGATCGGCG CCTATCTGCT CAGCCCGCCG
GTGTTCGCCA GCGATGCCAA ACCGCAGAGC GGCTCCCTGA AACGCTTCAG CCGCCAGCTC
GAACATGCGG CCCAGATGGA AAGCGAAACG CGGGTTCCCG CCGTGGACAG CCTGCTCAAG
CACGAACCGG ATGCTTGA
 
Protein sequence
MREHALGVPM SSPLPEPDLD YLQRVLLEML AIPSPTGFTD TIVRYVAEQL EELGIPFELT 
RRGTIRGTLK GRRYSPDRAL AVHLDTIGAI VREIHANGRI GLAPVGCWSS RFAEGSRVSL
FSDRGVLRGS VLPLLASGHT FNTQVDQMPI SWDHVELRLD AMTASLAETQ ALGVAVGDFV
AFDPLPEFTE SGHISARHLD DKAGAAALLA ALKSVLDSGQ EPPIDCHPLF TITEETGSGA
AAALPWDVSE FVGIDIAPVA PGQQSCERAA TVAMQDSGGP YDYHLTRHLL RLAEHHAIPV
RRDLFRYYHS DAQSAVTAGH DIRTALLAFG CDATHGYERT HIDGLAALSR LIGAYLLSPP
VFASDAKPQS GSLKRFSRQL EHAAQMESET RVPAVDSLLK HEPDA