Gene Avin_41620 details

Gene Information

Locus tag: Avin_41620
Symbol:
ID: 7763043
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 4196391
End bp: 4198346
Gene Length: 1956 bp
Protein Length: 651 aa
Translation table: 11
GC content: 75%
IMG OID: 643807018
Product: peptidase
Protein accession: YP_002801267
Protein GI: 226946194
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
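
The coordinate and length fields in this record are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming 1-based inclusive coordinates (the usual convention for records like this, but not stated on the page) and a gene length that includes the stop codon. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the final codon is the stop, not a residue


length = gene_length(4196391, 4198346)
print(length, protein_length(length))  # 1956 651
```

Both values match the Gene Length and Protein Length fields of this record.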


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCCCACGC ATTCCATGCC CTACGGTTCC TGGCCCAGCC GCTGGTCGGC GGCGAATGCC 
GCCGCGGCCA GCCGCGATTT CGCTGGGCTG CAGGCCGGCC TCGGCGGCCT GGTCTGGCTG
GAATACCGCC CGGAAGACGG GCGTTGCCGC CTGTGGCTGT GGCGGGACGG CACGACGCGC
TGCCTGACCC CGGCGAGACA TTCGGTGCGC AGCCGGATCT ACGAATACGG CGGCGGGGCC
TTCTGCATCG TCGACGATGG CGTGGCCTGG GTCGACGAGT CCGACCAGCA GGTGTACCGC
ACGCGCCTCG CCGAGGCTCC CGTCGCCGAG GCGCTGACCG CACAGGCCCG GCGCCGCTAC
GGTGACCTGC ACCACGCGCC GGCCTGGAAC GCCGTGCTGG CGGTCGAGGA AAGCCACGCG
GCGAAGGGAG TGGCGCACCG GCTGGTCGCC CTGTCGCTGC ACGACGGCGC GCGCCGGGTG
CTGGCCGAGG GCGCCGATTT CTACGCCGCG CCGAAGCTGA GCGCCGACGG CCGGCGCCTG
TCCTGGATCG AGTGGGAGCG TCCCGAACTG CCCTGGACGG CCACCCGCCT GTGCCTGGCC
GAGGTGGCGG CGGACGGCCG GCTCGGCGAG CCCGTGACCC TGGCCGGCGG CGAGGGCGGC
GAGGCGTTGC AGCAGCCCTG TTTCGCCGCC GACGGAAGCC TGTGGTGCCT CACCGACCGC
GCCGGCTGGT GGCAGCCCTG GCGGGAGCAG GGCGGCCGGC TGCGGGCTGT CGAGCGGACG
GACGAGCGCG CGGAGCGCAG GGGAGACGAC GGCGCGGCCC TGTCCGCCGC TGTCATGGAG
TGGAGCGAGC CGCGCCCGCT GGCCGCCGCC GACCATGCCC CGGCGCCCTG GCAGCAGGGC
GCGGTCAGCT ACCTGCCGCT GGCGGACGGC GGCCTGCTGC TCGCCCGCCA GGAGGAGGGC
TGGGGATTCC TGATCGAGCG CGATGCCGCG GGCCGCGAGC GCTATCTGGC GGCGGAGTTC
AGCCGCTTTC GCCAACTGGC GGCGGACGCG GACTGTTTCT ACTGCATCGC CGCCTCTCCG
GCGCGGCTGC CGGCGGTGCT GGCGATCGAA CGGGCCGGCG GGACGCCGCG GGTGCTCGCC
GGGGGCGAGG CGCCGCTGGC CGAGGCCGAG CTGTCGCGGC CGCAGTCGCT GCGCTTCGCC
ACGGGCGAGG GCGAGTCCGC CCAGGCGTTC TTCTATCCGC CGCGCAACGC CGCCTGCCCG
GAGCCCGGGC ACGGACGGCC GCCGCTGGTG GTGTTCGTCC ACGGCGGACC GACTTCGGCC
TGCTATCCGG TGTTCGATCC GCGCATCCAG TTCTGGACCC AGCGCGGCTT CGCCGTGGCC
GATCTCAACT ACCGGGGGTC CAGCGGCTTC GGGCGCGCCT GTCGGCTGCG TCTCGCCGGC
GAGTGGGGAC GCATCGACGT GGAGGACGCC TGCGCCCTGG TGCACGACCT GGGCGAGGCG
GGGCTGGTCG ACCCGGCGCG GGCCTTCGTT CGCGGCGCCA GCGCCGGCGG CTACACCGCG
CTGTGCGCGC TGGCCTTCCG CGAATTGTTC CGGGGCGGCG CCAGCCTGTA CGGGGTCAGC
GATCCCGCGA GCCTGCGCCG GGTCACCCAC AAGTTCGAGG CCGACTACCT CGACTGGCTG
ATCGGCGATC CGCAGCGCGA CGCCGAGCGT TACCGGCAAC GCACGCCGCT GCTGCACGCC
GGGGAGATCG GGGCGCCGGT GATCTTCTTC CAGGGCGGCC TGGACGCGGT GGTGGTGCCG
CGGCAGACCG AGGCGATGGT CGCCGCCCTG CGCGCGCACG GAGTGCCGGT CGAGTACCGG
CTCTATCCCG ACGAGCGCCA CGGCTTCTGC CGGGCCGCCC ACCTGGCCGA CGCGCTGGAG
CGCGAGTGGC GTTTCTACCG GCGCCTGCTG GATTGA
 
Protein sequence
MPTHSMPYGS WPSRWSAANA AAASRDFAGL QAGLGGLVWL EYRPEDGRCR LWLWRDGTTR 
CLTPARHSVR SRIYEYGGGA FCIVDDGVAW VDESDQQVYR TRLAEAPVAE ALTAQARRRY
GDLHHAPAWN AVLAVEESHA AKGVAHRLVA LSLHDGARRV LAEGADFYAA PKLSADGRRL
SWIEWERPEL PWTATRLCLA EVAADGRLGE PVTLAGGEGG EALQQPCFAA DGSLWCLTDR
AGWWQPWREQ GGRLRAVERT DERAERRGDD GAALSAAVME WSEPRPLAAA DHAPAPWQQG
AVSYLPLADG GLLLARQEEG WGFLIERDAA GRERYLAAEF SRFRQLAADA DCFYCIAASP
ARLPAVLAIE RAGGTPRVLA GGEAPLAEAE LSRPQSLRFA TGEGESAQAF FYPPRNAACP
EPGHGRPPLV VFVHGGPTSA CYPVFDPRIQ FWTQRGFAVA DLNYRGSSGF GRACRLRLAG
EWGRIDVEDA CALVHDLGEA GLVDPARAFV RGASAGGYTA LCALAFRELF RGGASLYGVS
DPASLRRVTH KFEADYLDWL IGDPQRDAER YRQRTPLLHA GEIGAPVIFF QGGLDAVVVP
RQTEAMVAAL RAHGVPVEYR LYPDERHGFC RAAHLADALE REWRFYRRLL D
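
The protein sequence follows from the gene sequence under translation table 11, the table listed for this gene. The sketch below is a minimal, dependency-free illustration and not the pipeline that produced this record. It assumes the standard codon assignments (which table 11 shares) and that a non-ATG initiation codon such as GTG is read as methionine. The `translate` helper is a hypothetical name, and only the first 30 and last 36 bases of the gene are used to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna: str, is_cds_start: bool = False) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        # Assumption: the initiation codon (GTG here) is read as Met.
        residues.append("M" if is_cds_start and i == 0 else aa)
    return "".join(residues)


first = "GTGCCCACGC ATTCCATGCC CTACGGTTCC"        # first 30 bases of the gene
last = "CGCGAGTGGC GTTTCTACCG GCGCCTGCTG GATTGA"  # final 36 bases, in frame
print(translate(first, is_cds_start=True))  # MPTHSMPYGS
print(translate(last))                      # REWRFYRRLLD
```

The two outputs match the first ten and last eleven residues of the protein sequence listed in this record.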