Gene Avin_22260 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_22260 
Symbol 
ID7761144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2224341 
End bp2225639 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content70% 
IMG OID643805111 
Productguanine deaminase 
Protein accessionYP_002799392 
Protein GI226944319 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02967] guanine deaminase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.119985 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGCCT ACCGCGCCGC CCTGCTCCAC TGTCTCGCCG ATCCCCGCGA GGTCGGCATC 
GAGCGTTCGT ACCAGTATTT CGAGGACGGC CTGCTGCTGG TCGAGAACGG CAGGATCGTC
CGGATCGGCG CCGCCGCCGA GCTGCTGCCG GGCCTGCCGG CCGGGGTCGG GGTGGCCGAG
TACCGCGATG CGCTGATCGT CCCCGGCTTC GTCGACACCC ACATCCATTA CCCGCAACTG
GACGTGATCG CCTCCTACGG CAGCCAGTTG CTGGAGTGGC TGGAAACCTA CACCTTCCCC
GCCGAGGCGC GCTTCGCCGA CCCGGCGCAC GCCCGCGCCC AGGCGCGCCT GTTCCTCGCC
GAGCTGTTGC GCAACGGCAC CACCACGGCG CTGGTGTTCG CCACCGTGCA TCCGCAGTCG
GTGGACGCCT TCTTCGAGGA GGCCAGCCGG CTCGATCTGC GGATGATCGC CGGCAAGGTG
CTGATGGACC GCAACGCCCC GGACGGACTG CGCGACAGCG CCGCCTCCGG CTACGCCGAG
AGCCGCGCGC TGATCGAACG CTGGCACGGC AAGGGCCGCC TGCACTACGC AGTCACCCCG
CGCTTCGCGC CGACCAGCAC GCCCGGACAG CTCGACCTGG CCGGCCGGCT GCTGCGCGAA
TACCCCGGCC TCTACCTGCA CACCCACCTG TCCGAGAACC GCGCGGAGAT CGACTGGGTG
AAGGAACTGT TCCCCGAGCG CCGGCATTAC CTGGACGTCT ACGACCACCA CCGCCTGCTC
GGCGAGCGCT CGGTGTTCGC CCACGGCGTC CACCTCTGCG ACGACGAGTG CCGGCGGCTC
GGCGAGAGCG GCTCGGCGGT GGCCTTCTGC CCGACCTCCA ACCTGTTCCT CGGCAGCGGC
CTGTTCGACC TGGCCCGGCT GGAAGGCCAC GGCGTGCGCG TCGGCCTGGG CACCGACGTC
GGCGGCGGCA CCAGCTTCTC CCAGTTGCAG AGCCTCAACG AGGCCTACAA GGTGCTGCAG
TTGCAGGGGC AGAAACTCGA CCCGTTCAAG GCGCTGTACC TGGCCACCCT CGGCGGCGCC
AGGGCGCTCT ACCTGGACGA GCGCATCGGC AACCTGCAGC CGGGCAAGGA CGCCGACTTC
GTGGTGCTGG ACTGCAAGGC CACGCCGCTG CTCGCCCGCC GTCTGGAACA GGCGCGCAGC
CTCGCGGAAA GGCTGTTCGC GCTGATGATC CTCGGCGACG ACCGCGCGGT GCGGGAAACC
TTCGCCGCCG GGCGTTCGGT GCACCGGCGC GACGTCTGA
 
Protein sequence
MQAYRAALLH CLADPREVGI ERSYQYFEDG LLLVENGRIV RIGAAAELLP GLPAGVGVAE 
YRDALIVPGF VDTHIHYPQL DVIASYGSQL LEWLETYTFP AEARFADPAH ARAQARLFLA
ELLRNGTTTA LVFATVHPQS VDAFFEEASR LDLRMIAGKV LMDRNAPDGL RDSAASGYAE
SRALIERWHG KGRLHYAVTP RFAPTSTPGQ LDLAGRLLRE YPGLYLHTHL SENRAEIDWV
KELFPERRHY LDVYDHHRLL GERSVFAHGV HLCDDECRRL GESGSAVAFC PTSNLFLGSG
LFDLARLEGH GVRVGLGTDV GGGTSFSQLQ SLNEAYKVLQ LQGQKLDPFK ALYLATLGGA
RALYLDERIG NLQPGKDADF VVLDCKATPL LARRLEQARS LAERLFALMI LGDDRAVRET
FAAGRSVHRR DV