Gene Avin_39910 details

Gene Information

Locus tag: Avin_39910
Symbol:
ID: 7762880
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 4042724
End bp: 4044286
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 73%
IMG OID: 643806853
Product: metal dependent hydrolase
Protein accession: YP_002801105
Protein GI: 226946032
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
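
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    codons, remainder = divmod(gene_length, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The last codon is the stop codon, so it adds no amino acid.
    return gene_length, codons - 1


# Coordinates copied from the record above.
gene_bp, protein_aa = cds_lengths(4042724, 4044286)
print(gene_bp, protein_aa)  # 1563 520
```

Both values match the Gene Length and Protein Length fields of the record.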


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.776901
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCCTCGAC GCGGCTTCCT CTCCAGGCTT GCAGCGATTA CCGCGGGTTT CGCCCATGCC 
CGCCCCGCTT TCGCGCAGTC GCTGGCATCC GGCCCGCTCG GGGCCCGCGT CATCCCGTCG
ACCGGCAGGC GCTCGCCGGC GATTGGCACG GGCGGTTCCG GGCGTCTCGA GCGGGAGCGC
GGCCAGGCTT CGCCGCCGAA AGCCATGAGG CGCCTGCTGT CCCTCCTGCT GACCGCCTGC
CTGCCCTGGC AGGCAGAAGC CGCGACCGAG GCCGAGGCGC TGCTCGTACA CGGCGGCTAC
GTGATGACGA TGGACCCGAC ACTGGGCGAC ATCGACGGCG GCGAGGTACT GATCCGCGAC
GGCCGCATCG TCGCCGTCGG CCGCGGCCTG GACGCCGGCG ACGCCCACCG CATCGACGCG
CGCGGGCAGG TCGTGCTGCC GGGCTTCGTC GACACGCATT CGCACCTGTA CGTGACCACG
ATGCGCGGGC AGTTCCGCAA CCGCGACGGG CAGTTCTTCC CGGTGAGTTC GCGCCTGGCC
GCGGCCATGA CGCCGGAGGA CACCCGTACC GCCATGCAGC TCGGCGCCCT GGAACTGCTG
CAGGGCGGCA TCACCACCAC CGCCGACTTC TTCGACAACA TCCTCACCCC GGCGCACGGC
GAGGCCGGAG TGCAGGCGCT GGAGGCGTCC GGCATCCGCG CGGTGATGTA CTACGGCGGT
CCGGACAAGA CCACCCGCCA TCCCATCGAC CTCGCGCAGT TGCGCGCCCT GGCCGAACGC
CGGGGCAAGG ACGCGCGGGT ACGAATCGGC CTGGCCTGGC GCTTGCCGCG CGATCGCGGG
GATGCGGACA ACTGGGCGAT GCGCCAGCGC GAATACGACA CCGCGCGCGG CCTCGGCCTG
CCGATCCAGG TGCACGTCAG CGGCGAGCCC GCCCCGATGT TCGAGGCGCT GATCCAGCGC
GATTACCTGT TTCCCGGCCT GACCGTCGTG CATGCCACCG ATGCCGGCCA GGAGCGGCTG
CAGGCACTCG AACGGGCCGG CGGCGGCCTG GCGCTGACAC CGCCGAGCGA GCAGCGCGTC
GGCTACGGGC TGACCCGGCT GGACCACTTC GCCACGGTGA CCCGGCAGGG CCTGGGCATC
GACGGCAATT CGCTGGCCGG CAGCGCCGAC ATGTTCGCCA CGCTGCGACT GGCGGCGCTG
ACCTGGAGCG GCGGCGCGCG GGACGAGCGG GCGCCCGCTC CGCGCGCGCT GCTGGAACTG
GCCACCCGCC GTGGCGCCGA GGCCGTGGGC CTGGGCGACG AGGTCGGTAC GCTGGCGCCG
GGCAAGCGCG CCGACCTGCA GGTCATCGAT CCGGCTGCGC TGAATCTGGG CGGCTTCGGC
GGCGGCGACC CGGCCGCGCT GCTGGTCTAT TCGGCGCGCC CGGACAACGT CCGCACGGTG
CTGGTCGACG GCCGCCTCGT CAAGCGGGAC GGTCAACCGG TCGGCGTGGA CGCGGCGGAC
CTGCTGGAGC GCGCCCGGCG CTCCGCCCGG GACCTGCTTG ACCGCAGCCG ATCCTCTCCC
TGA
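
The GC content reported in the Gene Information section can be recomputed from a sequence laid out like the one above (blocks of ten bases separated by spaces, several blocks per line). This is a minimal sketch; whether the record's own figure was derived exactly this way is an assumption, and `gc_fraction` is a hypothetical helper name.

```python
def gc_fraction(formatted_seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring all whitespace."""
    seq = "".join(formatted_seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Paste the full gene sequence between the triple quotes to check the record;
# only the first line is shown here to keep the example short.
first_line = """
ATGCCTCGAC GCGGCTTCCT CTCCAGGCTT GCAGCGATTA CCGCGGGTTT CGCCCATGCC
"""
print(f"{gc_fraction(first_line):.0%}")
```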
 
Protein sequence
MPRRGFLSRL AAITAGFAHA RPAFAQSLAS GPLGARVIPS TGRRSPAIGT GGSGRLERER 
GQASPPKAMR RLLSLLLTAC LPWQAEAATE AEALLVHGGY VMTMDPTLGD IDGGEVLIRD
GRIVAVGRGL DAGDAHRIDA RGQVVLPGFV DTHSHLYVTT MRGQFRNRDG QFFPVSSRLA
AAMTPEDTRT AMQLGALELL QGGITTTADF FDNILTPAHG EAGVQALEAS GIRAVMYYGG
PDKTTRHPID LAQLRALAER RGKDARVRIG LAWRLPRDRG DADNWAMRQR EYDTARGLGL
PIQVHVSGEP APMFEALIQR DYLFPGLTVV HATDAGQERL QALERAGGGL ALTPPSEQRV
GYGLTRLDHF ATVTRQGLGI DGNSLAGSAD MFATLRLAAL TWSGGARDER APAPRALLEL
ATRRGAEAVG LGDEVGTLAP GKRADLQVID PAALNLGGFG GGDPAALLVY SARPDNVRTV
LVDGRLVKRD GQPVGVDAAD LLERARRSAR DLLDRSRSSP
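
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. A minimal sketch of that translation follows. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as start codons; since this gene begins with ATG, that difference is not handled here. The name `translate` is a hypothetical helper, not part of any particular library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional T, C, A, G codon ordering; '*' marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(formatted_seq: str) -> str:
    """Translate a whitespace-formatted DNA sequence up to the first stop codon."""
    seq = "".join(formatted_seq.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCCTCGAC GCGGCTTCCT CTCCAGGCTT"))  # MPRRGFLSRL
```

The output matches the first ten residues of the protein sequence in the record.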