Gene Avin_30170 details


Gene Information

Locus tag: Avin_30170
Symbol: codA
ID: 7761917
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 3125551
End bp: 3126789
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 68%
IMG OID: 643805889
Product: Cytosine deaminase protein
Protein accession: YP_002800157
Protein GI: 226945084
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
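
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def cds_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(cds_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


gene_len = cds_length_bp(3125551, 3126789)
print(gene_len)                     # 1239
print(protein_length_aa(gene_len))  # 412
```

Both values match the Gene Length and Protein Length fields of the record.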


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.831905
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGATCG TCAACGCCCG CCTGCGCGGC CGCGAAGGCC TCCAGCGCAT CGAACTGGAC 
GGCGCGCGCA TCGCCGCCAT AGCCGCGCAA CCGGCGCCCG CCGAGGCCGG CGGCGACGAG
TTGGACGCCG CCGGCAACCT GGTGGTACCG CCCTTCGTCG AACCGCACAT CCACCTGGAC
GCCACCCTCA CCGCCGGCGA GCCGGCCTGG AACATGAGCG GCACCCTGTT CGAAGGCATC
GAGCGCTGGG CCGAGCGCAA GGCGCTGGTC ACCCACGAGG ACATCAAGAC CCGGGCGAAG
AAGGCCATCG ACATGCTGGT CGAGCACGGC ATCCAGCACG TGCGCACCCA CGTCGACGTC
ACCGACCCGA CGCTGGCCGC GCTCAAGGCG ATGCTCGAGG TGCGCGAGGA AACCCGTCAC
CTGATCGACC TGCAGATCGT CGCCTTTCCC CAGGAAGGCA TCGAGTCCTA CCAGGGCGGC
CGCGAGCTGA TGACCGAGGC CATCGCCCTG GGCGCCGACG TGGTCGGCGG CATCCCGCAT
TTCGAGAACA CCCGCGAGCA GGGCGTCGGC TCGATCAAAT TCCTCATGGA CCTGGCCGAG
CGCACCGGCT GCCTGGTCGA CGTGCACTGC GACGAAACCG ACGATCCACA GTCGCGATTT
CTCGAGGTGC TCGCCGAGGA GGCACGGGTG CGCGACATGG GCGAGCGGGT CACCGCCAGT
CACACCACGG CCATGGGCTC ATGGGACAAC GCCTACTGCT CCAAGCTGTT CCGCCTGCTG
AAGCTGTCGC GGATCAACTT CGTCTCCTGT CCCACCGAGA GCATCCACCT GCAGGGGCGC
TTCGACACCT TTCCCAAGCG CCGCGGCCTG ACCCGGGTCG CCGAACTGGA CCGCGCCGGG
CTGAACGTCT GCTTCGGCCA GGACTCCATC GTCGATCCCT GGTATCCACT GGGCAACGGC
AACATCCTGC GCATCCTCGA AGCCGGCCTG CACATCTGCC ACATGCTCGG CTACGCCGAC
CTGCAACGCG CCCTCGATCT GATCACCGAG CACAGCGCCA AGGCCCTGCA CCTGGGCGAG
CGCTACGGCC TGGAAGTCGG GCGGCCGGCC AACCTGCTGA TCCTCTCGGC GGCCAACGAC
TACGAGATGT TGCGCAGCCA GGGCCACGCG CTGGTATCGA TCCGCCACGG GGAGATCCTG
ATGCGCCGCA CGCCGGCGCG GATCGAACGC CATCGCTGA
 
Protein sequence
MKIVNARLRG REGLQRIELD GARIAAIAAQ PAPAEAGGDE LDAAGNLVVP PFVEPHIHLD 
ATLTAGEPAW NMSGTLFEGI ERWAERKALV THEDIKTRAK KAIDMLVEHG IQHVRTHVDV
TDPTLAALKA MLEVREETRH LIDLQIVAFP QEGIESYQGG RELMTEAIAL GADVVGGIPH
FENTREQGVG SIKFLMDLAE RTGCLVDVHC DETDDPQSRF LEVLAEEARV RDMGERVTAS
HTTAMGSWDN AYCSKLFRLL KLSRINFVSC PTESIHLQGR FDTFPKRRGL TRVAELDRAG
LNVCFGQDSI VDPWYPLGNG NILRILEAGL HICHMLGYAD LQRALDLITE HSAKALHLGE
RYGLEVGRPA NLLILSAAND YEMLRSQGHA LVSIRHGEIL MRRTPARIER HR
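
As a sanity check, the protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal Python sketch, not the database's own method. It assumes that the amino-acid assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ in which start codons they allow), and the helper names are hypothetical. Only the first 30 and last 12 bases of the gene are translated here.

```python
from itertools import product

# Standard codon assignments in TCAG order; assumed to apply to table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAAGATCG TCAACGCCCG CCTGCGCGGC"))  # MKIVNARLRG
print(translate("CGCCATCGCTGA"))                      # RHR
```

The two outputs agree with the first ten and last three residues of the protein sequence shown above.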