Gene Information |
Locus tag | Avin_15770 |
Symbol | |
ID | 7760512 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 1546843 |
End bp | 1548180 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643804477 |
Product | N-ethylammeline chlorohydrolase |
Protein accession | YP_002798767 |
Protein GI | 226943694 |
COG category | [F] Nucleotide transport and metabolism; [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
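The length fields above follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length counts the stop codon (both assumptions, though they agree with the values in this record), the arithmetic can be checked as follows. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1546843, 1548180)
print(length, protein_length(length))  # prints "1338 445"
```

The strand does not affect the length, so the same calculation applies to this minus-strand gene.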
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGACG CCACCGCTCC GCTCGACCTC CTGCTCCTGC CGACCTGGCT GGTGCCGGTC GAGCCCGCCG GTGTCGTGCT GCACGATCAT GCCCTGGGCA TCCGCGACGG CCGCATCGCC CTGCTCGCCC CCCGCGACGC CGCGCTGCGC CATGCCGCCC GGGAAACCCG CGAACTGCCC GGCATGCTCC TCGCCCCCAG CCTGGTGAAC GCTCACGGCC ATGCCGCGAT GACCCTGTTC CGCGGCCTGG CCGATGACCT GCCGCTGATG ACTTGGCTGG AAAAGCACAT CTGGCCGGCC GAGGCTCGCT GGGTATGCGA GGAATTCGTC CGCGACGGCA CCGAACTGGC TATCGCCGAG CAGCTCAAGG GCGGCATCGG CTGCTTTTCC GACATGTACT TCCATCCGGA AATCGCCAGC GACCCCATCC ACCAGAGCGG TATCCGCGCG CAACTGTGCA TCCCGGTGCT GGACTTCCCG ACTCCTGGCG CCCGCGATGC CGGCGAGGCG CTGCGCAAGG GCGTCGAGCT GCTCGAGGAT CTCCGCCATC ACCCGCGGAT CCACGTCGCC TTCGGCCCCC ACGCCCCCTA TTCGGTGGGC GACGAAACCC TGGAGAGGAT ACGGGTGCTC GCCGAGGAGC TGGATGCGCT AATCCAAATG CACGTGCACG AAACCTCCCA CGAGATCGCC CGCGCCCTGG AACGCGACGG CGTGCGGCCG CTGGCCCGCC TGGCGCGCTG CGGCCTGCTC GGCCCGCGCT TCCAGGCGGT GCACATGACC CAGCTCGACG ACCAGGACCT GGCCCTGCTG GTGGAGAGCA ACAGCAGCGT GATCCACTGC CCCGAGTCCA ATCTCAAGCT GGCCAGCGGC TTCTGCCCGG TGGAGCGTCT CTGGCAGGCC GGCGTGAACG TGGCCGTCGG CACCGACGGC GCGGCGAGCA ACAACGATCT CGATCTGCTC GGCGAAACCC GCACCGCGGC TCTGCTGGCC AAGGCGGTGG CCGGCTCGGC CACCGCCTTG GACGCCCACC GCGCGCTGCG CATGGCCACC CTGAACGGTG CCCGCGCCCT GGGGCTGGAG ACCGAGACCG GTTCCCTGGA GCCCGGCAAG GCTGCCGACA TGGTCGCTTT CGACCTCTCC GGACTGGCCC AGCAGCCGGT CTACGACCCG GTCTCGCAAC TGATCTACGC CAGCGGCCGG GACTGCGTGC GGCACCTCTG GGTCGGTGGC CGGCAATTGC TGGACAACGG CCAGTTGACC CGCCTGGACG AGGAGCGCCT GAAGGACAAG GCTCGCGAAT GGAGCCGGCG CATCGGAGCC TCGGACGGCG CCCGCTGA
|
Protein sequence | MSDATAPLDL LLLPTWLVPV EPAGVVLHDH ALGIRDGRIA LLAPRDAALR HAARETRELP GMLLAPSLVN AHGHAAMTLF RGLADDLPLM TWLEKHIWPA EARWVCEEFV RDGTELAIAE QLKGGIGCFS DMYFHPEIAS DPIHQSGIRA QLCIPVLDFP TPGARDAGEA LRKGVELLED LRHHPRIHVA FGPHAPYSVG DETLERIRVL AEELDALIQM HVHETSHEIA RALERDGVRP LARLARCGLL GPRFQAVHMT QLDDQDLALL VESNSSVIHC PESNLKLASG FCPVERLWQA GVNVAVGTDG AASNNDLDLL GETRTAALLA KAVAGSATAL DAHRALRMAT LNGARALGLE TETGSLEPGK AADMVAFDLS GLAQQPVYDP VSQLIYASGR DCVRHLWVGG RQLLDNGQLT RLDEERLKDK AREWSRRIGA SDGAR
|
| |
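The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a record might be checked in Python is shown below: it strips the spacing, computes the GC fraction, and translates the coding sequence. It assumes the gene sequence is listed in coding orientation (it starts with ATG and ends with TGA, although the gene lies on the minus strand) and uses the standard amino-acid assignments, which translation table 11 shares with the standard code for internal codons. All function names are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
fragment = clean_sequence("ATGTCTGACG CCACCGCTCC GCTCGACCTC")
print(translate(fragment))  # prints "MSDATAPLDL"
```

Passing the full gene sequence through `clean_sequence` and `translate` should reproduce the protein sequence listed above, and `gc_fraction` can be compared against the reported GC content.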