Gene Avin_15770 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_15770 
Symbol 
ID7760512 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1546843 
End bp1548180 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content70% 
IMG OID643804477 
ProductN-ethylammeline chlorohydrolase 
Protein accessionYP_002798767 
Protein GI226943694 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGACG CCACCGCTCC GCTCGACCTC CTGCTCCTGC CGACCTGGCT GGTGCCGGTC 
GAGCCCGCCG GTGTCGTGCT GCACGATCAT GCCCTGGGCA TCCGCGACGG CCGCATCGCC
CTGCTCGCCC CCCGCGACGC CGCGCTGCGC CATGCCGCCC GGGAAACCCG CGAACTGCCC
GGCATGCTCC TCGCCCCCAG CCTGGTGAAC GCTCACGGCC ATGCCGCGAT GACCCTGTTC
CGCGGCCTGG CCGATGACCT GCCGCTGATG ACTTGGCTGG AAAAGCACAT CTGGCCGGCC
GAGGCTCGCT GGGTATGCGA GGAATTCGTC CGCGACGGCA CCGAACTGGC TATCGCCGAG
CAGCTCAAGG GCGGCATCGG CTGCTTTTCC GACATGTACT TCCATCCGGA AATCGCCAGC
GACCCCATCC ACCAGAGCGG TATCCGCGCG CAACTGTGCA TCCCGGTGCT GGACTTCCCG
ACTCCTGGCG CCCGCGATGC CGGCGAGGCG CTGCGCAAGG GCGTCGAGCT GCTCGAGGAT
CTCCGCCATC ACCCGCGGAT CCACGTCGCC TTCGGCCCCC ACGCCCCCTA TTCGGTGGGC
GACGAAACCC TGGAGAGGAT ACGGGTGCTC GCCGAGGAGC TGGATGCGCT AATCCAAATG
CACGTGCACG AAACCTCCCA CGAGATCGCC CGCGCCCTGG AACGCGACGG CGTGCGGCCG
CTGGCCCGCC TGGCGCGCTG CGGCCTGCTC GGCCCGCGCT TCCAGGCGGT GCACATGACC
CAGCTCGACG ACCAGGACCT GGCCCTGCTG GTGGAGAGCA ACAGCAGCGT GATCCACTGC
CCCGAGTCCA ATCTCAAGCT GGCCAGCGGC TTCTGCCCGG TGGAGCGTCT CTGGCAGGCC
GGCGTGAACG TGGCCGTCGG CACCGACGGC GCGGCGAGCA ACAACGATCT CGATCTGCTC
GGCGAAACCC GCACCGCGGC TCTGCTGGCC AAGGCGGTGG CCGGCTCGGC CACCGCCTTG
GACGCCCACC GCGCGCTGCG CATGGCCACC CTGAACGGTG CCCGCGCCCT GGGGCTGGAG
ACCGAGACCG GTTCCCTGGA GCCCGGCAAG GCTGCCGACA TGGTCGCTTT CGACCTCTCC
GGACTGGCCC AGCAGCCGGT CTACGACCCG GTCTCGCAAC TGATCTACGC CAGCGGCCGG
GACTGCGTGC GGCACCTCTG GGTCGGTGGC CGGCAATTGC TGGACAACGG CCAGTTGACC
CGCCTGGACG AGGAGCGCCT GAAGGACAAG GCTCGCGAAT GGAGCCGGCG CATCGGAGCC
TCGGACGGCG CCCGCTGA
 
Protein sequence
MSDATAPLDL LLLPTWLVPV EPAGVVLHDH ALGIRDGRIA LLAPRDAALR HAARETRELP 
GMLLAPSLVN AHGHAAMTLF RGLADDLPLM TWLEKHIWPA EARWVCEEFV RDGTELAIAE
QLKGGIGCFS DMYFHPEIAS DPIHQSGIRA QLCIPVLDFP TPGARDAGEA LRKGVELLED
LRHHPRIHVA FGPHAPYSVG DETLERIRVL AEELDALIQM HVHETSHEIA RALERDGVRP
LARLARCGLL GPRFQAVHMT QLDDQDLALL VESNSSVIHC PESNLKLASG FCPVERLWQA
GVNVAVGTDG AASNNDLDLL GETRTAALLA KAVAGSATAL DAHRALRMAT LNGARALGLE
TETGSLEPGK AADMVAFDLS GLAQQPVYDP VSQLIYASGR DCVRHLWVGG RQLLDNGQLT
RLDEERLKDK AREWSRRIGA SDGAR