Gene Avin_22390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_22390 
Symbol 
ID7761157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2238862 
End bp2239818 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content72% 
IMG OID643805124 
ProductTaurine dioxygenase protein 
Protein accessionYP_002799405 
Protein GI226944332 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2175] Probable taurine catabolism dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGTCTT CCGCCCGCCT GCATCAGGCC GCGCTGCCCG CCGCCGAGGC GATCGTCGTC 
ACCCCGCTGT CCCTGTACAT CGGCGCCCAG GTCGACGGGG TCGACCTGTC GCGCCCGTTG
CCCTCCGCCC AACGCGAAGC CATCCGCGCC GCCCTGCTGC GCTGGAAGGT GCTGTTCTTC
CACGACCAGC ACCTCGACCA CGCCCAGCAG GTGGCGTTCG GCCGGCAATT CGGCGAACCG
ACCGTCGGCC ACCCGGTGTT CGGCCATGTC GAGGGTCACC CGGAGATCTA TTCGGTCGGC
CGCGACCGCT TCAAGGCGCG CTTCACCGAC GAGCGCCTGG TGCGCCCCTG GAGCGGCTGG
CACACCGACG TGAGCGCGGC GCTCAATCCG CCGGCGGCGG CGATCCTGCG CGGCGTGGAC
ATCCCGCCCT ATGGCGGCGA CACCCAGTGG ACCGACCTGG TGGCCGCCTA CAATGGCCTG
TCGCCGACCC TGCGGGCCTT CGTCGACGGC CTGCGCGGCG AGCACCGCTT CACCCCGCCG
GAAGGCGCCG AGGCACGGCC GGGCTTCAGC GAGCCGCTGG CGGTCAGGCC GCTGGTCAGC
GAGCATCCGC TGGTGCGCGT GCACCCGGAA ACCGGCGAGA AGGCGCTGTT CGTCAGCCCG
ACCTTCCTCA AGCGCATCGT CGGCCTCAGC CCGCGCGAGA GCGAACAGTT GCTGGAACTG
CTCTTCGAAC ACGCGATCCG CCCCGAATAC ACGGTGCGCT TCAAGTGGCG ACCCGGCTCG
CTGGCCTTCT GGGACAACCG GGTCACCGCC CACCAGCCGC CATCCGACAT CCACGCCACC
GACCTGCCCC GCCAGCTCTA CCGCATCACC CTGGTCGGCG ACATTCCCGT CGGCCCGGAC
GGCCGTCCCT CCCGCACCAT CGCCGGCGAG CCGGTGCTGG CTCATCCGGC CGCCTGA
 
Protein sequence
MPSSARLHQA ALPAAEAIVV TPLSLYIGAQ VDGVDLSRPL PSAQREAIRA ALLRWKVLFF 
HDQHLDHAQQ VAFGRQFGEP TVGHPVFGHV EGHPEIYSVG RDRFKARFTD ERLVRPWSGW
HTDVSAALNP PAAAILRGVD IPPYGGDTQW TDLVAAYNGL SPTLRAFVDG LRGEHRFTPP
EGAEARPGFS EPLAVRPLVS EHPLVRVHPE TGEKALFVSP TFLKRIVGLS PRESEQLLEL
LFEHAIRPEY TVRFKWRPGS LAFWDNRVTA HQPPSDIHAT DLPRQLYRIT LVGDIPVGPD
GRPSRTIAGE PVLAHPAA