Gene Avin_21940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_21940 
SymboltorG 
ID7761112 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2191356 
End bp2193626 
Gene Length2271 bp 
Protein Length756 aa 
Translation table11 
GC content71% 
IMG OID643805079 
Producttrimethylamine-N-oxide reductase (cytochrome c) 
Protein accessionYP_002799360 
Protein GI226944287 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR00509] molybdopterin guanine dinucleotide-containing S/N-oxide reductases 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTTATA CCTCGCTGCA TTGGGGCGCC TACCGGCCCC TGGTCGAGAA CGGCCGCCTG 
GTGGAGATGC GTCCGGTGCC CTGGGACCGC GATCCGTCGC CCATCGGCGC CTCCCTGCCG
GGCGCCATCG ACTCGCCCAG CCGGATTCGC CGCCCGGCCG TGCGCCGGGG ATTCCTGGAG
CGGCCGGGCG GCAGCGGCGA AGGACGCGGG CGGGAGCCCT TCGTCGAGGT CGACTGGGAT
ACCGCGCTGG ATCTGGTCGC CGCCGAGCTG CGCCGGGTGC GCGGCGAATA CGGCAACCGG
GCGATCTTCG GCGGCAGCTA CGGCTGGAGC AGCGCCGGCC GCTTCCACCA TGCGCAGAGC
CAGTTGCATC GCTTCCTCAA CGGCTTCGGC GGCTACGTCT CCAGCCGCGA CAGCTACAGC
CTGGGCGCCG GGCGCGTGCT GCTGCCGCAC ATCGCCGGCG ACATGGACTG GCTGCTGGCG
CGGCACACCT CCTGGGACAA CCTGGCCGAG CATTGCCAAC TGTTCGTCGC CTTCGGCGGC
CTGCCGCTGA AGAACGCCCA GGTCAGCCCC GGCGGCGCCA GCGATCACCT GCTGCGCGAG
GCCATCGACA GGCTGTCGCG GGCCGGCGTG CGCTTCGTCA ACCTCAGCCC GCTGCGCAGC
GACCTGCAGG GCGCGGCCGA CTGCGAGTGG TGGCCCCTGC GGCCGGGCAG CGACACCGCG
CTGATGCTGG CGCTGGCCTA TGTGCTGGTC GACGAGGGTC TGCACGACCA GGCTTTCCTG
GCGCGCTACA CGGTGGGTTT CGAACCTTTT CGCGATTACC TGCTAGGGGT CGTCGACGGC
CAGCCGAAGG ACCCGGAGTG GGCCGCGGCG CGCACCGACA TTCCCGCGCC GCGCATCGTC
GAACTGGCCC GGCGCATGGC CGCCGGCCGC ACCATGATCA ACGTCGCCTA CGCCTTGCAG
CGCGCCGTGC ACGGCGAACA GCCGTTCTGG ATGACCCTGG TGCTGGCCTG CCTGCTGGGC
CAGATCGGCC TGCCCGGCGG CGGCTTCGGC CTGGGCTACG GGGCGATGAA CAACACCGGC
AGCGGGCGCA AGCCGTTTTC CGGCCCGCGC CTGGAGCAGG GCACGAACCC GGTGCGCGAT
TTCATTCCCG TCGCGCGCAT CGCCGACATG CTGCTGCGGC CGGGCGCGCC GTTCGACTAC
GACGGCCGGC GCCACCGTTA CCCGGACATT CGTCTGGTGT ACTGGGCGGG CGGCAACGCC
TTCCATCACC ACCAGGACCT CAACCGCTTC CTCCACGCCT GGCGGCGCCC GCAGACCATC
GTGGTCCACG AGCAGTACTG GACGGCCCAG GCCAAGCATG CCGACATCGT CCTGCCGGCG
ACCACCGCAC TGGAGCGCGA CGACATCGGC AGTGCCAGTT CGGACCGTTT CATGCTGGCC
ATGAAGCGCG CCATCGAGCC GGTCGGCGAG GCGCGCGACG ACTACCGCAT CTTCCTCGAC
CTGGCCGGGC GTCTGGGCTT CGCCGACCGC TTCGGCGAGG GCCGCGACAC CTTGGACTGG
CTGCGCCACA GCTACGAAGG CTCGCGGCAG CGCGCCAGGG AGCAGGGCAT CGAACTGCCG
GACTTCGAGG TCTTCTGGGC CGAGGGCTGC TTCGAGGTGC CCCGCCCGGC GCAGCAGACC
ATCCTGCTCG AGGAGTTCCG CGACGACCCC GAGGCCCACC CGCTGGGCAC GCCCTCCGGT
CGGCTGGAAA TCCACTCCGA GCGCATCGCC GGCTTCGGCT ACGCCGATTG CCCCGGCCAT
CCGGTCTGGT TCGAGCCGCC GCCGCCCAGC CATCCGCTGC ACCTGATCTC CAACCAGCCG
AAGACCCGCC TGCACAGCCA GTATGACCAC GGCGCCTACA GCCGGGCCTC GAAGATCCAC
GGCCGCGAGC CGCTCACCCT GCATCCGCGG GACGCCGCCG CCCGCGGCAT CGCCGACGGC
GACATAGTGC GGGTGTTCAA CGAACGCGGC GCGCTGCTGG CCGGCGCGAT CCTCAGCGAA
GACATACGTC CCGGCGTAGT GCAACTGGCC ACCGGCGCCT GGTACGACCC CGTCGATCCC
GCCGAGCACA ATAGTCTGGA GAAGCACGGC AACCCCAACG TGCTGACCCA CGACGTCGGC
GCCTCCAGCC TGTCGCAGGG CTGCACCGCC CATAGCGCGC AGGTGGAGGT GGAGCGCTGG
AGCGGCGAAG CGCCGCCGGT GACGGCGTTC GAGCCGCCGC GCTTTCGCTA G
 
Protein sequence
MTYTSLHWGA YRPLVENGRL VEMRPVPWDR DPSPIGASLP GAIDSPSRIR RPAVRRGFLE 
RPGGSGEGRG REPFVEVDWD TALDLVAAEL RRVRGEYGNR AIFGGSYGWS SAGRFHHAQS
QLHRFLNGFG GYVSSRDSYS LGAGRVLLPH IAGDMDWLLA RHTSWDNLAE HCQLFVAFGG
LPLKNAQVSP GGASDHLLRE AIDRLSRAGV RFVNLSPLRS DLQGAADCEW WPLRPGSDTA
LMLALAYVLV DEGLHDQAFL ARYTVGFEPF RDYLLGVVDG QPKDPEWAAA RTDIPAPRIV
ELARRMAAGR TMINVAYALQ RAVHGEQPFW MTLVLACLLG QIGLPGGGFG LGYGAMNNTG
SGRKPFSGPR LEQGTNPVRD FIPVARIADM LLRPGAPFDY DGRRHRYPDI RLVYWAGGNA
FHHHQDLNRF LHAWRRPQTI VVHEQYWTAQ AKHADIVLPA TTALERDDIG SASSDRFMLA
MKRAIEPVGE ARDDYRIFLD LAGRLGFADR FGEGRDTLDW LRHSYEGSRQ RAREQGIELP
DFEVFWAEGC FEVPRPAQQT ILLEEFRDDP EAHPLGTPSG RLEIHSERIA GFGYADCPGH
PVWFEPPPPS HPLHLISNQP KTRLHSQYDH GAYSRASKIH GREPLTLHPR DAAARGIADG
DIVRVFNERG ALLAGAILSE DIRPGVVQLA TGAWYDPVDP AEHNSLEKHG NPNVLTHDVG
ASSLSQGCTA HSAQVEVERW SGEAPPVTAF EPPRFR