Gene Avin_21890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_21890 
Symbol 
ID7761107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2185363 
End bp2186694 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content68% 
IMG OID643805074 
Productmonooxygenase, NtaA/SnaA/SoxA family 
Protein accessionYP_002799355 
Protein GI226944282 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTACGG GACAACTCAA GCTGGGCACC ATGATCCATG GGGTCGGCCA TGGCTGGGGA 
GAATGGCGGC ACCCCGAGGC CCTGGCCGAC GCCAGCGTCA ATTTCGAGTT CTACCGGCAG
CAGGCCCAGG TCGCCGAGGC GGGCAAGTTC GACTTCGTGT TCATCGCCGA CAGCCTGCAC
ATCCACGAGA AATCCAGTCC GCACTACCTC AACCGCTTCG AGCCCCTGAC CATCCTCTCG
GCGCTGGCGG CGGTGACCCG GCACGTCGGC CTGGTGGGCA CCGTCACGGT CAGCTACAGC
GAGCCCTTCA ACGTCGCCCG CCAGTTCGCC TCGCTCGACC ACATCAGCGG CGGGCGGGCC
GGCTGGAACG TGGTGACCTC CTGGCTGTCC GGCACGGCGG ACAATTTCGG CAGGGCCGAG
CATCCGGCGC ATGCCGTGCG CTACCGGATC GCCAGGGAAC ATGTCGGGGT CGTGAAAGGA
CTGTGGGACT CCTGGGAGGA CGATGCCTTC GTCCGCGACA AGGCGAGCGG CGAATTCTTC
GCTCCCGGCA AGCTGCACGC GTTGAACCAC CAGGGCGAGT TCTTTGCCGT CAAGGGTCCC
TTGAACATCG CCCGTTCGCG CCAGGGGCAG CCGGTCATCT TCCAGGCCGG CATCTCCGAG
GCCGGCCGCG ATTTCGCGGC GCAGAACGCC GACGCGGTCT TCACCAACCC GGGCTCGTTC
GACGAAGCCC AGGCCTTCTA CCGCGACCTC AAGGCGCGTG CCGCCGCTCA CGGCCGCGAC
CCGCGGGAAC TCTCGATCCT GCCGGGCATC AGCCCGATCG TCGGACGCGA TCCCGTCGAG
GTCGAGCGGC GTTACCGGCA GGCCGTCGAC CTGGTGTCCA TCGAGGACGC CCTCGTCGCC
CTCGGCCGCC CGTTCGACGA CCACGATTTC TCGCGCTACC CGCTCGACGA GCCCTTCCCC
GATATCGACG ACGGCGACGA CAGCCATAAA GGCAGCGCCG ACCGCATCAG GCGAGTCGCC
CGCGAAGAAG GACTGAGCCT GCGCGAGGTG GCGCTGCGCT TCGCCCTGCC CGACCGGACC
TTCGCCGGCA CTCCCGAGCA GGTCGCCGAC ACCCTGCAGC ACTGGTTCGA GAAGGACGCG
GCGGATGGTT TCATCGTCAG GTCGCTGCTG CCGGACGGCC TGGAGCATTT CGTCGAGCTG
GTCGTGCCGG TCCTGCAGGC GCGCGGCCTG TTCCGCCGGG AATACAGCGG CCGGACCCTG
CGCGACAACC TGCGTCTGCC GGTGCCGGCG AACCGCTACA GCGTGCGCGA CGAAGCGGGA
GCGGCGAGAT GA
 
Protein sequence
MSTGQLKLGT MIHGVGHGWG EWRHPEALAD ASVNFEFYRQ QAQVAEAGKF DFVFIADSLH 
IHEKSSPHYL NRFEPLTILS ALAAVTRHVG LVGTVTVSYS EPFNVARQFA SLDHISGGRA
GWNVVTSWLS GTADNFGRAE HPAHAVRYRI AREHVGVVKG LWDSWEDDAF VRDKASGEFF
APGKLHALNH QGEFFAVKGP LNIARSRQGQ PVIFQAGISE AGRDFAAQNA DAVFTNPGSF
DEAQAFYRDL KARAAAHGRD PRELSILPGI SPIVGRDPVE VERRYRQAVD LVSIEDALVA
LGRPFDDHDF SRYPLDEPFP DIDDGDDSHK GSADRIRRVA REEGLSLREV ALRFALPDRT
FAGTPEQVAD TLQHWFEKDA ADGFIVRSLL PDGLEHFVEL VVPVLQARGL FRREYSGRTL
RDNLRLPVPA NRYSVRDEAG AAR