Gene Avin_21660 details


Gene Information

Locus tag: Avin_21660
Symbol:
ID: 7761086
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 2166264
End bp: 2167589
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 66%
IMG OID: 643805054
Product: monooxygenase, NtaA/SnaA/SoxA family
Protein accession: YP_002799335
Protein GI: 226944262
COG category: [C] Energy production and conversion
COG ID: [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases
TIGRFAM ID:
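
The coordinate and length fields in this record are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields.
start_bp, end_bp = 2166264, 2167589
length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1326, matching "Gene Length"
print(protein_length(length_bp))  # 441, matching "Protein Length"
```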


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.714961
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTGGCC CGAGGCAACT CAAACTGGGT GCGATCATTC ACGGTGTCGG CCACGGCTGG 
GGCGATTGGC GCCATCCGGA TGCCGTGGCC GATGCCAGTG TCAATTTCCG TTTCTACCGG
CAGCAGGCGC AACTGGCCGA GGCGGGCAGG TTCGATTTCC TGTTCATCGC CGACAGCCTG
CACATCCACG AGAAATCCAG CCCGCACTAC CTCAACCGCT TCGAACCCCT GACCATCCTT
TCCGCGCTGG CCACCGTGAC CGAGCACATC GGCCTGGTCG GCACCGCCAC GGTCAGCTAC
ACGGAGCCCT TCAACCTGGC CCGCCAGTTC GCCTCGCTCG ACCATATCAG CGGCGGGCGG
GCCGGCTGGA ACGTGGTGAC CTCCTGGCTG TCCGGCACGG CCGACAACTT CGGCCGGCCC
GAACACGCGC CGCACGACCT GCGCTACCGG ATCGCCAGGG AGCACCTCAA CGTGGTCAAG
GGCCTGTGGG ACTCCTGGGA GGACGATGCC TTCGTCCGCG ACAAGGCGAG CGGCAAATTC
TTCGATCCCG ACAGGCTGCA TGCGCTCAAC CACCAGGGCG AGTTCTTTTC CGTCAAGGGG
CCCTTGAACA TCGCCCGTTC GCCCCAGGGA CAACCGGTCA TCTTCCAGGC CGGCAGCTCG
GAGGAGGGGC GCAACTTCGC GGCGCAGAAC GCCGATGCGA TCTTCGTCAA TCCGGAGTCT
TTCGACGAAG CGCTCGCCTA TTATCGGGAC ATCAAGACGC GCACGGCCCA ATACGGCCGG
GACCCGCAGA AGCTGTCGAT CCTGCCGGGC ATCCGGCCGA TCGTCGGACG CGACCCGGCC
GAGGTCGAGC GGCGTTACCG GCAGGCCGTC GACCTGGTGT CCATCGAGGA TGCCCTCGTC
GCGCTGGGGC GTCCCTTCAA CGATCACGAT TTCTCGCGAT ATCCCCTCGA CGAGCCCTTC
CCCGAACTGG GCGATATCGG CAGGGACAGC CAGCAGGGCG AGTCCAACCA CATCAAGCGG
GTGGCCAGGG AGGAGGGACT CAGCCTGCGC GAGGCCGCCC TGCGCTTTTC CCGGCCGAAC
CGGGCGTTCG TCGGTACGCC GGAGCAGATC GCCGACACTT TGCAGCACTG GTTCGAGAAG
GGCGCGGCGG ACGGTTTCAC CATCGGTTCG CTGCTGCCCG ACAGCCTGCA GTCCTTCACC
GAGCTGGTGG TGCCGGTTCT GCAGGCGCGC GGCCTGTTCC GCCGGGAATA CGCCGGCCAT
ACCCTGCGCG ACAACCTGGG CCTGGACGTG CCCGTCAACC GCTATAGTGC GAGACGCCTG
GCGTGA
 
Protein sequence
MGGPRQLKLG AIIHGVGHGW GDWRHPDAVA DASVNFRFYR QQAQLAEAGR FDFLFIADSL 
HIHEKSSPHY LNRFEPLTIL SALATVTEHI GLVGTATVSY TEPFNLARQF ASLDHISGGR
AGWNVVTSWL SGTADNFGRP EHAPHDLRYR IAREHLNVVK GLWDSWEDDA FVRDKASGKF
FDPDRLHALN HQGEFFSVKG PLNIARSPQG QPVIFQAGSS EEGRNFAAQN ADAIFVNPES
FDEALAYYRD IKTRTAQYGR DPQKLSILPG IRPIVGRDPA EVERRYRQAV DLVSIEDALV
ALGRPFNDHD FSRYPLDEPF PELGDIGRDS QQGESNHIKR VAREEGLSLR EAALRFSRPN
RAFVGTPEQI ADTLQHWFEK GAADGFTIGS LLPDSLQSFT ELVVPVLQAR GLFRREYAGH
TLRDNLGLDV PVNRYSARRL A
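
The gene and protein sequences can be cross-checked by translating the nucleotide sequence. The sketch below is a minimal pure-Python version, assuming the 10-base blocks are separated only by whitespace; `clean`, `gc_fraction` and `translate` are illustrative helper names. It uses the amino-acid assignments of NCBI translation table 11, which match those of the standard code, and it does not model alternative start codons.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments for NCBI translation table 11, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (each position cycling through T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq_text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq_text: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq_text)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied from the record.
first_line = "ATGGGTGGCC CGAGGCAACT CAAACTGGGT GCGATCATTC ACGGTGTCGG CCACGGCTGG"
print(translate(first_line))  # MGGPRQLKLGAIIHGVGHGW
```

Passing the full gene sequence to `translate` gives a result that can be compared with the listed protein sequence, and the value from `gc_fraction` can be compared with the listed GC content.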