Gene Avin_43740 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_43740 
Symbol 
ID7763247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp4420357 
End bp4421715 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content69% 
IMG OID643807229 
Productmonooxygenase, NtaA/SnaA/SoxA/DszA family 
Protein accessionYP_002801470 
Protein GI226946397 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.514983 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAGGA AACGCCTGAT TCTCAACGGC TTCGCGATGA ACACCGTGTC GCACGTCTAC 
CACGGACTGT GGCGCCACCC GGACAGCCAG CAGATTCATT TCAACGACCT GGAAACCTGG
GTCGAGCTGG CGCAGTTGCT GGAGCGCAGC CATTTCGACG CGCTGTTTCT GGCCGACGTG
ATCGGCATCG ACCCGTCCTA CCAGGGCAAC TGGGACACCT ACCTGCGCGG TGCGGTGCAG
GTGCCGATCA ACGATTCCTC GACGCTGATC GCCGCGCTGA TCGGCGCCAC CCGCGACCTC
GGTCTGGTCT TCACCAGTTC GATCCTCCAG GACCATCCGT TCAACTTCGC CCACCGCGCC
TCGACCCTGG ATCACCTGAG CAAGGGGCGC GTCGGCTGGA ACATCGTCAC CAGCGTCAGC
CACAACGCCG CGCAGAACTT CGGCTTCGAG CGCATCGTCG CCCACGACCG GCGCTACGCC
TGGGCCGAGG AATACATGGA GGTGGTCTAC AAGCTGTGGG AAGGCTCCTG GGAGGAGGAT
GCGGTGCGCG CCGACCGGCG CGCGGGGATC TACGCCGATC CGCTCAAGGT GCACCGCATC
CACCACCAGG GCGAACGCTA CAAGGTCGCC GGACCGCATC TCAGCCAGCC TTCGCCGCAA
CGCACGCCGG CGCTGTTCCA GGCCGGCGCC TCGTACGCCG GACGGGCCTT CGCCGCGCGC
AACGCCGAAG CGACCTTCAT CGCCAGCCGC CACCCGGAAG GCGCGCGGCG GCTGATCGAG
GACGTGCGCG GGCAGGTGCG GCGCGCCGGC CGGCGCGCCG ACGACCTGCT GTTCATCCAG
GGGCTGTCGT TCGTCGTCGG CAGCAGCGAG GAAGAAGCGC AATGCAAGGC GCGGGAGCTG
GACGAACTGC TCTGCGTCGA CGGGCTGGCC GCGCACATCA GCCGCGACCT CGGCATCGAC
CTCGGCCTGC TGGAGCCCGA GCAGCCCATC GACGAGCTGG AGGTCGAGGG CGTGCAGGGC
ATCCTGCGCT TCTTCGAGGA GGCCAATCCC GGCCAGCGCG CCACGGTGGC GGATCTGGCG
CGCGCCTACG CCGGCACGCG CCTGGTCGGC TCGCCGGAGT CCATCGCCGA CGAGCTGGAG
CGCTGGCAGG ACGCGGGGAT CGACGGCGTC AACGTCATCT ACCAGACCCT GCCCGGCACC
TTCCGCGAGG TGGCCGAGCA ACTCATCCCC GAACTGCAGA AACGCGGCCT GGCGCAGCGC
GAATACGCGC CGGGCACCCT GCGCGAGCGG CTGTTCCCCG GCCGTCCCGC GCATCTCAAC
GAACGCCACC CGGCCGCCGC CCAGCGGCGC CGGGGGTGA
 
Protein sequence
MSRKRLILNG FAMNTVSHVY HGLWRHPDSQ QIHFNDLETW VELAQLLERS HFDALFLADV 
IGIDPSYQGN WDTYLRGAVQ VPINDSSTLI AALIGATRDL GLVFTSSILQ DHPFNFAHRA
STLDHLSKGR VGWNIVTSVS HNAAQNFGFE RIVAHDRRYA WAEEYMEVVY KLWEGSWEED
AVRADRRAGI YADPLKVHRI HHQGERYKVA GPHLSQPSPQ RTPALFQAGA SYAGRAFAAR
NAEATFIASR HPEGARRLIE DVRGQVRRAG RRADDLLFIQ GLSFVVGSSE EEAQCKAREL
DELLCVDGLA AHISRDLGID LGLLEPEQPI DELEVEGVQG ILRFFEEANP GQRATVADLA
RAYAGTRLVG SPESIADELE RWQDAGIDGV NVIYQTLPGT FREVAEQLIP ELQKRGLAQR
EYAPGTLRER LFPGRPAHLN ERHPAAAQRR RG