Gene Avin_20660 details


Gene Information

Locus tag: Avin_20660
Symbol:
ID: 7760992
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 2056464
End bp: 2057876
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 59%
IMG OID: 643804963
Product: MoaA, NifB, PqqE, radical SAM superfamily protein
Protein accession: YP_002799244
Protein GI: 226944171
COG category: [C] Energy production and conversion
COG ID: [COG1032] Fe-S oxidoreductase
TIGRFAM ID: [TIGR03471] hopanoid biosynthesis associated radical SAM protein HpnJ
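
The start coordinate, end coordinate, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the protein length excludes the stop codon; both are assumptions about how the record was generated, not something the record itself states. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2056464
end_bp = 2057876

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1413
print(protein_length)   # 470
```

Under these assumptions the computed values agree with the listed Gene Length (1413 bp) and Protein Length (470 aa).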


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTGCAGGCTC CATCCTTTGA TGGTTTCGAT GGCGGTGCCG GTTCGCGTTA TCAGGCTAAG 
CGCGAAATCA AGAGTTTCTG GTATCCGACC TGGTTGGCTC AGCCGGCCGC CCTGGTGCCC
GGTTCACGCC TGCTGGACGC TCCGGTGGAG GAGCTTTCGG TGGAGGAGTC GCTGAGGATT
GCCGCCGAAT ACGAGTTGGT GATCATCCAT ACCAGCACGC CTTCTTTCCC AACCGATGCC
AAGTTCGCCG AATTGCTGAA GGCGCGCCGC CCCGAGGTTC TGATCGGTAT GGTCGGCGCC
AAGGTCGCGG TCGATCCGAC CGACTCGCTG AACGCATCGA CGGCGATCGA CTTCGTGGCG
CGCGAGGAAT TCGATTACAC CTGCAAGGAA ATCGCCGAGG GCCATCCGCT CGAGGAGGTC
GCCGGCGTTA GCTATAAACT CGCCGATGGC GACGTGCGGC ACAACCCGCC GCGGGCGCCG
ATCGAGAATA TGGACGACCT GCCCTTCGTG GCGCCCGTCT ACAAGCGCGA CCTGAAGATC
GACCGCTATT TTATCGGCTA TCTCAAGCAT CCTTACGTAT CCATTTATAC CGGTCGTGGC
TGCCGCTCCA AGTGCACCTT CTGCCTATGG CCGCAGACCG TCGGCGGTCA CCGTTACCGT
GTCCGCTCGG CCGCCAGCGT GATCGCCGAG GCGAAGTGGA TCAAGGAGAA CATGCCCGAA
GTGAAGGAGT TGATGTTCGA CGACGATACC TTCACCGATA CTTCCAACTT GGAGCGTGTA
CACGAGATCG CCCGCGGCCT GCATGCTCTG GGCTGGACCT GGAGTTGCAA CGCCAAGGCC
AACGTACCCT ACGAATCGCT GAAGATCATG AAGGAAAATG GCCTGCGCCT GCTGCTGGTG
GGCTACGAGT CCGGCGACGA CCAGATCCTG CACAACATCA AGAAGGGCCT GCGTACCGAT
ATCGCCCGCA AGTTCACTGA AAATTGCCAC AAGCTGGGCA TTCAGGTGCA CGGTACCTTC
ATCCTCGGCC TGCCGGGGGA GACCAAGGAA ACCATCGCCA AGACCATCGA GTTTGCCAAG
GAGATAAATC CGCACACGGT GCAGGTATCG CTGGCGGCTC CCTATCCCGG TACCACGTTG
TACAGGCAGG CTGTGGAAAG TGGCTGGCTA GAGCCCAATC AGGACGCCAA CTTGGTCAAT
GACAAGGGAG TGCAACTGGC ACTGATCAGC TATCCACACC TGTCCAAGGA AGAGATCTAT
CACGGTGTGG AGACTTTCTA CCGGAACTTC TATTTTCGTC CCTCGAAAAT CTGGGAGATC
GTCAAGGAAA TGCTCGGCAG TTGGGAAATG ACGAAGCGCC GCCTACGCGA GGGCGTGGAG
TTTTTCCGCT TCCTGCATGC TCATGAGGCC TGA
 
Protein sequence
MQAPSFDGFD GGAGSRYQAK REIKSFWYPT WLAQPAALVP GSRLLDAPVE ELSVEESLRI 
AAEYELVIIH TSTPSFPTDA KFAELLKARR PEVLIGMVGA KVAVDPTDSL NASTAIDFVA
REEFDYTCKE IAEGHPLEEV AGVSYKLADG DVRHNPPRAP IENMDDLPFV APVYKRDLKI
DRYFIGYLKH PYVSIYTGRG CRSKCTFCLW PQTVGGHRYR VRSAASVIAE AKWIKENMPE
VKELMFDDDT FTDTSNLERV HEIARGLHAL GWTWSCNAKA NVPYESLKIM KENGLRLLLV
GYESGDDQIL HNIKKGLRTD IARKFTENCH KLGIQVHGTF ILGLPGETKE TIAKTIEFAK
EINPHTVQVS LAAPYPGTTL YRQAVESGWL EPNQDANLVN DKGVQLALIS YPHLSKEEIY
HGVETFYRNF YFRPSKIWEI VKEMLGSWEM TKRRLREGVE FFRFLHAHEA
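
The gene sequence begins with CTG while the protein sequence begins with M. This is consistent with translation table 11, in which CTG is one of the permitted alternative start codons and an initiating codon is read as methionine. The following is a minimal sketch of that translation step and of a GC-fraction calculation, written in plain Python. The function names (`clean`, `translate`, `gc_fraction`) are hypothetical helpers introduced for illustration, and the codon table is built from the NCBI ordering of the standard code, which table 11 shares for its amino-acid assignments.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiating codon is read as methionine, even if it is not ATG.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence above.
first_bases = "CTGCAGGCTC CATCCTTTGA TGGTTTCGAT"
print(translate(first_bases))  # MQAPSFDGFD
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the listed protein sequence and GC content to be checked in the same way.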