Gene Avin_20630 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_20630 
Symbol 
ID7760989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2053198 
End bp2054331 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content60% 
IMG OID643804960 
ProductMoaA, NifB, PqqE, radical SAM superfamily protein 
Protein accessionYP_002799241 
Protein GI226944168 
COG category[R] General function prediction only 
COG ID[COG0535] Predicted Fe-S oxidoreductases 
TIGRFAM ID[TIGR03470] hopanoid biosynthesis associated radical SAM protein HpnH 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGGCATTC CTTTCATTCA ACAGATCAAG ATCGCGCGCT ATATCCTTGG CAAGAAGCTG 
AGCGGTCAAC ACCGCTATCC GTTGGCGATG ATGCTGGAGC CCCTGTTCCA GTGTAATCTT
GCCTGCGCCG GCTGTGGCAA GATCGACCAT CCCAAGGATG TCTTGCGCAA GCGCATGAGC
GTCGAGGACG CCTTGCGCGC GGTGGACGAA TGCGACGCGC CGGTGGTTTC CATCCCTGGG
GGCGAGCCCC TGATTCACAA GGAAATGCCG CAGATCGTGC GGGGCATCGT CGCACGCAAG
AAATTTGTCT ATCTATGCAC CAATGCCATT CTGCTGCCCA AGCACATCGA CGAATACGAA
CCCTCGCCCT ACTTCACCTG GTCGATTCAT CTGGATGGCT TGCAGAAGCG CCACGATGAG
TCGGTATGCA TGAAGGGTGT GTTCGACAAG GCTGTCGCTG CGATAAAACT GGCGCTTGCA
CGCGGCTACC GGGTGACCAT CAACTGCACC CTGTTCAACG GGGAGCCACC GGCCGAAATC
GCCGACTTCT TCGACTACGC CATGACTCTG GGCATCGAGG GCATCACCGT CTCCCCGGGC
TACAGCTACC AGCACGCACC GCGTCAAGAC GTATTTCTCG GCCGCACCGA GAGCAAGGAG
CTGTTCCGCG AGGTGTTCAA GCGCGGCAAG GAGCGCAAGA GTCGCTGGGT ATTCAATCAG
TCATCGATGT TTCTCGATTT TCTGGCCGGC AACCAGAGTT ACCAGTGCTC CCCGTGGTCC
AACCCGACCT ACAGCATTTT CGGCTGGCAG AAACCCTGCT ATCTGCTGGT CGATGAGGGA
TACGCGCCGA CCTATAAGGC GCTGATGGAA GACACCCGCT GGGAACACTA CGGCGTCGGC
ATCAATCCCA AATGCGACAA CTGCATGGCC CACTGCGGCT TCGAAGGCAG CGCCGTCAAT
GACACCTTTG CACATCCACT CGCGGCCATG CGGGTAGCCA TGTTCGGGCC TCGTACCGAC
AGCGTGATGG CCCCGAATCT GCCGGTTCGG TACGGCAGTC GCGCCGATGC CGCCCCCGCA
CGCATCCCGG TCCACGCCAT CCAGCCCTCT GGCGCCGACA TGAATGAAGG CTGA
 
Protein sequence
MGIPFIQQIK IARYILGKKL SGQHRYPLAM MLEPLFQCNL ACAGCGKIDH PKDVLRKRMS 
VEDALRAVDE CDAPVVSIPG GEPLIHKEMP QIVRGIVARK KFVYLCTNAI LLPKHIDEYE
PSPYFTWSIH LDGLQKRHDE SVCMKGVFDK AVAAIKLALA RGYRVTINCT LFNGEPPAEI
ADFFDYAMTL GIEGITVSPG YSYQHAPRQD VFLGRTESKE LFREVFKRGK ERKSRWVFNQ
SSMFLDFLAG NQSYQCSPWS NPTYSIFGWQ KPCYLLVDEG YAPTYKALME DTRWEHYGVG
INPKCDNCMA HCGFEGSAVN DTFAHPLAAM RVAMFGPRTD SVMAPNLPVR YGSRADAAPA
RIPVHAIQPS GADMNEG