Gene Avin_20220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_20220 
Symbol 
ID7760950 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2011967 
End bp2013736 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content70% 
IMG OID643804919 
Producthypothetical protein 
Protein accessionYP_002799202 
Protein GI226944129 
COG category[C] Energy production and conversion 
COG ID[COG1053] Succinate dehydrogenase/fumarate reductase, flavoprotein subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.953536 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACCTGC CGGATAGCGG TCCGGCGGAG CTTTCCTACG ATGTGGTGGT GCTCGGCAGC 
GGCGCCGCCG GCTTCGCCGC GGCGCTCAGC GCTGCCTGCA AGGGCCTGAG GGTCCTGCTG
GTGGAGAAGG ACGCCAGCTT CGGCGGCACC TCGGCCATCT CCGGCGGCGC CGTGTGGATT
CACGACAGCG ACCAGGCGCG GGCGGCCGGC ATCCATGTCC CGGCCGAACA CACGCGCACC
TACCTGCAGG CGGTCATCGG CGCGGGCTAC AAGCCCGAGC TGGTCGACGC CTTCGTCGAG
CGCGGGCGCG AGGCGCTGGC CTTCCTCGAA GGCCACAGCG AACTGAAATA CAGTCTGCGG
CCGCTGTCCC CCGACTATTA CCCGGACCTG CCCGGCGGCA CCGACAAGGG CCGCGCCCTG
GAAATCGACG AATACGACGG ACGCAGGCTG GGCGAGCGCT TCAAGGACCT GCGCAGGCCG
CCGGACGGCA TGCTGCTGTT CGGCGGGATG ATGGTCAACC GCGTCGACAT CCAGCATTTC
CTCGCCATCC GCCGCTCGCC GAAGTCCTTC TGGCACTGCC TGAAACTGCT GGCGCGCTAT
GGCGTCGATC GCCTGAGCCA TCCGCGCGGC ACCCGCCTCA CGGTAGGCAA CGCGCTGATC
GCCCGGCTGG CCGCCAGCGC CTTCGCCAAG GGGGTCGAAC TCTGGCTGGA GACGCGCACG
GAATCGCTGA TCGTCGAGGA TGGCGCGGTC AAGGGCCTGC TGGTCGACTA CCGGGGCCAG
CGCCGGCGCG TGCTGGCGCG TGGCGGGGTG GTGCTGGCCT GCGGCGGTTT CGCCGCCGGC
GCGCAGGCCG CCGACTACCG GCCGAAGACC GACGCCCGGC ACTGGAGCAT GTCGCCGGAG
ACCAACGTCG GCGACGGCCT GCGCCTGGCC GCCGCGGCGA ACGCCGCTGC CGGCGAGGGT
TTGGCCGCCA ACTTCTTCTG GGCGCCGGTG TCGGTGCTGC GCAAGCCGGA CGGCAGCCTG
GAGCGCTTCC CGCACCTGGT CACCGACCGC GCCAAGCCGG GGGTGATCGC GGTCAATCGC
GCCGGCCGGC GCTTCGTCAA CGAGTCGAAC TCCTACCACT GCTTCGTCGA GGCCATGTTC
GCCGACGGCG GGGCCAACGC ACCCTGCTGG CTGATCTGCG ACAGCGAGGC GCTGAACAGG
TATGGCATGG GCCTGGCCCG GCCGAAGCCG GTGGACAACT CGGCGCTGAT CGAGGCCGGC
TACCTGCTGC GCGCCGACGA CATCGGCGGG CTGGCGCAGG CCATCGGCGT GGACCCGGCC
GCCCTGCAAC GGACGTTGGA ACGCTACAAC GCCGACGCCC GGCAGAACCT CGACCGGGAA
TTCGGCAAGG GCGGCAACTC CTACAACCGC TACATGGGCG ATCCGCTGCA CCAGCCCAAT
CCCTGCATCG CGCCGCTGCA GCGGGCGCCG TACTACGCGA TCCGCATCGA TACCGGCGAC
CTCGGCTCGG CGCGCGGCCT GCTCACCGAC GCCCGCGCCA ACGTGCTGGA TCGCGAGGGC
CGGCCGATCG CCGGGCTCTA CGCGGCCGGC AACGAAATGA ATTCGATCAT GGACGGCACC
TATCCCGGTC CCGGCATCAC CCTGGGGCCG GGCCTGGCCT TCGGCTACAT CGCCGCCTGC
GACATCGCCG AACGCCTGGG CGGCGGGACT TCGCGAGCAA CCCAACCTGG AGACGAGCAT
GTACTACGAA CTGCGCACCT ACACCATTAA
 
Protein sequence
MHLPDSGPAE LSYDVVVLGS GAAGFAAALS AACKGLRVLL VEKDASFGGT SAISGGAVWI 
HDSDQARAAG IHVPAEHTRT YLQAVIGAGY KPELVDAFVE RGREALAFLE GHSELKYSLR
PLSPDYYPDL PGGTDKGRAL EIDEYDGRRL GERFKDLRRP PDGMLLFGGM MVNRVDIQHF
LAIRRSPKSF WHCLKLLARY GVDRLSHPRG TRLTVGNALI ARLAASAFAK GVELWLETRT
ESLIVEDGAV KGLLVDYRGQ RRRVLARGGV VLACGGFAAG AQAADYRPKT DARHWSMSPE
TNVGDGLRLA AAANAAAGEG LAANFFWAPV SVLRKPDGSL ERFPHLVTDR AKPGVIAVNR
AGRRFVNESN SYHCFVEAMF ADGGANAPCW LICDSEALNR YGMGLARPKP VDNSALIEAG
YLLRADDIGG LAQAIGVDPA ALQRTLERYN ADARQNLDRE FGKGGNSYNR YMGDPLHQPN
PCIAPLQRAP YYAIRIDTGD LGSARGLLTD ARANVLDREG RPIAGLYAAG NEMNSIMDGT
YPGPGITLGP GLAFGYIAAC DIAERLGGGT SRATQPGDEH VLRTAHLHH