Gene Avin_04940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_04940 
Symbol 
ID7759451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp465323 
End bp466357 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content72% 
IMG OID643803415 
ProductTranscriptional regulator, AraC family 
Protein accessionYP_002797723 
Protein GI226942650 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.543435 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGCCC GCGAGCCCTG GTACGAGCGC GACAGCCGCT TCATCGCCGC CCATCAGCAG 
CCCGCCGGGC TGCTCGACCT GGGCCTGGGC CGCGGCATCG ACAGCCACCG CCTGCTACGC
GGCAGCGGCC TGTTCCACGA GGACATCCTG CGCGGCGCGA CGCTGATCAG CCCCGAGCAG
TACCTGCGCC TGATCGACAA CGCCCAGCGC CTGCTGGCGG CGGACGACAC CAGCTTCCTG
TTCGGCCAGC GGCTGCTGCC AGGCCACCAG GGCGCCGCCA GCCAGGCCCT GGCGCAGGCC
GGCAACCTGC TGGAGGCGCT GCGCCTGCTC GGCGAACTGC GTGCCCTGCT CTGCCCGCTG
CTGAGCCCGC GGCTGGTCTG CGACGAGCGG TACGCCTATT TGTACTGGCT CGACAGTTGC
GGCGCCGGGC CGTCGCTGCG TTTTCTGCTC GAAGCGCACA TGGCGGCGGC GACCGCCCTG
GGCCGGCGGC TGAGCGGCGA ACGCCTGCCC TGGCGCTTCC ATTTCCGGCA TGCCGAGCCG
CGCTGCGTCG AGCAGTACTG GGTGCATCTG GGCGAGGACC TGCATTTCGG CAGCCCCTGC
GACCTGATGC GCCTGCCGCT CGACTGCCTG AACCGCCCGC TGCCGCAGGC CGCCCCGACG
GCCGGCGCGG TGAGCCGCCG ACAGGCCCGC GCGCAACTGG AGAGCGCCGG CCCGGCGGCC
AGCCTGCTCG ACCGCCTGTA CGACTGGCTG CTCGCGCACG TGCGCGAGGC GCCCGGCCTG
GAGCGCGCCG CCGAAGCCTT CGCGATGAGC CCGACCACCT TCAAGCGCAA GCTGCGCAAG
CACGCCACCA GCTACCAGGA GCAGCACGAC CGCGCCCGTC TGCACGTGGC GCTCTGGCTC
CAGCAGGTGA AGGGCTACGG CAACGAGGCG ATCGCCAGCT ACCTGCACTT CCACGACGCC
AACAACTTCC GCCGCTCGTT CAAGCGCTGG ACCGGCATGC CGCCCAGTTC GCTGCGCCAG
TCGCTGTGCG GTTGA
 
Protein sequence
MSAREPWYER DSRFIAAHQQ PAGLLDLGLG RGIDSHRLLR GSGLFHEDIL RGATLISPEQ 
YLRLIDNAQR LLAADDTSFL FGQRLLPGHQ GAASQALAQA GNLLEALRLL GELRALLCPL
LSPRLVCDER YAYLYWLDSC GAGPSLRFLL EAHMAAATAL GRRLSGERLP WRFHFRHAEP
RCVEQYWVHL GEDLHFGSPC DLMRLPLDCL NRPLPQAAPT AGAVSRRQAR AQLESAGPAA
SLLDRLYDWL LAHVREAPGL ERAAEAFAMS PTTFKRKLRK HATSYQEQHD RARLHVALWL
QQVKGYGNEA IASYLHFHDA NNFRRSFKRW TGMPPSSLRQ SLCG