Gene Avin_47100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_47100 
SymbolvnfA3 
ID7763573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp4779711 
End bp4781264 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content68% 
IMG OID643807555 
Productsigma54-dependent activator protein 
Protein accessionYP_002801790 
Protein GI226946717 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains 
TIGRFAM ID[TIGR01817] Nif-specific regulatory protein 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.536607 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCGC ATTCGGCGAA CGCGCGTCCC GGTCGGGAAC TGCCGGTCCA GGACAAGCCC 
TGCGAATGGG ACATGGGTGA ATGCCGCACC GATCTGTTGC CGCTGCTCGG CGAACTGGGG
CGCATCTCCA GCGAGGGCAG CGACCTGGCC GGCATCCTGC GGGTGCTGCT GGAACTGATG
CAGCGCCATC TGAAGGTCGC GCGCGGCATG GTGACCCTGC GCGATCCGGA ATCGGGGCGG
ATCTTCGTCC AGCAGGGCTG CGGCCTGAGC GAGGAGGAGG AGGCCTCGGG CAGCCTCGCC
CACGGCGAGG ACATCGTCGC CCAAGTGGTG GACAGCGGGC GGACGGTGCT GCTGCCGGGC
GAGGCGGGCA GCCAGTCCTT CCTCTGCGTG CCGATCCGCC GCGACCGCAA GGTGCTGGGC
GCCATCGTCG CCGAGCGCCA CTACGCCAAC CGCCAGATGC TCGAGCTGGA CGCGGAGATT
CTCGCCATTC TCGCCGCCAC CACCGCCCAG GCGGTGGAGC TGCACCTGCA GGAGCACGTG
CGCAAGGTCG CCCTGGAGGA CGAGAACCGC CGCCTGCGCT CGGCCCTGCA GAGTCGCTTC
AAGCCCAGCA ACATCATCGG CAATTCGCGG CCGCTGCAGG AGGTCTACGG GTTGATCGAG
AAGGTCACCC GCTCGCGGAC CACGGTACTG ATCCTCGGCG AGAACGGCGT GGGCAAGGAA
CTGGTGGCCA GCGCCATCCA CTACAACAGC AGCAGCGCCG AGGGTCCCTT CGTCAAGTTC
AACTGCGCCG CCCTGCCGGA GAGCGTCATC GAGAGCGAGC TGTTCGGCCA CGAGCGCGGC
GCCTTCACCG GGGCGGCGAC CCAGCGGCGC GGGCGCTTCG AGGCGGCCGA CGGCGGGACC
ATCTTCCTCG ACGAGGTGGG CGAGCTGTCC CTGGCCATGC AGGCCAAGCT GCTGCGGGTG
CTGCAGGAGA AGAGCTTCGA ACGGGTCGGC GGCAACGTCA CCCACCAGGT CGACCTGCGC
ATCCTCGCCG CCACCAACCG CGACTTGCGG GCGATGGTGG AGCAGGGCCG CTTCCGCGAG
GATCTCTACT ACCGGCTCAA CGTCTTCCCC ATCACCGTGC CGCCGCTGCG CGAGCGCGGC
TCCGACGTCG CCACCCTGGC GGAGCATTTC GTCGCGCGCT TCTCCGGCGA GATGGGCGTC
ACAGTGGAGC GCATCTCCGC GCCGGCGATG AGCATGCTGA TGTGCTACCA CTGGCCGGGC
AACGTGCGCG AGCTGGAGAA CGTCATCGAG CGGGCGGTCA TCCTCTGCGA GGATGCGGTC
ATCGAACCGC ATCACCTGCC GCCTTCGCTG CAGACCCCGG CGGTTTCCGA GAGCCCGTCC
GCCGGCGGCA TTCTCGATGT CCGCCTGAGG CAGGCCGAGC ACGAGATGAT CGTCGAGGCG
CTCAAGCGGC ATAAGGGCAA CATGACCGAG GCCGCCACCC ATCTGGGTCT GACCCGGCGC
ATCCTCGGCC TGCGCATGGC CCGGCACAAC CTGAACTACA AGGATTTCCG CTGA
 
Protein sequence
MKPHSANARP GRELPVQDKP CEWDMGECRT DLLPLLGELG RISSEGSDLA GILRVLLELM 
QRHLKVARGM VTLRDPESGR IFVQQGCGLS EEEEASGSLA HGEDIVAQVV DSGRTVLLPG
EAGSQSFLCV PIRRDRKVLG AIVAERHYAN RQMLELDAEI LAILAATTAQ AVELHLQEHV
RKVALEDENR RLRSALQSRF KPSNIIGNSR PLQEVYGLIE KVTRSRTTVL ILGENGVGKE
LVASAIHYNS SSAEGPFVKF NCAALPESVI ESELFGHERG AFTGAATQRR GRFEAADGGT
IFLDEVGELS LAMQAKLLRV LQEKSFERVG GNVTHQVDLR ILAATNRDLR AMVEQGRFRE
DLYYRLNVFP ITVPPLRERG SDVATLAEHF VARFSGEMGV TVERISAPAM SMLMCYHWPG
NVRELENVIE RAVILCEDAV IEPHHLPPSL QTPAVSESPS AGGILDVRLR QAEHEMIVEA
LKRHKGNMTE AATHLGLTRR ILGLRMARHN LNYKDFR