Gene Avin_51000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_51000 
SymbolnifA 
ID7763948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp5170162 
End bp5171730 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content68% 
IMG OID643807928 
ProductNif-specific sigma54-dependent transcriptional activator protein, NifA 
Protein accessionYP_002802162 
Protein GI226947089 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains 
TIGRFAM ID[TIGR01817] Nif-specific regulatory protein 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.519777 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGCAA CCATCCCTCA GCGCTCGGCC AAACAGAACC CGGTCGAACT CTATGACCTG 
CAATTGCAGG CCCTGGCGAG CATCGCCCGC ACGCTCAGCC GCGAACAACA GATCGACGAA
CTGCTCGAAC AGGTCCTGGC CGTACTGCAC AATGACCTCG GCCTGCTGCA TGGCCTGGTG
ACCATTTCCG ACCCGGAACA CGGCGCCCTG CAGATCGGCG CCATCCACAC CGACTCGGAA
GCGGTGGCCC AGGCCTGCGA AGGCGTGCGC TACAGAAGCG GCGAAGGCGT GATCGGCAAC
GTGCTCAAGC ACGGCAACAG CGTGGTGCTC GGGCGCATCT CCGCCGACCC GCGCTTTCTC
GACCGCCTGG CGCTGTACGA CCTGGAAATG CCGTTCATCG CCGTGCCGAT CAAGAACCCC
GAGGGCAACA CCATCGGCGT GCTGGCGGCC CAGCCGGACT GCCGCGCCGA CGAGCACATG
CCCGCGCGCA CGCGCCTTCT GGAGATCGTC GCCAACCTGC TGGCGCAGAC CGTGCGCCTG
GTGGTGAACA TCGAGGACGG CCGCGAGGCG GCCGACGAGC GCGACGAACT GCGTCGCGAG
GTGCGCGGCA AGTACGGCTT CGAGAACATG GTGGTGGGCC ACACCCCCAC CATGCGCCGG
GTGTTCGATC AGATCCGCCG GGTCGCCAAG TGGAACAGCA CCGTACTGGT CCTCGGCGAG
TCCGGTACCG GCAAGGAACT GATCGCCAGC GCCATCCACT ACAACTCGCC GCGCGCGCAC
CGCCCCTTCG TGCGCCTGAA CTGCGCCGCG CTGCCGGAAA CCCTGCTCGA GTCCGAACTC
TTCGGCCACG AGAAGGGCGC CTTCACCGGC GCGGTGAAGC AGCGCAAGGG GCGTTTCGAG
CAGGCCGACG GCGGCACCCT GTTCCTCGAC GAGATCGGCG AGATCTCGCC GATGTTCCAG
GCCAAGCTGC TGCGCGTGCT GCAGGAAGGC GAGTTCGAGC GGGTCGGCGG CAACCAGACG
GTGCGGGTCA ACGTGCGCAT CGTCGCCGCC ACCAACCGCG ACCTGGAAAG CGAGGTGGAA
AAGGGCAAGT TCCGCGAGGA CCTCTACTAC CGCCTGAACG TCATGGCCAT CCGCATTCCG
CCGCTGCGCG AGCGTACCGC CGACATTCCC GAACTGGCGG AATTCCTGCT CGGCAAGATC
GGCCGCCAGC AGGGCCGCCC GCTGACCGTC ACCGACAGCG CCATCCGCCT GCTGATGAGC
CACCGCTGGC CGGGCAACGT GCGCGAACTG GAGAACTGCC TGGAGCGCTC GGCGATCATG
AGCGAGGACG GCACCATCAC CCGCGACGTG GTCTCGCTGA CCGGGGTCGA CAACGAGAGC
CCGCCGCTCG CCGCGCCGCT GCCCGAGGTC AACCTGGCCG ACGAGACCCT GGACGACCGC
GAACGGGTGA TCGCCGCCCT CGAACAGGCC GGCTGGGTGC AGGCCAAGGC CGCGCGGCTG
CTGGGCATGA CGCCGCGGCA GATCGCCTAC CGCATCCAGA CCCTCAACAT CCACATGCGC
AAGATCTGA
 
Protein sequence
MNATIPQRSA KQNPVELYDL QLQALASIAR TLSREQQIDE LLEQVLAVLH NDLGLLHGLV 
TISDPEHGAL QIGAIHTDSE AVAQACEGVR YRSGEGVIGN VLKHGNSVVL GRISADPRFL
DRLALYDLEM PFIAVPIKNP EGNTIGVLAA QPDCRADEHM PARTRLLEIV ANLLAQTVRL
VVNIEDGREA ADERDELRRE VRGKYGFENM VVGHTPTMRR VFDQIRRVAK WNSTVLVLGE
SGTGKELIAS AIHYNSPRAH RPFVRLNCAA LPETLLESEL FGHEKGAFTG AVKQRKGRFE
QADGGTLFLD EIGEISPMFQ AKLLRVLQEG EFERVGGNQT VRVNVRIVAA TNRDLESEVE
KGKFREDLYY RLNVMAIRIP PLRERTADIP ELAEFLLGKI GRQQGRPLTV TDSAIRLLMS
HRWPGNVREL ENCLERSAIM SEDGTITRDV VSLTGVDNES PPLAAPLPEV NLADETLDDR
ERVIAALEQA GWVQAKAARL LGMTPRQIAY RIQTLNIHMR KI