Gene Avin_01390 details

Gene Information

Locus tag: Avin_01390
Symbol: nifD
ID: 7759104
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 137758
End bp: 139236
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 59%
IMG OID: 643803063
Product: Nitrogenase molybdenum-iron protein alpha chain:Nitrogenase component I, alpha chain
Protein accession: YP_002797379
Protein GI: 226942306
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01282] nitrogenase molybdenum-iron protein alpha chain
            [TIGR01862] nitrogenase component I, alpha chain
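
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record does not state either convention. The function names are hypothetical and chosen for illustration only.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length, has_stop_codon=True):
    """Number of residues encoded by a CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # Assumption: the final codon is a stop codon and encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(137758, 139236)
print(length)                     # 1479
print(protein_length_aa(length))  # 492
```

Under these assumptions the values agree with the Gene Length (1479 bp) and Protein Length (492 aa) fields listed for Avin_01390.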


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGGTA TGTCGCGCGA AGAGGTTGAA TCCCTCATCC AGGAAGTTCT GGAAGTTTAT 
CCCGAGAAGG CTCGCAAGGA TCGTAACAAG CACCTGGCCG TCAACGACCC GGCGGTTACC
CAGTCCAAGA AGTGCATCAT CTCCAACAAG AAGTCCCAGC CCGGTCTGAT GACCATCCGC
GGCTGCGCCT ACGCCGGTTC CAAAGGCGTG GTCTGGGGCC CCATCAAGGA CATGATCCAC
ATCTCCCACG GTCCGGTAGG CTGCGGCCAG TATTCGCGCG CCGGCCGTCG TAACTACTAC
ATCGGTACCA CCGGTGTGAA CGCCTTCGTC ACCATGAACT TCACCTCGGA CTTCCAGGAG
AAGGACATCG TGTTCGGTGG CGACAAGAAG CTCGCCAAAC TGATCGACGA AGTGGAAACC
CTGTTCCCGC TGAACAAGGG TATCTCCGTC CAGTCCGAGT GCCCGATCGG CCTGATCGGC
GACGACATCG AATCCGTGTC CAAGGTCAAG GGCGCCGAGC TCAGCAAGAC CATCGTACCG
GTCCGTTGCG AAGGCTTCCG CGGCGTTTCC CAGTCCCTGG GCCACCACAT CGCCAACGAC
GCAGTCCGCG ACTGGGTCCT GGGCAAGCGT GACGAAGACA CCACCTTCGC CAGCACTCCT
TACGATGTGG CCATCATCGG CGACTACAAC ATCGGCGGCG ACGCCTGGTC TTCCCGCATC
CTGCTGGAAG AAATGGGCCT GCGTTGCGTA GCCCAGTGGT CCGGCGACGG CTCCATCTCC
GAAATCGAGC TGACCCCGAA GGTCAAGCTG AACCTGGTTC ACTGCTACCG CTCGATGAAC
TACATCTCCC GTCACATGGA AGAGAAGTAC GGTATCCCAT GGATGGAGTA CAACTTCTTC
GGCCCGACCA AGACCATCGA GTCGCTGCGT GCCATCGCCG CCAAGTTCGA CGAGAGCATC
CAGAAGAAGT GCGAAGAGGT CATCGCCAAG TACAAGCCCG AGTGGGAAGC GGTGGTCGCC
AAGTACCGTC CGCGCCTGGA AGGCAAGCGC GTCATGCTCT ACATCGGTGG CCTGCGTCCG
CGCCACGTGA TCGGCGCCTA CGAAGACCTG GGCATGGAAG TGGTGGGTAC CGGCTACGAG
TTCGCCCACA ACGACGACTA TGACCGCACC ATGAAAGAAA TGGGTGACTC CACCCTGCTG
TACGATGACG TGACCGGCTA CGAATTCGAA GAATTCGTCA AGCGCATCAA GCCCGACCTG
ATCGGCTCCG GTATCAAGGA GAAGTTCATC TTCCAGAAGA TGGGCATCCC CTTCCGTCAA
ATGCACTCCT GGGATTATTC CGGCCCCTAC CACGGCTTCG ATGGCTTCGC CATCTTCGCC
CGTGACATGG ACATGACCCT GAACAATCCG TGCTGGAAGA AACTGCAGGC TCCCTGGGAA
GCTTCCGAAG GCGCCGAGAA AGTCGCCGCC AGCGCCTGA
 
Protein sequence
MTGMSREEVE SLIQEVLEVY PEKARKDRNK HLAVNDPAVT QSKKCIISNK KSQPGLMTIR 
GCAYAGSKGV VWGPIKDMIH ISHGPVGCGQ YSRAGRRNYY IGTTGVNAFV TMNFTSDFQE
KDIVFGGDKK LAKLIDEVET LFPLNKGISV QSECPIGLIG DDIESVSKVK GAELSKTIVP
VRCEGFRGVS QSLGHHIAND AVRDWVLGKR DEDTTFASTP YDVAIIGDYN IGGDAWSSRI
LLEEMGLRCV AQWSGDGSIS EIELTPKVKL NLVHCYRSMN YISRHMEEKY GIPWMEYNFF
GPTKTIESLR AIAAKFDESI QKKCEEVIAK YKPEWEAVVA KYRPRLEGKR VMLYIGGLRP
RHVIGAYEDL GMEVVGTGYE FAHNDDYDRT MKEMGDSTLL YDDVTGYEFE EFVKRIKPDL
IGSGIKEKFI FQKMGIPFRQ MHSWDYSGPY HGFDGFAIFA RDMDMTLNNP CWKKLQAPWE
ASEGAEKVAA SA
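
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It relies on the fact that NCBI translation table 11 assigns the same amino acid to each codon as the standard code (the two differ in the permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate` and `gc_content` are hypothetical, and only the first and last rows of the gene sequence are used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in-frame DNA, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq):
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_row = "ATGACCGGTA TGTCGCGCGA AGAGGTTGAA TCCCTCATCC AGGAAGTTCT GGAAGTTTAT"
last_row = "GCTTCCGAAG GCGCCGAGAA AGTCGCCGCC AGCGCCTGA"

print(translate(first_row))  # MTGMSREEVESLIQEVLEVY
print(translate(last_row))   # ASEGAEKVAASA
```

Each full row of the gene sequence holds 60 bases, so every row starts in frame and can be translated on its own. Passing the complete gene sequence to `gc_content` gives a figure to compare against the GC content field of the record.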