Gene Avin_48970 details

Gene Information

Locus tag: Avin_48970
Symbol: anfK
ID: 7763756
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 4956821
End bp: 4958209
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 59%
IMG OID: 643807737
Product: Fe-only nitrogenase, beta subunit
Protein accession: YP_002801972
Protein GI: 226946899
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR02931] Fe-only nitrogenase, beta subunit
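
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions about the record's conventions, although the listed numbers are consistent with them. The function names are illustrative only.

```python
# Values copied from the Gene Information record above.
START_BP = 4956821
END_BP = 4958209
GENE_LENGTH_BP = 1389
PROTEIN_LENGTH_AA = 462


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1389
print(expected_protein_length(GENE_LENGTH_BP))  # 462
```

Both results agree with the Gene Length and Protein Length fields.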


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.12057
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTTGCG AAGTCAAGGA AAAAGGGCGG GTTGGCACTA TCAACCCCAT CTTTACCTGT 
CAACCGGCCG GTGCCCAGTT CGTCAGTATC GGTATCAAGG ATTGCATCGG TATCGTGCAT
GGCGGCCAAG GCTGCGTGAT GTTCGTCCGC CTGATCTTTT CCCAGCACTA CAAGGAAAGT
TTCGAGCTGG CCTCTTCCTC CCTGCACGAG GACGGCGCCG TGTTCGGTGC CTGCGGCCGG
GTCGAGGAAG CGGTCGATGT GCTGCTCAGC CGCTATCCCG ACGTGAAGGT GGTGCCCATC
ATCACCACCT GCTCCACCGA GATCATCGGC GACGACGTGG ACGGGGTGAT CAAGAAGCTC
AACGAAGGGC TGCTGAAAGA GAAGTTCCCG GACCGGGAAG TCCATCTGAT CGCCATGCAC
ACGCCGAGCT TCGTGGGCAG CATGATCAGC GGCTACGACG TGGCCGTTCG GGATGTGGTC
AGGCATTTCG CCAAGCGCGA AGCGCCCAAC GACAAGATCA ATCTGCTCAC CGGCTGGGTC
AATCCGGGGG ATGTCAAGGA GCTGAAGCAC CTGCTCGGGG AAATGGACAT CGAAGCCAAC
GTGTTGTTCG AGATCGAAAG TTTCGACTCG CCGATCCTGC CGGATGGCAG TGCCGTTTCC
CACGGCAATA CCACCATCGA GGATCTGATC GACACCGGCA ATGCCCGGGC GACCTTCGCC
CTGAACCGCT ACGAAGGCAC CAAGGCCGCC GAGTATCTGC AGAAGAAATT CGAGATCCCG
GCGATCATCG GCCCGACCCC GATCGGCATC CGCAATACCG ACATCTTCCT GCAGAACCTG
AAGAAGGCGA CGGGCAAGCC GATTCCCCAG TCGCTGGCCC ATGAGCGCGG GGTGGCCATC
GATGCCCTGG CCGACCTGAC CCACATGTTT CTGGCCGAAA AGCGTGTGGC CATCTATGGG
GCGCCGGATC TGGTGATCGG CCTGGCCGAA TTCTGCCTGG ATCTGGAGAT GAAGCCCGTC
TTGCTGCTGC TGGGCGACGA CAACTCCAAG TACGTGGACG ATCCGCGCAT CAAGGCGCTT
CAGGAAAACG TCGATTACGG CATGGAAATC GTCACCAATG CGGATTTCTG GGAACTGGAA
AACCGCATCA AGAACGAGGG TCTGGAACTG GATCTGATCC TCGGTCACTC CAAGGGCCGT
TTCATCTCCA TCGACTACAA CATCCCGATG CTGCGCGTGG GTTTCCCGAC CTACGACCGC
GCCGGCCTGT TCCGCTATCC CACGGTGGGC TATGGCGGTG CCATCTGGCT GGCCGAGCAG
ATGGCCAACA CCCTGTTCGC CGATATGGAA CACAAGAAGA ACAAGGAATG GGTCCTCAAC
GTCTGGTAA
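
The gene sequence above is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any length or composition check. The following is a minimal sketch of that cleanup and of a GC-percentage calculation. The helper names are hypothetical, and only the first line of the listing is used here for brevity; pasting the full listing into the same functions would allow a comparison with the Gene Length and GC content fields.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already-cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence listing above.
first_line = "ATGACTTGCG AAGTCAAGGA AAAAGGGCGG GTTGGCACTA TCAACCCCAT CTTTACCTGT"
seq = clean_sequence(first_line)

print(len(seq))                    # 60 bases on this line
print(round(gc_percent(seq), 1))   # GC percentage of this line only
```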
 
Protein sequence
MTCEVKEKGR VGTINPIFTC QPAGAQFVSI GIKDCIGIVH GGQGCVMFVR LIFSQHYKES 
FELASSSLHE DGAVFGACGR VEEAVDVLLS RYPDVKVVPI ITTCSTEIIG DDVDGVIKKL
NEGLLKEKFP DREVHLIAMH TPSFVGSMIS GYDVAVRDVV RHFAKREAPN DKINLLTGWV
NPGDVKELKH LLGEMDIEAN VLFEIESFDS PILPDGSAVS HGNTTIEDLI DTGNARATFA
LNRYEGTKAA EYLQKKFEIP AIIGPTPIGI RNTDIFLQNL KKATGKPIPQ SLAHERGVAI
DALADLTHMF LAEKRVAIYG APDLVIGLAE FCLDLEMKPV LLLLGDDNSK YVDDPRIKAL
QENVDYGMEI VTNADFWELE NRIKNEGLEL DLILGHSKGR FISIDYNIPM LRVGFPTYDR
AGLFRYPTVG YGGAIWLAEQ MANTLFADME HKKNKEWVLN VW
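
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the compact NCBI-style amino-acid string; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons, which this sketch does not model. The function name and the decision to stop at the first stop codon are choices made for illustration, and only the first line and the final nine bases of the listing are used.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(text: str) -> str:
    """Translate a nucleotide listing in frame 1, stopping at the first stop codon.

    Whitespace is ignored. Codons containing characters other than A, C, G, T
    raise KeyError in this simple sketch.
    """
    seq = "".join(text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listing.
first_line = "ATGACTTGCG AAGTCAAGGA AAAAGGGCGG GTTGGCACTA TCAACCCCAT CTTTACCTGT"
print(translate(first_line))   # MTCEVKEKGRVGTINPIFTC
print(translate("GTCTGGTAA"))  # VW (last three codons, ending in the TAA stop)
```

The two outputs match the first twenty and the last two residues of the protein sequence listed above.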