Gene Ava_4051 details

Gene Information

Locus tag: Ava_4051
Symbol:
ID: 3681672
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5036448
End bp: 5037818
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 40%
IMG OID: 637719403
Product: nitrogenase MoFe cofactor biosynthesis protein NifE
Protein accession: YP_324551
Protein GI: 75910255
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE
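
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 5036448
end_bp = 5037818
gene_length_bp = 1371
protein_length_aa = 456

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be a stop codon.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(span_bp, total_codons, coding_codons)  # 1371 457 456
```

Under those assumptions the three fields agree: the span equals the reported gene length, and the codon count minus the stop codon equals the reported protein length.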


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.283872
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAAATA CTCAGGAAGA AATCAACAAA CTACTCAACG AAATAGCCTG CGAATATAAC 
GTCAAGAAAT TGCAGAGCAA GCCCAATAAA ACTACTTGTA AAAAACCACC AAAACCAGGT
GCAGCGCAGG GAAATTGTCC TTTTGAGGGG GCGATGGTGG CTCTTGGTCC AATTACTGAT
GCAGTGCATT TAGTACATGG CCCTGTTGGT TGCACGCGCA ATCCTTGGGG TAGTCGGGGT
AGTCTTTCTT CTAGCTCTGA ACTCTACCAA ATGGGTTTCT CTACTGATTT GAGTGAGAAT
GAAATCATTT TTGGCGGTGA AAAGAAGCTA GAAAAATCTA TTTTACATAT TGCTCAACGT
TATAATCCGG CTGCTATTTT TGTCTATTCT ACTTGTGTCA CGGCTTTGAT TGGTGACGAT
ATCGAAAGTG TTTGTAAAAA AGCTACAAAT CAAGTCGGGA TTCCGGTTGT TTATATCAAC
GCACCTGGAT TTCTTGGGAG TAAAAATTTA GGCAATCGTA TTGCCGGGGA AGCTTTATTA
GATTATGTAG TGGGAACAGC CGAACCAGAA TTTACTACGC CATTTGATAT CAATATTATT
GGTGATTATA ATGTTGCTGG GGAAATCTGG AATATTTTAC CACTGTTGCA GAAATTAGGT
ATTCGCGTTC TTTCCAAAAT TACCGGTGAT GCACGTTATC AAGAAATTTG CTATGCTCAC
CGCGCGAAAT TAAATGTGGT AATTTGCTCG AATGTGTCAC TAAAAATGGC ACAAACAATG
CAGGAGCGTT ACGGCATTCC TTATATTGAA GAGTCTTTCT TTGGCATAGA GAACATGAAT
AGTTGTTTGC GAAATATTGC AGCAGCATTC GGTGATCAAT ACTTACAAGA AAGAACGGAA
TGGTTAATTG CGGAGGAAAC CGCAGCTTTA GATATTGCCC TTGCTCCTTA TCGTTCTCAG
TTACAAGGAA AACGTATTGT CCTTTATAGT GGAGGTGTGA AAAGTTGGTC AGTAATTTTA
GCAGCAAAAG ACTTGGGTAT GAAGGTTGTT GCTACTAGTG ATAGGAAGAA TACTCAAGAT
GAAAAAGTTA AAATTAAGGA ATTACTTGGT CAAGATGGCA TGGTTTTGGC TAAGGGTGGC
CCCAAGGCTT TATTACAAGT AATAGAAGAT ATGAATGCGG ATATTTTAGT TGCTGGTGCT
AGTAATCAAT ACACAGCAAT TAAAGCACGT ATTCCTTTTT TAGATATTAA TCATGAGCGT
CATCATGCTT ACGCTGGTTA TGCGGGAATG GTGGAAATGG CGCGGGAATT TTATGAAGCC
TTGTATAGTC CTGTGTGGAA ACAAGTTAGG CAACCTGCTC CTTGGGAGTA G
 
Protein sequence
MRNTQEEINK LLNEIACEYN VKKLQSKPNK TTCKKPPKPG AAQGNCPFEG AMVALGPITD 
AVHLVHGPVG CTRNPWGSRG SLSSSSELYQ MGFSTDLSEN EIIFGGEKKL EKSILHIAQR
YNPAAIFVYS TCVTALIGDD IESVCKKATN QVGIPVVYIN APGFLGSKNL GNRIAGEALL
DYVVGTAEPE FTTPFDINII GDYNVAGEIW NILPLLQKLG IRVLSKITGD ARYQEICYAH
RAKLNVVICS NVSLKMAQTM QERYGIPYIE ESFFGIENMN SCLRNIAAAF GDQYLQERTE
WLIAEETAAL DIALAPYRSQ LQGKRIVLYS GGVKSWSVIL AAKDLGMKVV ATSDRKNTQD
EKVKIKELLG QDGMVLAKGG PKALLQVIED MNADILVAGA SNQYTAIKAR IPFLDINHER
HHAYAGYAGM VEMAREFYEA LYSPVWKQVR QPAPWE
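
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code (table 11 differs from it in the permitted start codons, not in these assignments). The `translate` helper and its names are hypothetical, written for this example rather than taken from any library. Only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: not part of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence; each 60-base line starts in frame.
first_line = "ATGAGAAATA CTCAGGAAGA AATCAACAAA CTACTCAACG AAATAGCCTG CGAATATAAC"
last_line = "TTGTATAGTC CTGTGTGGAA ACAAGTTAGG CAACCTGCTC CTTGGGAGTA G"

print(translate(first_line))  # MRNTQEEINKLLNEIACEYN
print(translate(last_line))   # LYSPVWKQVRQPAPWE
```

The two outputs correspond to the first 20 and last 16 residues of the protein sequence listed above; passing the full gene sequence to the same helper would be expected to reproduce the whole protein.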