Gene Aazo_1350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1350 
Symbol 
ID9339145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1421282 
End bp1422649 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content38% 
IMG OID 
Productnitrogenase MoFe cofactor biosynthesis protein NifE 
Protein accessionYP_003720729 
Protein GI298490552 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.452769 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATTA CCCAAGGCAA AATTAACGAG TTACTTAGTG AATCAGGATG CAAAGATAAT 
CAACATAAAC AAGGAGAAAA GAAAAACAAA TCTTGTACAC AACAAGCTCA ACCAGGTGCT
GCTCAAGGAG GATGCGCTTT TGATGGTGCA ATGATTGCAT TAGTACCTAT TACTGATGCT
GCTCATTTGG TTCATGGACC TATTGCTTGT GCTGGTAACT CTTGGGGAAG TCGTGGAAGT
CTTTCTTCTG GACCAATGCT CTATAAGACC GGTTTTACCA CTGATATTAG TGAAAATGAT
GTAATTTTCG GTGGTGAGAA AAAGCTTTAT AAGGCAATTT TGGAAGTTAA TGAACGCTAC
AAACCAGCCG CTATTTTTGT TTACGCTACT TGTGTAACTG CTTTAATCGG CGATGATATT
GATGCAGTTT GTAAAGTTGC GGCTGAGAAA GTTGGTACTC CTGTCATTCC AGTAATTGCT
CCGGGATTTA TTGGTAGTAA AAATCTAGGT AATCGTTTTG GTGGTGAAGC TTTACTAGAA
TATGTAGTTG GCACTGCTGA ACCTGAATAT ACTACACCTT ATGATATTAA TTTGATTGGT
GAATACAATA TTGCTGGTGA AATGTGGGGC GTTTTGCCTT TATTTGAAAA ATTAGGCATT
CGCGTCTTAT CAAAAATCAC TGGTGATGCT CGTTATGAAG AAATTCGTTG TGCTCACCGC
GCTAAGTTAA ATGTAATGAT TTGCTCACGG GCCTTATTAA ATATGGCGCG GAAAATGGAG
GAACGTTACG GTATTCCTTA CATTGAAGAG TCTTTTTATG GTATTAATGA TATTAATCAT
TGTCTCAGAA CTGTTGCAGC TAAATTAGGT AGTCTTAATT TACAAGCACG GACTGAAAAG
TTAATTACAG ATGAAACAGC GGCTTTAGAT ATTGCCCTTG CTCCCTATAG AGAATCCTTA
AGGGGTAAAC GGGTGGTTCT GTATACTGGT GGTGTGAAAA GTTGGTCGAT CATTTCTGCT
TCTAAGGATT TAGGAATTGA AGTTGTTGCT ACTAGTACAC GCAAAAGTAC AGAAGAAGAT
AAATCCAAAA TCAAGAAATT ACTTGGCAAT GATGGCATTA TGTTGGAAAA GGGTAATGCC
CAAGAATTGC TAAAATTAGT AAGAGAAACT AAAGCTGATA TGTTAATAGC TGGTGGTCGG
AATCAGTACA CAGCTTTAAA GGCAAGAATT CCATTTTTAG ATATTAACCA AGAACGTCAT
CATCCCTATG CAGGTTATAT GGGAATGGTG GAAATGGCAC GGGAGTTATA TGAGGCTTTG
TATAGTCCAA TTTGGGAACA AATTCGTAAG CCTGCGCCTT GGGAGTAA
 
Protein sequence
MKITQGKINE LLSESGCKDN QHKQGEKKNK SCTQQAQPGA AQGGCAFDGA MIALVPITDA 
AHLVHGPIAC AGNSWGSRGS LSSGPMLYKT GFTTDISEND VIFGGEKKLY KAILEVNERY
KPAAIFVYAT CVTALIGDDI DAVCKVAAEK VGTPVIPVIA PGFIGSKNLG NRFGGEALLE
YVVGTAEPEY TTPYDINLIG EYNIAGEMWG VLPLFEKLGI RVLSKITGDA RYEEIRCAHR
AKLNVMICSR ALLNMARKME ERYGIPYIEE SFYGINDINH CLRTVAAKLG SLNLQARTEK
LITDETAALD IALAPYRESL RGKRVVLYTG GVKSWSIISA SKDLGIEVVA TSTRKSTEED
KSKIKKLLGN DGIMLEKGNA QELLKLVRET KADMLIAGGR NQYTALKARI PFLDINQERH
HPYAGYMGMV EMARELYEAL YSPIWEQIRK PAPWE