Gene Aazo_1358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1358 
Symbol 
ID9339153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1430633 
End bp1432072 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content42% 
IMG OID 
Productnitrogenase cofactor biosynthesis protein NifB 
Protein accessionYP_003720737 
Protein GI298490560 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACTAC CGGCCACAGG ACTCCTCACC TCCTCCGAAC AGGAACTCAA TATCAAGCAA 
GCCAAATCAG GTGGTTGTGG TTGCGACAGC AGCACAGCTC TAGAAATGGA CGAAAAGGTC
AAAGAACGCA TTGCCAAACA CCCTTGCTAT AGTGAAGAAG CACATCACCA TTATGCACGG
ATGCACGTTG CAGTTGCACC AGCTTGTAAT ATTCAATGCA ACTATTGTAA CCGCAAGTAT
GACTGTGCTA ACGAAAGCCG ACCTGGAGTA GTGAGTGAGT TACTCACACC TGAAGAAGCC
GCACATAAAG TGTTGGTAAT TGCAGGTAAA ATTCCCCAAA TGACAGTGTT GGGAGTTGCA
GGTCCTGGTG ATCCTTTAGC AAATCCTGAA AAAACATTCC GTACCTTTGA GTTGATTGCA
GATAAAGCAC CAGATATTAA GCTTTGCTTA TCAACTAACG GTTTGATGCT ACCAGAATAT
ATTGATCGCA TCAAACAATT AAATATAGAT CACGTTACTA TAACCCTTAA CACCATTGAT
CCAGAAATCG GCGCACAAAT TTATGCTTGG GTTCATTACA AACGCAAGCG TTATAAAGGT
GTGGAAGGTG CAAAGATTCT GCTAGAAAAG CAGTTGGAAG GATTGCAAGC TTTAAAAGAA
GCCGACATTT TGTGTAAAGT TAATTCTGTG ATGATTCCCG GAATTAATGA TCATCACTTG
GTGGAAGTTA ACAAAATGAT TCGTGAGAAT GGTGCATTCT TACACAATAT CATGCCGCTA
ATTTCCGCAC CAGAACATGG GACACATTTC GGTTTAACTC ATCAACGTGG TCCAACAGGA
AAAGAACTCA AAGAAGTTCA AGATAACTGT TCTGGTAACA TGAAAATGAT GCGTCACTGT
CGCCAGTGCC GAGCAGATGC GGTAGGATTA TTAGGAGAAG ACCGCAGTCA GGAATTTACC
AAAGAGAAAT TCTTGGAAAT GTCTCCAGAA TATAACCTGG AAACACGCCA GGAAGTTCAT
CAGGGCATTG AGAAATTTAG AGAAGCAATT AAACTAGCAA AGGCCAAGGT ACAAACTGCT
AAGGAAGTTG CCAACAGTCC GAAAATTTTA GTGGCTGTAG CGACTAAAGG TGGTGGATTA
GTTAATCAAC ACTTCGGTCA TGTGAAGGAA TTTCAAGTGT ACGAAGTTGA TGGTAATGAA
GTGCACTTTA TCAGTCATCG CAAAATCGAC CAATATTGTC AAGGTGGATA CGGCGAAGAA
GCGACCGCAG AAAATATAAT GAAAGCGATT GCAGATTGTA AAGCAGTCTT AGTTGCCAAA
ATTGGTAACT GTCCCAAAGA GAAATTAGAA GCAGCAGGGA TAAAGACTGT GGAAGCTTAC
GACGTAATTG AAAAAGTCGC ACTTGAATTT TACCAGCAGT ATGTAGGGAC TGGGGACTAG
 
Protein sequence
MTLPATGLLT SSEQELNIKQ AKSGGCGCDS STALEMDEKV KERIAKHPCY SEEAHHHYAR 
MHVAVAPACN IQCNYCNRKY DCANESRPGV VSELLTPEEA AHKVLVIAGK IPQMTVLGVA
GPGDPLANPE KTFRTFELIA DKAPDIKLCL STNGLMLPEY IDRIKQLNID HVTITLNTID
PEIGAQIYAW VHYKRKRYKG VEGAKILLEK QLEGLQALKE ADILCKVNSV MIPGINDHHL
VEVNKMIREN GAFLHNIMPL ISAPEHGTHF GLTHQRGPTG KELKEVQDNC SGNMKMMRHC
RQCRADAVGL LGEDRSQEFT KEKFLEMSPE YNLETRQEVH QGIEKFREAI KLAKAKVQTA
KEVANSPKIL VAVATKGGGL VNQHFGHVKE FQVYEVDGNE VHFISHRKID QYCQGGYGEE
ATAENIMKAI ADCKAVLVAK IGNCPKEKLE AAGIKTVEAY DVIEKVALEF YQQYVGTGD