Gene Aazo_3889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3889 
Symbol 
ID9341693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3941457 
End bp3942641 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content45% 
IMG OID 
ProductNADH dehydrogenase 
Protein accessionYP_003722520 
Protein GI298492343 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAGAA TAGAAACCCG CACTGAACCA ATGGTGCTAA ATATGGGACC TCATCACCCA 
TCAATGCACG GAGTTCTGCG GCTAATTATG ACACTGGATG GCGAGGATGT CGTTGACTGT
GAACCAGTCA TTGGCTACCT GCATAGGGGA ATGGAAAAAA TTGCTGAAAA CCGTTCTACC
GTGATGTATG TCCCCTACGT CAGCCGTTGG GACTATGCAG CGGGAATGTT CAATGAAGCA
GTTACAGTTA ACGCACCTGA AAAATTAGCA GGTATTACAG TTCCCAAACG CGCCAGCTAC
ATCCGCGTGA TTATGCTGGA ACTAAACCGC ATCGCCAACC ATTTACTGTG GTTTGGTCCT
TTTCTCGCTG ACGTGGGCGC ACAAACCCCC TTCTTTTATC AATTCCGGGA ACGGGAAATG
ATTTATGATC TGTGGGAAGC TGCCACAGGT TATCGGATGG TAAACAACAA CTACTTCCGT
GTTGGTGGTG TAGCTGCTGA TTTACCCTAT GGTTGGGTAG ACAAGTGCTT AGAATTTTGT
GAATACTTTA TCCCCAAAGT AGACGAATAC GAACGCTTAG TTACTAACAA TCCCATTTTC
CGGAGACGCA TCGAAGGTAT TGGTACAATT ACCCGCCAAG AAGCAATTAA CTGGGGACTT
TCTGGCCCTA TGTTACGTGG TTCTGGCGTA AAATGGGACT TGCGAAAAGT CGACCATTAT
GAATGTTATG ACGACTTCGA CTGGGAAGTA CAGTGGGAAA CAGTCGGTGA TTGTCTCGCC
CGTTATATGG TAAGAATGCG GGAAATGCGG GAATCGGTAA AAATCATCAA ACAAGCTATC
AAAGGTTTAC CCGGTGGTTC CTACGAAAAC CTGGAAGCTA AGCGTTTAAT GGCTGGTAAG
AAATCAGAGT GGGATGGTTT TGATTACCAG TTTGTCGCCA AGAAAGTTGC TCCCACTTTC
AAAATTCCCA CAGGTGAAAT CTATTCCCGT GTAGAAAGCG GTAAAGGGGA ACTGGGTATT
TATCTAGTTG GTGATAATAA TGTTTTCCCC TGGCGGTGGA AAATTCGCGC TGCCGATTTC
AATAACCTTC AAATTCTACC TCATTTGTTA CGAGGAATGA AGGTAGCAGA TGTGGTGGTA
ATTCTTGGCA GTATTGACGT AATTATGGGT TCTGTTGACA GATAA
 
Protein sequence
MSRIETRTEP MVLNMGPHHP SMHGVLRLIM TLDGEDVVDC EPVIGYLHRG MEKIAENRST 
VMYVPYVSRW DYAAGMFNEA VTVNAPEKLA GITVPKRASY IRVIMLELNR IANHLLWFGP
FLADVGAQTP FFYQFREREM IYDLWEAATG YRMVNNNYFR VGGVAADLPY GWVDKCLEFC
EYFIPKVDEY ERLVTNNPIF RRRIEGIGTI TRQEAINWGL SGPMLRGSGV KWDLRKVDHY
ECYDDFDWEV QWETVGDCLA RYMVRMREMR ESVKIIKQAI KGLPGGSYEN LEAKRLMAGK
KSEWDGFDYQ FVAKKVAPTF KIPTGEIYSR VESGKGELGI YLVGDNNVFP WRWKIRAADF
NNLQILPHLL RGMKVADVVV ILGSIDVIMG SVDR