Gene Aazo_5022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_5022 
Symbol 
ID9342830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp5139888 
End bp5141801 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content41% 
IMG OID 
Productsulfite reductase, ferredoxin dependent 
Protein accessionYP_003723255 
Protein GI298493078 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.299313 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTAACA CTGCTCCTAC CTCTCTGACC AACCGCAAGC CTTCCAGAGT TGAAGGTATC 
AAAGAAAATA GTAATTTTTT GCGTGAACCT GTAGCAACAG AAATTCTTCA GGATACAACC
CACTTTAGTG AAGATGCGGT GCAGATTCTG AAGTTTCATG GGTCTTATCA ACAGGATAAC
CGTGATAATC GTGCTAAGGG ACAGGAGAAA GATTACCAAA TGATGCTGCG GACAAAAAAT
CCTGGTGGGT TAGTACCACC GCAGCTTTAT TTGGCTTTGG ATCAGTTGGC GGATGAATAT
GGTAATCATA CGTTAAGAGC TACAACTCGT CAGGGTTTTC AGATTCACGG GATTTTGAAA
AAGAATCTGA AAAGTACGAT CACTACAATT GTTGGAAACT TGGGTTCAAC GTTGGGTGCT
TGTGGTGACA TCAACCGCAA CGTTATGGCT CCCGCTGTAC CTTTTAAAAA TCGCCCAGAA
TATCTGTATG CTTGGGAATA TGCCCAAAAT CTCGCTGATT TGCTCTCACC CCAAACTGGT
GCTTATTATG AGATTTGGTT GGATGGGGAA AAAGCCATTA GTGCGGAAGA GCATCCAGAT
GTGAAAGCAG CTAGGGAAAA GAATGGTAAT GGTACTATTA TTCATGATTC CGTAGAACCA
ATTTATGGTA CTCACTATAT GCCCCGCAAA TTTAAAATTT GCGTGACTGT TCCTGGTGAT
AATTCCGTTG ATTTGTATTC CCAAGATTTG ACTTTGGTCG TAATGACCAA TAAGAAAGGG
GAACTCCAAG GTTTTAATGT TTTTGCGGGT GGTGGTTTAG GAAGAAACCA CAATAAAGAA
GAAACTTTTG CGCGACTAGC TGACCCAATT TGCTATGTGG TCAAAGATGA TGTTTATGAT
ATCGTTAAGG CGATTGTTGC GACTCAGAGA GATTATGGTG ATCGCACAGA CCGAAGACAC
GCCAGATTAA AATATTTAAT CAATGATTGG GGTATAGATA AATTCCGCGC TAAAGTCGAA
GAATACTTTG GTAAATCTGT CGAACCGTTT AAAGAATTAC CCAAGTTCAA ATATCAAGAT
TTCCTGGGCT GGAATGAACA GGGCGACGGT AAACTATTCA TAGGGATTTC CATTGATAAT
GGTCGGGTAA AAGATGAAGG TAAATTTCAA CTGAAAACCG CTTTGCGGTC AATTGTTGAA
CAATTTAACT TACCCATCCG CCTCACACCT AACCAAAACC TGATTTTTTA CGATATTCTA
CCGGAAGATA AAGAAGCTAT TCAAGAGATT CTCGACAAGT GTGGTATTAT CGTTGACCCT
ACGCAAATCG CAGCGTTAAC CCGCTATGCT ATGGCTTGTC CCGCTTTACC TACTTGTGGT
TTAGCGATTA CGGAATCAGA AAGGGCAATT CCTAGAATTT TAGAAAGAAT CCGGGCTTTA
TTAGATAAAC TGGGTTTACA AAAAGACCAT TTTGTAGTAA GGATGACTGG TTGCCCTAAC
GGTTGCGCTC GTCCCTATAT GGCAGAATTA GGGTTTGTGG GTAGTGCGCC GGAATCTTAT
CAACTTTGGT TGGGTGGTTC ACCAGATCAA ACACGGTTAG CACAACCTAT CATTGAAAAA
TTACACGACA ATGACATAGA AAGCTTTTTA GAGCCAATTT TTGTCTTGTT TAAGAAGTCG
CGGAAGGGTA AAGAGAGTTT TGGTGATTTT TGTGATCGCA CAGGTTTTGA TGCTATCCGC
GAATTTTCAG CTAACTACAC ACCGGGAGAA CCTACCAGTA GCGGTAAATC TCGTCATCGG
GTGAGTTTGC GAGATGATGT TTATCTGCAA CTTAAGGAAA CAGCAGAAAA GCAAGATCGG
CCAATGACTG AGTTGGTACA TGAGGCTTTA GATAAGTATT TCCAAAGCCT GTAG
 
Protein sequence
MVNTAPTSLT NRKPSRVEGI KENSNFLREP VATEILQDTT HFSEDAVQIL KFHGSYQQDN 
RDNRAKGQEK DYQMMLRTKN PGGLVPPQLY LALDQLADEY GNHTLRATTR QGFQIHGILK
KNLKSTITTI VGNLGSTLGA CGDINRNVMA PAVPFKNRPE YLYAWEYAQN LADLLSPQTG
AYYEIWLDGE KAISAEEHPD VKAAREKNGN GTIIHDSVEP IYGTHYMPRK FKICVTVPGD
NSVDLYSQDL TLVVMTNKKG ELQGFNVFAG GGLGRNHNKE ETFARLADPI CYVVKDDVYD
IVKAIVATQR DYGDRTDRRH ARLKYLINDW GIDKFRAKVE EYFGKSVEPF KELPKFKYQD
FLGWNEQGDG KLFIGISIDN GRVKDEGKFQ LKTALRSIVE QFNLPIRLTP NQNLIFYDIL
PEDKEAIQEI LDKCGIIVDP TQIAALTRYA MACPALPTCG LAITESERAI PRILERIRAL
LDKLGLQKDH FVVRMTGCPN GCARPYMAEL GFVGSAPESY QLWLGGSPDQ TRLAQPIIEK
LHDNDIESFL EPIFVLFKKS RKGKESFGDF CDRTGFDAIR EFSANYTPGE PTSSGKSRHR
VSLRDDVYLQ LKETAEKQDR PMTELVHEAL DKYFQSL