Gene Aazo_4269 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4269 
Symbol 
ID9342073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4341107 
End bp4342312 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content42% 
IMG OID 
Productoxygen-independent coproporphyrinogen III oxidase 
Protein accessionYP_003722766 
Protein GI298492589 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.281837 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCAAA AAGATATTGC TGTTCAAGTT CCTAGTGCGG CTTACGTACA TATCCCCTTT 
TGTCGGCGGC GGTGTTTTTA TTGTGACTTT CCGGTGTTTG TGGTGGGCGA TACCTACGGC
GAGCCAAGCT ACCGCTCACA GGGTAAAACA TCTGGTACAA TTTCCCAATA TGTTGACGCA
CTGTGTCACG AAATCAGCAT CTCACTAGCT TTTAGTCAAC CAGTAACAAC TATTTTCTTT
GGTGGTGGTA CTCCTTCGCT GTTATCGACA GAACAGTTGC AATGTATATT AACAGCGTTA
GAGAAGCGTT TTGGCATTGC GGCTGGCGTG GAAATTTCCA TGGAAATGGA CCCCGGTACT
TATGATTTGG CACAGATTGC AGGTTATTGC AGTACAGGTG TCAACCGGGT AAGTTTGGGT
GTACAAGCCT TTCAAGATGA ATTACTAACA GTTGCTGGGC GATCGCACTC AGTTAATGAT
ATCTTTGCAG CTATTGATTT AATCAACCAA GTCGAGATAC CCCAATTTAG CTTAGACCTA
ATTTCTGGGT TGCCACATCA GTCTTTAGTT CAGTGGGAAG ATTCCCTAAC TAAAGCGGTA
GAAGTTGCCC CCACTCATAT ATCTATCTAT GATTTAACCA TTGAACCAGG GACAGCTTTT
GGTCGTTATT ACAAACCGGG AGATAATCCC CTACCGACAG ATGAAACCAC TGTCACAATG
TACCAACTAG GGCAAAAAGT CTTAACTGGC GCAGGTTATG AACATTATGA AATTTCCAAC
TATGCTAAAA GCGGACATCA ATGTAAACAT AATCGAGTTT ATTGGGAAAA TCGCTCTTAT
TATGGTTTTG GTATGGGTGC AGCCAGTTAT GTGCATGGTA AACGCTTCAC TCGTCCTCGG
AAAACTAAAG AATATTACGA ATGGTTGCAA AATGGTGCAT TGATTGATTG TGAAGTCACA
CCTTTAGAGG ATGAATTGTT AGAAACTTTA ATGCTGGGGT TGCGGTTAGC AGAAGGTTTG
AGTTTGACGG TGTTGGTGGA GAAGTTTGGA AAAGAAAAGG TTGAGGAAAT TACACAATGT
TTGCAACCTT ATTTTAAGCA GGGTTGGGTG GAAGTTGTGG AGGAAAGGTT GCGTTTAACT
GATCCTGATG GGTTTTTGTT TTCTAATATG GTGTTGGCAC ATTTGTTTGA GAAGTTGGGG
GAATAA
 
Protein sequence
MSQKDIAVQV PSAAYVHIPF CRRRCFYCDF PVFVVGDTYG EPSYRSQGKT SGTISQYVDA 
LCHEISISLA FSQPVTTIFF GGGTPSLLST EQLQCILTAL EKRFGIAAGV EISMEMDPGT
YDLAQIAGYC STGVNRVSLG VQAFQDELLT VAGRSHSVND IFAAIDLINQ VEIPQFSLDL
ISGLPHQSLV QWEDSLTKAV EVAPTHISIY DLTIEPGTAF GRYYKPGDNP LPTDETTVTM
YQLGQKVLTG AGYEHYEISN YAKSGHQCKH NRVYWENRSY YGFGMGAASY VHGKRFTRPR
KTKEYYEWLQ NGALIDCEVT PLEDELLETL MLGLRLAEGL SLTVLVEKFG KEKVEEITQC
LQPYFKQGWV EVVEERLRLT DPDGFLFSNM VLAHLFEKLG E