Gene Aazo_3386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3386 
Symbol 
ID9341191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3450164 
End bp3451546 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content37% 
IMG OID 
Productoxygen-independent coproporphyrinogen III oxidase 
Protein accessionYP_003722162 
Protein GI298491985 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00479262 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCTTTC TACTACCAGG TGTGAAATTT GATCTGGATC TGATTAAAAA GTATGATACC 
CGCGCACCCA GATACACCAG CTACCCACCC GCTACAGAGT TAACTGAAAC GTTCACAGCA
GCAGATTTCC AACCCTCTAT TGCTGCTTCT AATGAACGTC AAAGTGCCCT ATCTTTGTAT
TTCCATATTC CATTTTGTCA GACTGCTTGT TATTTCTGTG GCTGTAATAC TGTAATTTCT
AATAACAAGA ATATTGCCAA AGCATATCTA CACAATTTGG TACAAGAAAT TCACAACACC
GCAGAGTTGA TTGACACAAG TAGAAAGGTG CTTCAGGTAC ATTGGGGTGG TGGAACACCT
AATTATTTGG AACTCGACCA AGTTGAATTT TTGTGGAATA AGATTAATCG TTATTTCACT
ATTGATTCAT CAGCAGAGGT TTCAATTGAA ATTAACCCCC GTTATGTCAA TAAGGAATAC
ATTCAGTTTC TGAGAGATAT TGGCTTTAAT CGGATTAGTT TTGGTATCCA AGACCTTAAC
CCGAAAGTAC AAGCAGCGGT AAACCGTATC CAACCGGAAA AAATGCTGTT TGATGCTATG
GGTTGGATTA AGGAGGCTAA TTTTAGCAGT GTCAATGTAG ACTTAATTTA TGGTTTGCCA
TACCAAACTC TGCATACTTT TCAGGAAACG GTAGAAAAGA CAATCATTCT AGACCCTGAT
CGAATTGTGG TGTTTAATTT TGCTTATGTA CCTTGGATGA AACCTGTGCA GAAGAGGATT
TCCCAAGATA CATTACCCGC AGCACAGGAA AAGTTAGATA TTCTGAAGAT GACCATTGAG
GAGTTGACAA ATAACGAGTA TTTGTTTATT GGCATGGATC ATTTTGCGAA AACTAATAAT
GAATTAGCGA TCGCTCAACG TAATGGTACT CTAAAACGCA ACTTCCAAGG CTATACTACC
CACGCAGAAA CAGAACTTTT TGGGTTTGGT TCTACATCTA TCAGTATGCT AGAAGATGCT
TATGCTCAGA ACCATAAGAG TTTAAAGGAC TATTATCAGG CTGTATCAGC AGGTGTTATT
CCTACCAGTA AAGGCATTAA ATTAACCCAA AATGATATCA TCAGAAGGGA TGTCATTATG
TCAATCATGT CTCACTTTCA GCTACATAAG TCAGACATTG AAGATAAATA TCACATCAAT
TTTGATGAAT ATTTCTCTCA GGAACTAGAA GAGTTAAAAC CCCTAGAAGG TGATGGACTA
GTAAATTTAT TTACCAACCA AATCGAAATT ACAGATATTG GTAGATTACT GGTCAGAAAT
ATTGCAGTTA ATTTCGATAC TCATACCAGA ACTAGAGAAA GAAAATTCTC TCGTGCAATT
TAA
 
Protein sequence
MVFLLPGVKF DLDLIKKYDT RAPRYTSYPP ATELTETFTA ADFQPSIAAS NERQSALSLY 
FHIPFCQTAC YFCGCNTVIS NNKNIAKAYL HNLVQEIHNT AELIDTSRKV LQVHWGGGTP
NYLELDQVEF LWNKINRYFT IDSSAEVSIE INPRYVNKEY IQFLRDIGFN RISFGIQDLN
PKVQAAVNRI QPEKMLFDAM GWIKEANFSS VNVDLIYGLP YQTLHTFQET VEKTIILDPD
RIVVFNFAYV PWMKPVQKRI SQDTLPAAQE KLDILKMTIE ELTNNEYLFI GMDHFAKTNN
ELAIAQRNGT LKRNFQGYTT HAETELFGFG STSISMLEDA YAQNHKSLKD YYQAVSAGVI
PTSKGIKLTQ NDIIRRDVIM SIMSHFQLHK SDIEDKYHIN FDEYFSQELE ELKPLEGDGL
VNLFTNQIEI TDIGRLLVRN IAVNFDTHTR TRERKFSRAI