Gene Ava_2223 details


Gene Information

Locus tag: Ava_2223
Symbol:
ID: 3679472
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 2764048
End bp: 2765238
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 43%
IMG OID: 637717565
Product: coproporphyrinogen III oxidase
Protein accession: YP_322736
Protein GI: 75908440
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID: [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase
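
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only a consistency check, not part of the record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the untranslated stop codon. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 2764048
end_bp = 2765238
reported_gene_length = 1191    # bp
reported_protein_length = 396  # aa

# Assumption: Start bp and End bp are 1-based and both inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1191 0 396
```

Under those two assumptions the computed values agree with the reported Gene Length and Protein Length.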


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAAAG TTAATGTTCC TAGCCTCGCT AGTTCTGCTT ATGTACATAT TCCCTTTTGC 
AGAAGGCGGT GCTTTTATTG TGACTTCCCG ATATTTGTTG TCGGCGATCG CCAACGTGGT
GAAACATCTG TTACCATCTC CGGGTATGTG GATACTCTCT GTGAAGAAAT TGCCATTACA
CCAGTTTTTG GTCAACCCCT CCAGACCATA TTTTTCGGCG GTGGTACGCC TTCGCTGTTA
TCTACAGCAC AATTAGCACA AATACTAGGA ACCCTAGATA AGCGGTTTGG GATTGTGCCT
GATGCAGAAA TTTCGATGGA AATTGATCCA GGTACATTTG ATCTAGCACA CATCCAAGGC
TATCGTAGCG TTGGTGTGAA TCGGGTAAGT TTGGGTGTAC AAGCGTTTCA AGAAGAATTG
TTAAAATTAG CTGGGCGATC GCACTCGCTT AAAGATATCT TTGCAGCTAT AGACTTGATT
CACCAAGTCG AGATTCCCGA ATTTAGCATA GACTTAATCT CTGGGTTGCC GCATCAGTCT
TTAGATCAAT GGCAAGATTC CCTAGAAACA GCCGTTAACA TTGCACCCAC CCATATATCT
ATCTATGATT TGACAATTGA GCCAGGAACA GCCTTTGGTC GTTATTACAA ACCTGGGGAT
AATCCCTTAC CTACTGATGA AACCACAGTG AAAATGTATC AGATGGGTCA GAGGATTTTG
ACTGATGGCG GTTATGAACA TTATGAAATT TCCAACTACG CCAAGCCAGG ACACCAGTGT
AGACATAACC GCGTGTATTG GGAAAACTGC CCTTATTACG GCTTTGGTAT GGGTGCGGCC
AGCTATGTAG AAGGTAAACG CTTCACCCGT CCCCGAAAAA CAAAGGAATA TTCTCAATGG
GTGCAAGAGT TGATTGCTAA TCACGGTGTG ATTGATTGGG AAATTACGCC AAAAGCAGAT
GTATTGTTAG AAACGTTAAT GTTGGGGTTA CGCTTGGCTG ATGGTGTGAG TTTAGCAGCA
CTAACCGAGG AATTTGGCAA GGAGAAAATA CAAGAACTGC ATCAATGTCT GCAACCTTAT
TTTACTCAAG GTTGGGTACA AGTTGTGGGT GATAGGTTGC GGTTGAGTGA TCCCGATGGT
TTTTTATTTT CCAATGTGGT GTTAGCCGAT TTGTTTAGTC AGTTGGGTTA A
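
The GC content reported in the gene information is the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows; `gc_percent` is a hypothetical helper, and only the first row of the sequence is used for brevity, so the printed value is not expected to match the figure reported for the whole gene. Pasting in the full sequence would be needed to reproduce that figure.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping and line breaks used in the listing)
    is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence only (60 of the 1191 bases).
first_row = "ATGCAAAAAG TTAATGTTCC TAGCCTCGCT AGTTCTGCTT ATGTACATAT TCCCTTTTGC"
print(f"{gc_percent(first_row):.1f}%")
```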
 
Protein sequence
MQKVNVPSLA SSAYVHIPFC RRRCFYCDFP IFVVGDRQRG ETSVTISGYV DTLCEEIAIT 
PVFGQPLQTI FFGGGTPSLL STAQLAQILG TLDKRFGIVP DAEISMEIDP GTFDLAHIQG
YRSVGVNRVS LGVQAFQEEL LKLAGRSHSL KDIFAAIDLI HQVEIPEFSI DLISGLPHQS
LDQWQDSLET AVNIAPTHIS IYDLTIEPGT AFGRYYKPGD NPLPTDETTV KMYQMGQRIL
TDGGYEHYEI SNYAKPGHQC RHNRVYWENC PYYGFGMGAA SYVEGKRFTR PRKTKEYSQW
VQELIANHGV IDWEITPKAD VLLETLMLGL RLADGVSLAA LTEEFGKEKI QELHQCLQPY
FTQGWVQVVG DRLRLSDPDG FLFSNVVLAD LFSQLG
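
The protein sequence can be compared against a translation of the gene sequence. The sketch below assumes that translation table 11 assigns the same amino acids to internal codons as the standard genetic code (it differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative, and only the first and last rows of the gene sequence are translated.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    # Ignore any incomplete codon at the end of the input.
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence. The last row starts at base 1141,
# so it is still in frame.
first_row = "ATGCAAAAAG TTAATGTTCC TAGCCTCGCT AGTTCTGCTT ATGTACATAT TCCCTTTTGC"
last_row = "TTTTTATTTT CCAATGTGGT GTTAGCCGAT TTGTTTAGTC AGTTGGGTTA A"

print(translate(first_row))  # MQKVNVPSLASSAYVHIPFC
print(translate(last_row))   # FLFSNVVLADLFSQLG
```

Both outputs agree with the start and end of the protein sequence listed above.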