Gene Ava_4996 details


Gene Information

Locus tag: Ava_4996
Symbol:
ID: 3679048
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 6279595
End bp: 6280638
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 47%
IMG OID: 637720356
Product: coproporphyrinogen III oxidase
Protein accession: YP_325488
Protein GI: 75911192
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0408] Coproporphyrinogen III oxidase
TIGRFAM ID:
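
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(6279595, 6280638)
print(length, protein_length(length))  # 1044 347
```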


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.000411147
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.314593
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTCGTC ATTCCGATAA TTCTCTGCAA GAATCTCCAA ATCACACAGA ATTGCTGATA 
TCACCTACCA ATACAATACC CAAGGATTCA CGGCAGCGAT CGCAGCAATT CATGCAGAAC
TTACAGGATG AGATTTGCAC AGCGTTAGAG CAAATTGACG GGAAAGCAAG CTTTCACCAA
GACTATTGGG AGCGAGCAGA AGGGGGAGAA GGACGTACCC GCGTGATTCG AGAAGGACGG
GTGTTTGAAC AAGGTGGCGT GAACTTTTCC GCAGTGTGGG GAAATAGCCT ACCTGCTTCA
ATTTTGGCGC AACGTCCAGA AGCAGCCGGA CATGAGTTTT TTGCCACAGG AACTTCAATG
GTGTTGCATC CTCGCAATCC TTATGTACCA ACAGTACATC TCAACTATCG CTACTTTGAA
GCTGGCCCTA TTTGGTGGTT TGGTGGTGGG GCGGATTTAA CCCCATACTA CGCCTTTGAA
GAGGATGCGG TTCACTTTCA TCAGACGCTA AAAAATGCTT GTGATGTCCA TAATCCAGAG
TATTATCCAG CATTTAAACG CTGGTGTGAT GAATACTTCT ATTTACGACA TCGCCAAGAA
CAGCGAGGTA TTGGCGGTAT TTTCTTCGAC TATCAAGATG CTAGCGGTAA GCTTTACATT
GGCACTCAAG CAGATAGTCC AGCCGCAATT TACAGCCAGA AAGTGGGAAA TGTGACGCGG
AATTGGGAGG ATATTTTTGC GTTTGTCCAG TCTTGCGGTC AGGCTTTCTT ACCTGCTTAC
TTGCCTATTG TAGAACGCAG GCAAGCAACT GAGTACGGCG ATCGCCAACG TAATTTCCAA
CTATACCGCC GTGGTCGTTA TGTTGAATTT AATTTAGTTT ACGACCGGGG AACTGTGTTT
GGCTTGCAAA CTAAGGGACG GACAGAATCG ATTCTCATGT CTTTACCACC CTTAGCACGT
TGGGAATATT GCTACGAACC GAAAGCTGGA AGCCCAGAAG CGGAACTAAC AGAAGTTTTT
CTCCAGCCTA GAGATTGGGC GTAG
 
Protein sequence
MGRHSDNSLQ ESPNHTELLI SPTNTIPKDS RQRSQQFMQN LQDEICTALE QIDGKASFHQ 
DYWERAEGGE GRTRVIREGR VFEQGGVNFS AVWGNSLPAS ILAQRPEAAG HEFFATGTSM
VLHPRNPYVP TVHLNYRYFE AGPIWWFGGG ADLTPYYAFE EDAVHFHQTL KNACDVHNPE
YYPAFKRWCD EYFYLRHRQE QRGIGGIFFD YQDASGKLYI GTQADSPAAI YSQKVGNVTR
NWEDIFAFVQ SCGQAFLPAY LPIVERRQAT EYGDRQRNFQ LYRRGRYVEF NLVYDRGTVF
GLQTKGRTES ILMSLPPLAR WEYCYEPKAG SPEAELTEVF LQPRDWA
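
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The sketch below illustrates this on the first and last blocks of the gene sequence. It is an assumption-laden illustration, not the pipeline used to produce the record: it uses the standard amino acid assignments (which table 11 shares with table 1, differing only in permitted start codons), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table built from the NCBI ordering of bases (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace is ignored so the 10-base blocks of the record can be pasted as-is.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGGTCGTC ATTCCGATAA TTCTCTGCAA"))  # MGRHSDNSLQ
print(translate("GATTGGGC GTAG"))  # DWA
```

The first call reproduces the opening ten residues of the protein sequence, and the second reproduces its final three residues followed by the TAG stop codon.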