Gene Ava_1303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1303 
Symbol 
ID3683046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1603011 
End bp1604342 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content43% 
IMG OID637716641 
ProductRieske (2Fe-2S) region 
Protein accessionYP_321822 
Protein GI75907526 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.418357 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACCTG AATTCAATTT TTTTCAGCAC TGGTATCCTG TCTCACCCAT AGAAGACTTA 
GACCCCAAAC GGCCAACCCC TGTTACTCTT TTAGGAATTC GTTTAGTTAT CTGGAAACCA
AAATCATCAG ATAGTTTTCG GATATTTATA GATCAATGTC CTCACCGTCT CGCACCCTTA
AGTGAAGGGC GTATTGACGA TAAAACCGGT AACTTAATGT GCAGTTATCA CGGCTGGCAA
TTTGATACTG AGGGTGTTTG TACCCGCATT CCTCAAGCTG AGAATCCAGA AATAGTTACT
AAAAACCAAA AGGATTTTTG TGTTAATTCC TTACCAGTAC GCCAAGAGAA TGATTTACTC
TGGGTTTGGC CTGATGTGAA CTCTCAGGAA TTAGCCCATC AAACACCATT ACCTCTGTCA
CCGCAAGTAG ACGCTAGCAA AGGTTTTGTG TGGACTTCTT TCGTGCGCGA TTTAGAATAC
GACTGGCAAA CTCTAGTAGA AAATGTCGCA GATCCTAGTC ATGTTCCTTT CGCCCATCAT
GGAGTGCAGG GTAATCGAGA ACAAGGCAGG GCTAGACCTA TTAGTATTAT GCAATCAACG
CCAGGTTTGA TTGAGGCTGT CGCAGATGGA CTAGTCACAG CCAAGATTAC TTTTGAACCG
CCTTGTCGTT TAGAGTACGC CATTAGCGTT GGTAATGATG GAAAACAGTT AGGACTTGTC
ACTTATTGCC TTCCTGTATC TCCTGGCAAA TCAAGAATTG TCGCTCAGTT TCCTCGCAAT
TTTGCTAAAA CACTTCATCG TCTTACACCC AGATGGTGGG ATCACATCAA AACACGTAAT
TTAGTACTTG ATGGGGATAT GATTTTGCTC AATCAGCAAG AGCATTTGAT CCAACAAAGA
CAATTATCAG CCAGTTGGAA AACAGCTTAC AAGATGCCAA CAAGCGCCGA TCGCCTGGTA
ATTGAATTTC GCAAGTGGTT TGATAAATAC TGTGACGGGG GATTACCCTG GAAAGAAGTG
GGAATCCCCA TTCCTGAAGC TGCCGCAATC AACGACAATC GGGATGTCTT GTTAGACAGA
TACAAACAAC ACACCCGGCA TTGCAGTAGC TGTCGTAATA CCCTGAAGAA TATCGAGCGA
CTACAACTGA TATTACTAGG CTATTTCGCC CTTGTTATTT CTGGGGTTGC CGTTCTACCA
GACTCCTTAA GAGTCCGGCT AGGTTTACCC TTAATAATTA CAGCAATCTT AGGATTGGGA
CTTTACTCCT GGCTAAAATT TTGGCTGGTG CCGAAATTTT ACTTTGTAGA TTATGTCCAC
GCCGAAAAAT AA
 
Protein sequence
MQPEFNFFQH WYPVSPIEDL DPKRPTPVTL LGIRLVIWKP KSSDSFRIFI DQCPHRLAPL 
SEGRIDDKTG NLMCSYHGWQ FDTEGVCTRI PQAENPEIVT KNQKDFCVNS LPVRQENDLL
WVWPDVNSQE LAHQTPLPLS PQVDASKGFV WTSFVRDLEY DWQTLVENVA DPSHVPFAHH
GVQGNREQGR ARPISIMQST PGLIEAVADG LVTAKITFEP PCRLEYAISV GNDGKQLGLV
TYCLPVSPGK SRIVAQFPRN FAKTLHRLTP RWWDHIKTRN LVLDGDMILL NQQEHLIQQR
QLSASWKTAY KMPTSADRLV IEFRKWFDKY CDGGLPWKEV GIPIPEAAAI NDNRDVLLDR
YKQHTRHCSS CRNTLKNIER LQLILLGYFA LVISGVAVLP DSLRVRLGLP LIITAILGLG
LYSWLKFWLV PKFYFVDYVH AEK