Gene Ava_1056 details


Gene Information

Locus tag: Ava_1056
Symbol:
ID: 3678608
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 1285982
End bp: 1287481
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 43%
IMG OID: 637716392
Product: Rieske (2Fe-2S) region
Protein accession: YP_321575
Protein GI: 75907279
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
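
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes 1-based inclusive coordinates, which matches the reported values, and assumes the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Length in aa of the product, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(1285982, 1287481)
print(length_bp)                  # 1500
print(protein_length(length_bp))  # 499
```

Both results agree with the Gene Length and Protein Length fields reported above.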


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.00144994
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000116875
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGACTACTG AAACCAGGTT GCAAGGGAAC TCCCAAAACG AAAATTTATT AGACAACGCA 
ACTAACGAAC AATCCTCAAA AGAAGAAAAT ACCTTCCAAT GGACAAAACA ATGGTATCCG
CTAGCTGTGG TAGAGTTCCT TGATCCTAGC CGTCCTCATG CTATGCAATT ATTAGGTAAA
GATATTGTTT TGTGGCGGGA TGGTTCTAGT CAATGGCGCT GCTTTGAAGA TTTTTGTCCC
CATAGACTTG CACCACTTTC AGAAGGTCGA GTCGAAGCAG ACGGTACACT TTTATGTGCT
TACCATGCTT GGCGTTTTGA TGCTCAAGGG AATTGTGTAA GTATACCCCA GTCTAAAGAT
GAAAAAACGG CGGCGAAGAA CTGTGAAAGT CAAAAATCCT GTGCAGTAGT TTATCCGACG
CAGGAACGCC AGGGGTTACT GTGGGTATGG GCGGAAGCAG GAGAACAAGC CAAGGTAGAA
AGCCAATTGC AGACACCGCG AATTGTTCCA GAACTAGAAG ATAACTCAGG TAAAGTGATA
AAATCACCTT GGAATTTCCG TGATTTGCCC TATGGGTGGG ACTACTTTAT GGAAAATGTC
TCAGACCCTG CTCATGTACC TGTTTCCCAT CACGGTATCA TAGGCGATCG CTACAAGGAT
GCCAAATTCT ACGATATGAT TCCTGTACGC CCCATATCTA CCCAAGATGG GTTTGCCTTT
GAAATTCAGC CCACACAAGG CAAAACAGTA CAAGGAATTC ACGATTTTCA ACCACCTTGT
CACATGAGAA TAGTTTCTAC CTCTGAGGAT GGCGGACAGT TAATTTTGGC TTTGTATGCT
ACGCCAACTC GTCCCGGTTG GTGTCGCCAC ATTGGTTGTC AAGTTTTTGT CAAAAATCCC
CAAGGAAAGA AACCCCAAGG ATTATCTTTC TTTGGACTAC CATTACCTGT TTGGTTGGTT
CATGTATTAG CATCCTTATT TCTGCACCAA GATATGGTAT TTCTGCATTA CCAAGAAAAA
ATTATTGCCC AGAAAAAAAA CGGTAAATGG CTGAACGCTG TATATACACC AAATCCTCAA
GATAAGATGG TGATTACATT GCGTCAGTGG TTGAAAAACC GAGCTGGTGG TGGTATACCT
TGGGCGGAAG GATATAGCAG CGATATACCT CCAGCCGAAA AAGACAAGCA GAAGCTATTT
GATGTCTGGA CAACCCACAC TCAACATTGC ACAGTTTGCC AAGATGCGCT AAAAAACATC
AATCGTCTGA CTGTACTAGC TTATATATCT GCGGCTATCT GCTTGTTTTT AGCTGTGATT
TTAGATGCAA GAACGGTGGC AATGCAAGCG GCTTTAGGTG CATCTATATT CACATTACCT
CCTGTAGGAT TTTGGTTAGC ACTGGGCGGC GCTATTTTGT TGGCGGTAGT TGGATATCAA
CTCAAAAGAT TTAGTCGGCT ATTTTATGTA TACGAATTTG AACACGCTCG TAATGATTAA
 
Protein sequence
MTTETRLQGN SQNENLLDNA TNEQSSKEEN TFQWTKQWYP LAVVEFLDPS RPHAMQLLGK 
DIVLWRDGSS QWRCFEDFCP HRLAPLSEGR VEADGTLLCA YHAWRFDAQG NCVSIPQSKD
EKTAAKNCES QKSCAVVYPT QERQGLLWVW AEAGEQAKVE SQLQTPRIVP ELEDNSGKVI
KSPWNFRDLP YGWDYFMENV SDPAHVPVSH HGIIGDRYKD AKFYDMIPVR PISTQDGFAF
EIQPTQGKTV QGIHDFQPPC HMRIVSTSED GGQLILALYA TPTRPGWCRH IGCQVFVKNP
QGKKPQGLSF FGLPLPVWLV HVLASLFLHQ DMVFLHYQEK IIAQKKNGKW LNAVYTPNPQ
DKMVITLRQW LKNRAGGGIP WAEGYSSDIP PAEKDKQKLF DVWTTHTQHC TVCQDALKNI
NRLTVLAYIS AAICLFLAVI LDARTVAMQA ALGASIFTLP PVGFWLALGG AILLAVVGYQ
LKRFSRLFYV YEFEHARND
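
The gene and protein sequences above are printed in space-separated blocks of 10, so they need to be cleaned before any comparison. The following is a minimal sketch of how the gene sequence could be translated and its GC fraction computed. It uses the standard codon assignments, which translation table 11 shares for all 64 codons (table 11 differs only in the start codons it permits). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the sequence starts with ATG, so no special handling of
    alternative start codons is needed.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGACTACTG AAACCAGGTT GCAAGGGAAC"
print(translate(first_blocks))  # MTTETRLQGN
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the reported GC content.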