Gene Ava_4637 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4637 
Symbol 
ID3680006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5799809 
End bp5801179 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content47% 
IMG OID637719992 
Productthioredoxin reductase 
Protein accessionYP_325129 
Protein GI75910833 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0492] Thioredoxin reductase
[COG3118] Thioredoxin domain-containing protein 
TIGRFAM ID[TIGR01292] thioredoxin-disulfide reductase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000231293 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.140967 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAACC CAACTGTAGA AAACTTAGTC ATTATTGGTT CTGGGCCAGC AGGGTACACG 
GCTGCTATCT ATGCGGCGAG AGCTAACCTA AAACCCGTTG TATTTGAAGG TTTTCAAGCT
GGGGGTTTGC CTGGTGGGCA ACTTATGACA ACGACTGAGG TAGAAAACTT TCCAGGGTTT
CCCCAAGGGA TTACCGGGCC GGATTTAATG GATAGGATGA AGGCTCAAGC AGAACGCTGG
GGGGCTGAGT TATATACTGA AGATGTTATA TCAGTTGACT TGAGCCAACG TCCATTTACT
GTGCGCTCAG AGGAAAGAGA ATTTAAAGCA CACAGTATTA TTATTGCCAC TGGTGCGACG
GCAAAACGTT TAGGTTTACC TAGCGAGCAT CAATTCTGGA GTCGGGGGAT TTCGGCTTGT
GCAATTTGTG ATGGTGCAAC CCCAATTTTC CACGGTGCAG AGTTAGCTGT GATTGGTGCT
GGTGACTCGG CGGCGGAAGA GTCCATATAT CTCACCAAGT ACGGCTCGAA GGTTAATTTG
TTGGTGCGTT CTGAAAAGAT GCGGGCTTCT AAAGCTATGC AAGACCGCGT TTTGAGTAAC
CCCAAAATCC AAGTGCATTG GAACACAGAA GTTGTGGATG TGTTTGGTAA TGGTCACATG
GATGGGGTGA AAGTCCGCAA TAATAAGACT GGGGAAGAAA CCACAGTACA CGCCAGGGGT
TTGTTCTACG CTATTGGTCA CAAGCCCAAC ACTTCCTTAT TTCAGGGACA ACTAGAACTA
GATGAAATTG GTTATGTTGT TACCAAACAT GGTTCGCCAG AAACTAGTGT AGAGGGTGTG
TTCGCGGCGG GTGACGTACA AGACCATGAG TATCGTCAAG CAATTACGGC GGCTGGTAGT
GGCTGCGCGG CGGCGCTGTT AGCGGAACGT TGGTTGTCTG CGAATGCGTT GATTCAAGAG
TTCCATCAAG AACCAACAAT CAATAATGAG TTAGAAACTC AGCCAGTAGC GCAGAAAACA
GAAGCGGAAC AAGAGGCGGG ATTTGCTTTG AGCGCAACTC GCCATGCTGG TGGCTATGCT
TTACGAAAAT TATTTCATGA AAGCGATCGC CTACTCATTG TCAAATACGT CTCCCCTGGC
TGTGGCCCTT GCCATACTCT CAAGCCAATC TTAAATAAAG TAGTCGATGA ATTTGACGGC
AAAATCCACT TTGTGGAAAT CGACATTGAC CAAGACCGGG ATATTGCGGA AAATGCTGGG
GTAACCGGCA CACCAACTGT TCAGTTCTTT AAGGATAAAG AACTGGTGAA AGAAGTTAAG
GGTGTTAAGC AAAAAAGTGA GTATCGTCAG TTGATTGAAG CTAATCTCTA G
 
Protein sequence
MSNPTVENLV IIGSGPAGYT AAIYAARANL KPVVFEGFQA GGLPGGQLMT TTEVENFPGF 
PQGITGPDLM DRMKAQAERW GAELYTEDVI SVDLSQRPFT VRSEEREFKA HSIIIATGAT
AKRLGLPSEH QFWSRGISAC AICDGATPIF HGAELAVIGA GDSAAEESIY LTKYGSKVNL
LVRSEKMRAS KAMQDRVLSN PKIQVHWNTE VVDVFGNGHM DGVKVRNNKT GEETTVHARG
LFYAIGHKPN TSLFQGQLEL DEIGYVVTKH GSPETSVEGV FAAGDVQDHE YRQAITAAGS
GCAAALLAER WLSANALIQE FHQEPTINNE LETQPVAQKT EAEQEAGFAL SATRHAGGYA
LRKLFHESDR LLIVKYVSPG CGPCHTLKPI LNKVVDEFDG KIHFVEIDID QDRDIAENAG
VTGTPTVQFF KDKELVKEVK GVKQKSEYRQ LIEANL