Gene Ava_2406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_2406 
SymbolpsaB 
ID3683205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp2993074 
End bp2995299 
Gene Length2226 bp 
Protein Length741 aa 
Translation table11 
GC content49% 
IMG OID637717751 
Productphotosystem I P700 chlorophyll a apoprotein A2 
Protein accessionYP_322918 
Protein GI75908622 
COG category 
COG ID 
TIGRFAM ID[TIGR01336] photosystem I core protein PsaB 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.185378 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000443533 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAACAA AATTTCCAAA ATTTAGCCAG GATCTAGCAC AGGACCCAAC TACCCGTCGC 
ATTTGGTATG CGATGGCTAT GGGCAACGAC TTTGAAAGCC ACGATGGCAT GACCGAAGAA
AATCTTTACC AAAAGATTTT CGCTACTCAC TTCGGTCACC TGGCAATCAT TTTCTTATGG
GCTTCTAGCC TCCTGTTCCA TGTAGCCTGG CAAGGTAACT TTGAACAGTG GATTAAAGAT
CCTCTACACG TCCGCCCAAT TGCTCACGCG ATTTGGGACC CCCACTTCGG TAAACCAGCT
ATTGAAGCTT TTACCCAAGC TGGTGCTAAC GGCCCAGTAA ACATTGCTTA CTCTGGTGTT
TACCACTGGT GGTACACCAT CGGTATGCGG ACAAACACCG AACTATATAC AGGTTCAGTC
TTCCTGTTGT TGTTCGCGTC CTTGTTCTTG TTTGCTGGTT GGTTGCATTT ACAACCCAAG
TTCCGTCCTA GCCTAGCTTG GTTTAAGAGT GCTGAATCTC GTCTGAACCA CCACTTAGCA
GGTTTGTTCG GCGTTAGCTC TCTAGCTTGG GCCGGTCACT TGATTCACGT TGCTATCCCC
GAATCTCGCG GACAGCACGT AGGTTGGGAT AACTTCTTAA CCACAGCGCC CCACCCAGCA
GGCTTACAAC CCTTCTTCAC AGGCAACTGG GGTGTTTACG CTCAAAACCC TGATACAGCA
GGTCACATTT TCAGTACCTC TCAAGGTTCT GGTACAGCAA TTCTGACCTT TTTGGGTGGT
TTCCATCCTC AAACAGAATC CCTGTGGTTG ACAGACATAG CTCATCACCA CTTGGCGATC
GCTGTACTAT TCATTGTTGC TGGTCATATG TACCGTACCA ACTTCGGTAT TGGTCACAGC
ATCAAAGAAA TGATGAATGC CAAAACTTTC TTTGGCAAAC CTGTTGAAGG TCCCTTCAAT
ATGCCTCACC AAGGCATTTA TGACACCTAC AACAACTCTC TGCACTTCCA ATTAGGTTGG
CACCTAGCTT GTTTGGGTGT TGTTACCTCT TGGGTGGCGC AGCATATGTA CTCTCTGCCT
TCCTACGCAT TCATTGCTAA GGACTACACA ACACAGGCAG CACTGTACAC CCACCACCAA
TACATAGCCA TTTTCTTAAT GGTTGGTGCT TTTGCTCACG GTGCTATCTT CTTAGTCCGT
GACTACGATC CTGAACAAAA CAAAGGTAAC GTGCTTGAGC GTGTGCTACA GCACAAAGAA
GCGATTATCT CCCACCTCAG CTGGGTATCG CTATTCTTAG GCTTCCACAC CTTAGGCTTA
TACGTCCACA ACGACGTAGT AGTTGCTTTC GGAACACCTG AAAAGCAAAT CTTGATTGAG
CCAGTATTTG CTCAGTTCAT TCAAGCTGCT CACGGTAAGG TACTCTACGG CTTAGATACA
TTGCTGTCTA ACCCCGATAG CGTTGCCTAC ACAGCCTATC CTAACTACGC CAACGTTTGG
CTACCAGGCT GGTTAGATGC CATTAACTCT GGTACTAACT CCCTGTTCTT AACAATTGGC
CCTGGCGACT TCTTGGTACA CCATGCGATC GCCTTAGGTC TGCACACCAC CACCCTCATC
CTAGTCAAAG GTGCTTTGGA TGCTCGTGGT TCCAAGCTGA TGCCGGATAA AAAGGACTTC
GGCTATGCCT TCCCTTGCGA CGGTCCAGGC CGTGGCGGTA CTTGCGACAT CTCCGCTTGG
GACTCCTTCT ACCTATCTTT ATTCTGGGCG TTAAATACAG TAGGTTGGGT AACATTCTAC
TGGCACTGGA AACATTTAGG TATTTGGCAA GGTAACGTTG CTCAGTTCAA CGAAAATTCC
ACCTACCTCA TGGGCTGGTT CCGTGACTAC CTCTGGGCTA ACTCTGCTCA GTTGATCAAC
GGTTACAACC CCTACGGTGT GAACAACCTG TCTGTCTGGG CGTGGATGTT CCTCTTCGGA
CACCTAGTTT GGGCTACTGG CTTCATGTTC CTCATCTCCT GGAGAGGTTA CTGGCAAGAG
TTGATCGAAA CCCTAGTTTG GGCGCACGAA CGTACTCCTA TCGCTAACCT AATTCGCTGG
AAAGACAAGC CAGTTGCTCT CTCCATCGTT CAAGCTCGTG TAGTTGGTCT AGCTCACTTC
ACCGTCGGCT ATGTCCTCAC CTACGCAGCC TTCCTCATCG CCTCCACTGC TGGTAAGTTC
GGTTGA
 
Protein sequence
MATKFPKFSQ DLAQDPTTRR IWYAMAMGND FESHDGMTEE NLYQKIFATH FGHLAIIFLW 
ASSLLFHVAW QGNFEQWIKD PLHVRPIAHA IWDPHFGKPA IEAFTQAGAN GPVNIAYSGV
YHWWYTIGMR TNTELYTGSV FLLLFASLFL FAGWLHLQPK FRPSLAWFKS AESRLNHHLA
GLFGVSSLAW AGHLIHVAIP ESRGQHVGWD NFLTTAPHPA GLQPFFTGNW GVYAQNPDTA
GHIFSTSQGS GTAILTFLGG FHPQTESLWL TDIAHHHLAI AVLFIVAGHM YRTNFGIGHS
IKEMMNAKTF FGKPVEGPFN MPHQGIYDTY NNSLHFQLGW HLACLGVVTS WVAQHMYSLP
SYAFIAKDYT TQAALYTHHQ YIAIFLMVGA FAHGAIFLVR DYDPEQNKGN VLERVLQHKE
AIISHLSWVS LFLGFHTLGL YVHNDVVVAF GTPEKQILIE PVFAQFIQAA HGKVLYGLDT
LLSNPDSVAY TAYPNYANVW LPGWLDAINS GTNSLFLTIG PGDFLVHHAI ALGLHTTTLI
LVKGALDARG SKLMPDKKDF GYAFPCDGPG RGGTCDISAW DSFYLSLFWA LNTVGWVTFY
WHWKHLGIWQ GNVAQFNENS TYLMGWFRDY LWANSAQLIN GYNPYGVNNL SVWAWMFLFG
HLVWATGFMF LISWRGYWQE LIETLVWAHE RTPIANLIRW KDKPVALSIV QARVVGLAHF
TVGYVLTYAA FLIASTAGKF G