Gene Ava_1242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1242 
Symbol 
ID3683180 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1529297 
End bp1530352 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content50% 
IMG OID637716580 
Productphotosystem II reaction centre protein PsbD/D2 
Protein accessionYP_321761 
Protein GI75907465 
COG category 
COG ID 
TIGRFAM ID[TIGR01152] Photosystem II, DII subunit (also called Q(A)) 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.30805 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATCG CAGTAGGAAG GGCCCCCAGT AGAGGGTGGT TTGACGTACT AGACGACTGG 
TTAAAGCGCG ATCGCTTCGT ATTCGTAGGT TGGTCAGGGA TATTATTATT TCCTTGCGCC
TTCCTAGCAC TAGGCGGTTG GCTAACCGGT ACAACATTTG TCACATCGTG GTACACCCAC
GGACTAGCAT CATCCTACCT GGAAGGAGCA AACTTCCTGA CAGTAGCAGT ATCAAGCCCA
GCAGACAGCA TGGGACACTC CCTGTTGTTG TTGTGGGGAC CAGAAGCCCA AGGGGACTTT
ACCCGTTGGT GTCAGCTAGG AGGGTTATGG CCATTCGTAG CCCTACACGG AGCCTTCGGT
TTGATCGGAT TCATGCTACG TCAATTTGAA ATTGCGCGCC TAGTAGGAAT CAGACCCTAC
AACGCACTAG CATTTTCAGC ACCCATAGCG GTATTCGTCA GCGTATTCCT GATGTACCCA
TTGGGACAAT CATCTTGGTT CTTTGCACCA AGCTTTGGAG TAGCAGCAAT CTTCAGATTC
TTGTTATTCC TCCAAGGGTT CCACAACTGG ACACTAAACC CCTTCCACAT GATGGGAGTA
GCAGGTGTAC TAGGTGGAGC GCTACTGTGT GCCATCCACG GTGCAACAGT AGAAAACACC
CTATTTGAAG ACGGCGAAGG CGCAAACACC TTCCGCGCCT TCAACCCCAC CCAATCAGAA
GAAACCTATT CAATGGTGAC AGCGAACCGA TTCTGGTCAC AGATATTCGG GATTGCTTTC
TCCAACAAAC GCTGGTTGCA CTTCTTCATG TTGTTCGTAC CCGTCACTGG GTTGTGGATG
AGTGCGGTAG GCATCGTTGG TTTAGCATTG AACCTGCGGG CTTATGACTT CGTTTCCCAA
GAATTACGGG CAGCAGAAGA CCCAGAATTT GAAACCTTCT ATACCAAGAA CATTTTGCTG
AACGAGGGTA TCCGCGCTTG GATGGCTCCT CAAGATCAGC CTCACGAAAA ATTTGTATTC
CCCGAAGAGG TACTACCACG TGGTAACGCT CTCTAA
 
Protein sequence
MTIAVGRAPS RGWFDVLDDW LKRDRFVFVG WSGILLFPCA FLALGGWLTG TTFVTSWYTH 
GLASSYLEGA NFLTVAVSSP ADSMGHSLLL LWGPEAQGDF TRWCQLGGLW PFVALHGAFG
LIGFMLRQFE IARLVGIRPY NALAFSAPIA VFVSVFLMYP LGQSSWFFAP SFGVAAIFRF
LLFLQGFHNW TLNPFHMMGV AGVLGGALLC AIHGATVENT LFEDGEGANT FRAFNPTQSE
ETYSMVTANR FWSQIFGIAF SNKRWLHFFM LFVPVTGLWM SAVGIVGLAL NLRAYDFVSQ
ELRAAEDPEF ETFYTKNILL NEGIRAWMAP QDQPHEKFVF PEEVLPRGNA L