Gene Ava_4121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4121 
Symbol 
ID3681509 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5135070 
End bp5136140 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content42% 
IMG OID637719467 
Productphotosystem II reaction centre protein PsbA/D1 
Protein accessionYP_324615 
Protein GI75910319 
COG category 
COG ID 
TIGRFAM ID[TIGR01151] photosystem II, DI subunit (also called Q(B)) 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0216201 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACCA TTGTTCAACG TCAAAAGGAA TTTAATTTTT TTGATTTATG GGATAGTTTT 
TGTGCGTGGA TTACCAGCAC AGAAAATCGG ATTTATATCG GCTGGTTTGG TGTCTTGTCG
ATTCCTACCT TGCTAGCTGC TACCACCTGT TTTGTTTTGG CTTTTATTGC TGCGCCTAGC
GTAGATATGG ATGGTATACG TGAGCCAATT ATGGGTTCAC TAATGGACGG TAATAATTTA
ATTACAGCCG CAGTAGTGCC GACTTCTGCT GCGATTGGTT TGCACTTTTA TCCTATCTGG
GAAGCGGCAT CAATGGATGA ATGGCTTTAC AATGGCGGGC CATATCAGTT GATTGTGCTG
CATTTTCTGA TTGGTATTTG GTGCTTACTA GGGCGATTTT GGGAACTTAG TTATCGTTTA
GGAATGCGAC CTTGGATAGC AGTTGCCTAT TCTGCACCTG TGATTGCTGC TACTTCCGTT
TTGTTAGTTT ATCCTATTGG TCAAGGTAGT TTTTCTGATG GTTTACCTTT GGGAATTGCT
GGAACTTTCC ACTTTATGTT GGCTTTCCAA GGCGATCATA ATATCCTGAT GCACCCGTTC
CATATGTTGG GTGTAGCAGG TGTATTTGGT GGCGCACTGT TGAGTTCTTT GCATGGTTCT
TTAGTGGCTT CAACGCTAAT TCGCAATACC GATGAAAATG AATCCATCAA TGGTGGATAT
AAGCTGGGTC AGCAGCAAGT AACATACAAA TACTTGGCAG GACACAATAG CTTCTTGGGA
CGCTTGTTGA TTCCTACCTT TGCTAGCAGA AATCATCGTG CTTTCCATTT CTTATTAGCA
GCATTACCAA CAATAGGTAT TTGGTTTGCG GCGATGGGTG TATGTTCAAT GGCATTTAAT
CTCAATGGCT TGAACTTTAA TCATTCCATC TTAGATAGTC GGGGTAATGT AATTAGAAGC
GACGCTGATA TCTTAAACCG TGCCAATATT GGTCTCAGTG TCATGCACGC TCCTAATGTC
CATAATTTTC CATTGGTGCT GTCTAGCGGT CAACCTATTC CAGTTAGTTA A
 
Protein sequence
MSTIVQRQKE FNFFDLWDSF CAWITSTENR IYIGWFGVLS IPTLLAATTC FVLAFIAAPS 
VDMDGIREPI MGSLMDGNNL ITAAVVPTSA AIGLHFYPIW EAASMDEWLY NGGPYQLIVL
HFLIGIWCLL GRFWELSYRL GMRPWIAVAY SAPVIAATSV LLVYPIGQGS FSDGLPLGIA
GTFHFMLAFQ GDHNILMHPF HMLGVAGVFG GALLSSLHGS LVASTLIRNT DENESINGGY
KLGQQQVTYK YLAGHNSFLG RLLIPTFASR NHRAFHFLLA ALPTIGIWFA AMGVCSMAFN
LNGLNFNHSI LDSRGNVIRS DADILNRANI GLSVMHAPNV HNFPLVLSSG QPIPVS