Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4121 |
Symbol | |
ID | 3681509 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 5135070 |
End bp | 5136140 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637719467 |
Product | photosystem II reaction centre protein PsbA/D1 |
Protein accession | YP_324615 |
Protein GI | 75910319 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01151] photosystem II, DI subunit (also called Q(B)) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0216201 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACCA TTGTTCAACG TCAAAAGGAA TTTAATTTTT TTGATTTATG GGATAGTTTT TGTGCGTGGA TTACCAGCAC AGAAAATCGG ATTTATATCG GCTGGTTTGG TGTCTTGTCG ATTCCTACCT TGCTAGCTGC TACCACCTGT TTTGTTTTGG CTTTTATTGC TGCGCCTAGC GTAGATATGG ATGGTATACG TGAGCCAATT ATGGGTTCAC TAATGGACGG TAATAATTTA ATTACAGCCG CAGTAGTGCC GACTTCTGCT GCGATTGGTT TGCACTTTTA TCCTATCTGG GAAGCGGCAT CAATGGATGA ATGGCTTTAC AATGGCGGGC CATATCAGTT GATTGTGCTG CATTTTCTGA TTGGTATTTG GTGCTTACTA GGGCGATTTT GGGAACTTAG TTATCGTTTA GGAATGCGAC CTTGGATAGC AGTTGCCTAT TCTGCACCTG TGATTGCTGC TACTTCCGTT TTGTTAGTTT ATCCTATTGG TCAAGGTAGT TTTTCTGATG GTTTACCTTT GGGAATTGCT GGAACTTTCC ACTTTATGTT GGCTTTCCAA GGCGATCATA ATATCCTGAT GCACCCGTTC CATATGTTGG GTGTAGCAGG TGTATTTGGT GGCGCACTGT TGAGTTCTTT GCATGGTTCT TTAGTGGCTT CAACGCTAAT TCGCAATACC GATGAAAATG AATCCATCAA TGGTGGATAT AAGCTGGGTC AGCAGCAAGT AACATACAAA TACTTGGCAG GACACAATAG CTTCTTGGGA CGCTTGTTGA TTCCTACCTT TGCTAGCAGA AATCATCGTG CTTTCCATTT CTTATTAGCA GCATTACCAA CAATAGGTAT TTGGTTTGCG GCGATGGGTG TATGTTCAAT GGCATTTAAT CTCAATGGCT TGAACTTTAA TCATTCCATC TTAGATAGTC GGGGTAATGT AATTAGAAGC GACGCTGATA TCTTAAACCG TGCCAATATT GGTCTCAGTG TCATGCACGC TCCTAATGTC CATAATTTTC CATTGGTGCT GTCTAGCGGT CAACCTATTC CAGTTAGTTA A
|
Protein sequence | MSTIVQRQKE FNFFDLWDSF CAWITSTENR IYIGWFGVLS IPTLLAATTC FVLAFIAAPS VDMDGIREPI MGSLMDGNNL ITAAVVPTSA AIGLHFYPIW EAASMDEWLY NGGPYQLIVL HFLIGIWCLL GRFWELSYRL GMRPWIAVAY SAPVIAATSV LLVYPIGQGS FSDGLPLGIA GTFHFMLAFQ GDHNILMHPF HMLGVAGVFG GALLSSLHGS LVASTLIRNT DENESINGGY KLGQQQVTYK YLAGHNSFLG RLLIPTFASR NHRAFHFLLA ALPTIGIWFA AMGVCSMAFN LNGLNFNHSI LDSRGNVIRS DADILNRANI GLSVMHAPNV HNFPLVLSSG QPIPVS
|
| |