Gene Information |
Locus tag | Tery_1230 |
Symbol | |
ID | 4242157 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 1904238 |
End bp | 1905296 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 638106443 |
Product | photosystem II D2 protein (photosystem q(a) protein) |
Protein accession | YP_721054 |
Protein GI | 113474993 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01152] Photosystem II, DII subunit (also called Q(A)) |
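The coordinate and length fields above are mutually consistent, which a short check can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1904238
end_bp = 1905296

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1059 352
```

Both results match the reported Gene Length (1059 bp) and Protein Length (352 aa).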
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000683761 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGTAG CAGTCGGACG CCCACAATCT CAAAGAGGAG CATTTGATGT CCTCGATGAC TGGCTAAAAC GCGATCGCTT TGTATTCGTT GGTTGGTCTG GTATACTTCT CTTCCCATGT GCTTTCTTGT CCATTGGAGG ATGGTTAACT GGGACTACCT TCGTCACTTC CTGGTATACC CACGGGTTAG CCAGTTCTTA CCTAGAAGGA TGTAATTTTC TAACAGTTGC AATTAGCAGC CCAGCTTATA GCATGGGACA TTCCCTACTC TTCCTTTGGG GCCCAGAGGC TCAGTGGGAC TTTGCCCGTT GGTGTCAAAT CGGTGGTCTC TGGTCATTCA CTGCCCTACA CGGTGCATTT GCTCTAATTG GATTCTGCCT ACGTCAGATT GAAATTGCTC GTTTGGTTGG AATACGTCCT TACAATGCTA TTGCCTTCAC AGGACCAATT GCAGTATTTG TAAGTGTATT TTTAATGTAC CCATTGGGTC AATCAAGCTG GTTCTTTGCA CCTAGTTTTG GTGTAGCTGG TATATTCAGA TTTATCCTAT TTTTACAAGG ATTCCATAAC TGGACACTCA ACCCATTTCA TATGATGGGT GTAGCAGGAA TCTTGGGTGG TGCACTTTTG TGTGCTATTC ATGGTGCTAC AGTAGAAAAC ACACTTTTCC AAGATGGAGA GGCAGCAAAT ACCTTCCGCG CATTTGAGCC TACTCAATCT GAAGAAACAT ATTCAATGGT GACAGCTAAT CGTTTCTGGT CCCAAATTTT CGGTATTGCA TTTTCCAACA AACGTTGGTT ACACTTCTTC ATGTTATTTG TACCAGTAAC AGGATTGTGG ATGAGTTCCA TAGGTATAGT GGGTTTAGCA TTCAACCTAC GAGCTTATGA CTTTGTATCT CAAGAGTTAC GGGCAGCAGA TGACCCAGAA TTTGAAACAT TTTATACCAA AAATATCTTG CTAAATGAAG GTATCCGAGC TTGGATGTCT CCTGCAGACC AACCACATCA AAACTTTATG TTCCCAGAGG AAGTACTACC TCGTGGTAAC GCTCTTTAA
|
Protein sequence | MTVAVGRPQS QRGAFDVLDD WLKRDRFVFV GWSGILLFPC AFLSIGGWLT GTTFVTSWYT HGLASSYLEG CNFLTVAISS PAYSMGHSLL FLWGPEAQWD FARWCQIGGL WSFTALHGAF ALIGFCLRQI EIARLVGIRP YNAIAFTGPI AVFVSVFLMY PLGQSSWFFA PSFGVAGIFR FILFLQGFHN WTLNPFHMMG VAGILGGALL CAIHGATVEN TLFQDGEAAN TFRAFEPTQS EETYSMVTAN RFWSQIFGIA FSNKRWLHFF MLFVPVTGLW MSSIGIVGLA FNLRAYDFVS QELRAADDPE FETFYTKNIL LNEGIRAWMS PADQPHQNFM FPEEVLPRGN AL
|
| |
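The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the short input string is a made-up example rather than the gene sequence itself.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated blocks into one uppercase sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Illustrative input only; paste the full "Gene sequence" value here instead.
example = "ATGC GGCC"
joined = clean_sequence(example)
print(joined, gc_percent(joined))  # ATGCGGCC 75.0
```

Running `gc_percent` on the cleaned gene sequence should give a value that rounds to the reported GC content, provided the record computes it over the same span.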