Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0072 |
Symbol | |
ID | 4242455 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 100518 |
End bp | 101846 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 638105435 |
Product | HEAT repeat-containing PBS lyase |
Protein accession | YP_720054 |
Protein GI | 113473993 |
COG category | [C] Energy production and conversion |
COG ID | [COG1413] FOG: HEAT repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0791445 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAAATC TTCTTGAACA GGCAAAAATT GCTGCCCAAA AACAAAACTG GTCTTTAGTC AATTATTATC TACAGCAATT TATCTTAGAG AGTAAGTCAA ATAAAACTCC TTTAATATTC CCAGACAACA ATGTTGTTAT AGATGAGGTA ATTGACTTAG CAATAGAAGT TATGGAAAAT GGAGATTTTC AAGAAAAATG GGATATTAAT AAACTGTTTA AGCAAATTGG AAAACCAGCT ATCGCTCCCT TAATTGAAAT TCTTAACGCC CAAGAAGTAG ATCTAGAAGA ACGTTGGTTT ATAACTAGGA TATTGGCAGA GTTTAACACT GAAGAATTCC TAGAAGTTAT GGGGAATATA GTTATTAGAT CAAAGCCAAA AGATATACAG GAAATAGCAG CAGAAATAAG AGCAAATTGT GGTAATTCGG GAGTTGATTT ATTAACAAAT TTATTAGCAA AGCCAGAGTT AAAATTATTG GTCATTAAAG CTTTAGCTAA AATTAACTGT ATTAGTATGA TTCCAGCATT GTTAACTTTC GTTAAAGATG AAAATTATGA AGTGAGAATC ATGGGAATTT CTGCTTTGAA TAATTATTGT GACTCACGCA TTCCTATTGT ATTAATTTCG GCTTTAAAAG ATAGAGTGGC AAAAGTTAGA AAAACAGCAG TGATTAGTTT AGCTAGTTAT GCTAATTTAC ACCAAGAATT AGGTTTGGTG AAGTTACTAC AACCCTTACT ATGGGATATG GATGTTGAAG TTTCTCAACA AGCAGCGATC GCTCTGGGAA AAATTGGTAC CAATTTAGCA GCAACAGCTT TATATGAACT ATTAAAAACT GCCACAGTGC CAATATTTTT AAAGATAGAT GCAGTGCGTG CTTTAGGTAG AATTGAAACT CAAGTATCCT TAGAATATTT GCAAAATTTA CTAGAGTATA ATTTCTTAGT TGAAGATTAT GATATGGCAC AAATTGTTAA TGAAATTATC ACGGCATTGG GTAAGTTAGA AAAACCTGAA TTAAAACTTC AAGCAACAGA TATTTTAATT GAATTTATTA CCAGTCATAA CTCTAATCTA GAAAATATTT GTGTGAAAAA ATCTTTGGCT TTAGCTCTGG GATACTTAGG AAATATTCAT GCTTTAGAAT CTCTAATTCA ATTATTAGAA GATGAAGATA ATAGTGTTAG ACTTCATTCT GTAGCAGCGA TGAAACAATT AGATTCAGAA AAAGCATATC AAAGACTTAT TTTTTTATCT GCACAAGCAA GCCTTAACTC ACAGTTAAAA ACAGGAATAG CAATAGCTCT TGCAGAATTG AATAATTAA
|
Protein sequence | MSNLLEQAKI AAQKQNWSLV NYYLQQFILE SKSNKTPLIF PDNNVVIDEV IDLAIEVMEN GDFQEKWDIN KLFKQIGKPA IAPLIEILNA QEVDLEERWF ITRILAEFNT EEFLEVMGNI VIRSKPKDIQ EIAAEIRANC GNSGVDLLTN LLAKPELKLL VIKALAKINC ISMIPALLTF VKDENYEVRI MGISALNNYC DSRIPIVLIS ALKDRVAKVR KTAVISLASY ANLHQELGLV KLLQPLLWDM DVEVSQQAAI ALGKIGTNLA ATALYELLKT ATVPIFLKID AVRALGRIET QVSLEYLQNL LEYNFLVEDY DMAQIVNEII TALGKLEKPE LKLQATDILI EFITSHNSNL ENICVKKSLA LALGYLGNIH ALESLIQLLE DEDNSVRLHS VAAMKQLDSE KAYQRLIFLS AQASLNSQLK TGIAIALAEL NN
|
| |