Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0159 |
Symbol | |
ID | 4241752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 240062 |
End bp | 241228 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638105507 |
Product | hypothetical protein |
Protein accession | YP_720126 |
Protein GI | 113474065 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.322221 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.596487 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGGGG AAATCTGCCA AGTTGGCGAA GAAATCTTAA AGCTACTCCT AGATGAATTT CAGCAGTCTA CTCGTGGGTC AAGGCAAAAT TGCCGAGAGG TGGCAGAACG TATTACACAC GAAGTAGATA GGATTTGCAC AGAAAGTAAA AGAATTCAAG CTTCGGGAGA AGTGGGTAAA TGGGCTAAAA ATTTAGCTCT ACATCGCTTG AAACGATGTA TACATTACTA CCAGCTTCGT TCTCAAGAAG GAAGAATAGA ATTACATAGC ACCTTCAGCG CTATTATTTA TAGATATATC ACTCCTGCCC AAATACAATC AAGTTATCAG GCCAAATTAA ATCTGATAGA AGATTTTTTA CAACAATTTT ACCTGGAAAC TTTGAATGCT TTTCGGCGAG AAAGTGAACT ACCAGCAACT TATCGTCCCC GTACTTTGTT AGAACTTGCA GAGTATATGG CCTTTACCGA ACGTTATGGA AAGAGACGTA TACCTTTATC TGGTGGTCGT AGTCAACAAT TGATTATTTT ACGGGCACAA ACATTTTCCC AACAACAACC AAAGGAAACA TTTGTAGATA TTGACCAAGC AGCAGAAGGA ACAACTACTG ACTCAGATAA GACTTGGAAC GATAGATCTA TCCAAGAAGT TCGAGAAGCA ATGGTTGCAC AAGACCCAGG TAATAATATT GCTTCTTTGC GTCAGGTTGT GATTGAAGAA CTAATGGCCT ATCTAGAGGA ACGTGAACAG AAAGACTGTG CAGATTACTT TGCATTGCGT TTACAAGATT TATCAACTGG AGAAATAGAA TCTATCTTAG GTCTAACTCC CCGAGAAAGG GATTATTTAC AGCAACGCTT TAAATACCAT TTGCTCAAAT TTGCTATGGG ACATCGTTGG GAACTGGTTC ATCAATGGTT AGAAGCAGAT TTAGAACAAA ATTTAGGCTT AACTCCTACG GAGTGGGAAG CCTTGCATCA CAAAATTGAT TCAGAGCAAA AAAATTTGCT AAAATTAAAA CAACAAGGTA TTTCCGATGA TGTGATCGCG AAAACTTTAG GTCGTAAAAT CAACCAGGTT AAGAAAAAGT GGTATAAATT ACTCGAACTT GCCTGGGAAT TACGAAATCG TTCAGGTTCC GGAGCAGGGG CATCAAGTGA TGAATAA
|
Protein sequence | MMGEICQVGE EILKLLLDEF QQSTRGSRQN CREVAERITH EVDRICTESK RIQASGEVGK WAKNLALHRL KRCIHYYQLR SQEGRIELHS TFSAIIYRYI TPAQIQSSYQ AKLNLIEDFL QQFYLETLNA FRRESELPAT YRPRTLLELA EYMAFTERYG KRRIPLSGGR SQQLIILRAQ TFSQQQPKET FVDIDQAAEG TTTDSDKTWN DRSIQEVREA MVAQDPGNNI ASLRQVVIEE LMAYLEEREQ KDCADYFALR LQDLSTGEIE SILGLTPRER DYLQQRFKYH LLKFAMGHRW ELVHQWLEAD LEQNLGLTPT EWEALHHKID SEQKNLLKLK QQGISDDVIA KTLGRKINQV KKKWYKLLEL AWELRNRSGS GAGASSDE
|
| |