Gene Information |
Locus tag | Tery_4965 |
Symbol | |
ID | 4246619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 7572184 |
End bp | 7573545 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 638109776 |
Product | hypothetical protein |
Protein accession | YP_724352 |
Protein GI | 113478291 |
COG category | [S] Function unknown |
COG ID | [COG4370] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03492] conserved hypothetical protein |
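The length fields in this table can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, that the CDS is unspliced (as the record states), and that the final codon is a single stop codon. The function name `cds_lengths` is hypothetical and is not part of any database tooling.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length_bp = end_bp - start_bp + 1
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The last codon is the stop codon, so it contributes no amino acid.
    protein_length_aa = gene_length_bp // 3 - 1
    return gene_length_bp, protein_length_aa


# Coordinates copied from the record for Tery_4965.
print(cds_lengths(7572184, 7573545))  # -> (1362, 453)
```

Under these assumptions the result agrees with the listed Gene Length (1362 bp) and Protein Length (453 aa).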
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.252399 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTTAA AAATACTTTG CGTGAGTAAT GGTCACGGAG AAGATGTGAT CGGGTCTCAA ATTTTGCAAG AACTCAAAAA ATATCCTCTT CCTCCTGATC TTGTTGCTTT ACCGATAGTG GGAGAAGGAC AAGCTTATAC AATAGCTGAC TTCTCAATTA TTGGCCCCCG ACAAAGAATG CCTTCTGGTG GATTTATTTA TATGGATGGA AAGCAGTTAT GGCGAGATGT TAAAAGTGGT TTGTTAAGTA TTGTTTTGGC TCAGTTGAAA GCATTAAGAT CTTGGATAAG TGATCAAACT AATGATGGTC ATCAAGTAGT TATTTTGGCA GTTGGGGATA TTGTGCCTTT ATTATTTGCT TGGTTAAGTG GAGCACCTTA TACTTTTTTA GGAACTGCTA AATCTGAATA TCTGATTCGA GATCAAAATG AGCAACTTCT CCCTAGGTCT TGGGTGGAAA GTTTCCTGCT TTCCTCAGGG TCAGTCTATT TTCCTTGGGA ACGTTGGTTA ATGAGTAGAA AGAAATGTGG GGCAGTCCTT GCTAGAGATG ATTTAACAGC AAAAATGTTA AAGAAAAAGT CTATTCGAGC TTATTGTGTG GGAAACCCGA TGATGGATGG TGTCAAGTTA AAAAGCTCTA TGGAGTTAAT GTCTGGTAAT AAAGCCCGGA TGCTAGAGAT GCATGATCAA TTAACAATTA CTTTGTTACC AGGGTCTCGT TCTCCAGAAG CTTATGCCAA TTGGCAAATT ATTCTTCAGG CAGTGACAGG GTTGCTCGAA AGCTTTCCAC AAAAAAAGTT TTTGTTTCTG GCAGCGATCG CTCCTAATTT AGATTTGGAA GCTTTTACTA AACAGCTTTT GTTTGATAAT TGGCAAACTG AACAAGAAAT TCTAACTCAG AACCAAAACA GTATTTTACA AATGCCAACT GATAAACCTG AACTAACTTT TTGTTTTAGA GAAAAAAGTA TTCATTTTCC CATAAAATTT ATTTCTCAGA ATAAAAATGC AAGTCTGATT TTAAATCAAC AGGCTTTTCA AGAATTTATA CATCAGGGAG ATTTAGCTAT TGCTATGGCA GGTACGGCTA CAGAACAATT TGTCGGTTTA GGGAAACCAG CGATCGCTAT TCCTGGCAAG GGTCCACAGT TTACATCCAC TTTTGCAGAA AATCAAAGTC GCCTTTTAGG AATTTCTCAA ATTCTGGTTA AAGATCCTAG AGAAGTTTGT GGTGTAGTTA AGTCTTTGTT AGATAACTTA GAGCAACGGC GCTTAATTGC TAAAAATGGT GTTAAAAGAA TGGGAGGCTC AGGTGCGGCT AAAAGAATTG CTAATTTTTT AATTAATTTA AATTGGGTTT AG
|
Protein sequence | MTLKILCVSN GHGEDVIGSQ ILQELKKYPL PPDLVALPIV GEGQAYTIAD FSIIGPRQRM PSGGFIYMDG KQLWRDVKSG LLSIVLAQLK ALRSWISDQT NDGHQVVILA VGDIVPLLFA WLSGAPYTFL GTAKSEYLIR DQNEQLLPRS WVESFLLSSG SVYFPWERWL MSRKKCGAVL ARDDLTAKML KKKSIRAYCV GNPMMDGVKL KSSMELMSGN KARMLEMHDQ LTITLLPGSR SPEAYANWQI ILQAVTGLLE SFPQKKFLFL AAIAPNLDLE AFTKQLLFDN WQTEQEILTQ NQNSILQMPT DKPELTFCFR EKSIHFPIKF ISQNKNASLI LNQQAFQEFI HQGDLAIAMA GTATEQFVGL GKPAIAIPGK GPQFTSTFAE NQSRLLGISQ ILVKDPREVC GVVKSLLDNL EQRRLIAKNG VKRMGGSGAA KRIANFLINL NWV
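As a spot check that the protein sequence follows from the gene sequence, the sketch below translates the first three and the last two 10-base blocks of the gene sequence. It builds the codon table from the amino-acid string NCBI lists for translation table 11 (base order T, C, A, G). The helper name `translate` is hypothetical, and the sketch ignores alternative start codons, which is safe here only because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG (NCBI table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate in-frame DNA, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGACCTTAA AAATACTTTG CGTGAGTAAT"))  # -> MTLKILCVSN
print(translate("AATTGGGTTT AG"))                     # -> NWV
```

Both fragments match the corresponding ends of the listed protein sequence (MTLKILCVSN at the start, NWV at the end).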