Gene Information |
Locus tag | Tery_0334 |
Symbol | |
ID | 4243149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 510841 |
End bp | 512151 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638105666 |
Product | hypothetical protein |
Protein accession | YP_720281 |
Protein GI | 113474220 |
COG category | [S] Function unknown |
COG ID | [COG2855] Predicted membrane protein |
TIGRFAM ID | [TIGR00698] conserved hypothetical integral membrane protein |
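The coordinate and length fields above are related by simple arithmetic, sketched below. The function names `gene_length` and `protein_length` are hypothetical, introduced only for illustration. The sketch assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon, which matches this record (the gene sequence ends in TAG).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    final codon is a stop codon that is not counted in the protein.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp, End bp
length_bp = gene_length(510841, 512151)
print(length_bp, protein_length(length_bp))
```

With the record's Start bp and End bp, this reproduces the listed Gene Length (1311 bp) and Protein Length (436 aa).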
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAAAA CAAAATGGTC TGATTTATAC AAATTAGAAG ACTGGTGGGC AGTCTGGATC GGTTTGTTAC TGTTAGCAAG TGTATTTAGT GGATGGGTAA CACTTGTACC CAAAATTCCT AAATGGAAAG GAATAGATAT TCTGGCTGCA CTCCCGATAG ATTTAATTCC TAGCTTAGTA TTATTAGCGA TCGCTCTGTG CGCTGTATTT ACATTTGGCA ACATTTTGAT GCAGGGGCGT AGGGCAAATG CTATGATACC AGGATTTTTG GCCATCTTTA TCTTAGCTAC TCTCGCTTAT GTATTGGGAA ATTCTCAAGC TGCAAAAGCA ATAAATCTCG GTTATGCCTT CTGGGCACTA CTAATAGGAT TATTAATTAG TAATACAGTA GGTACGCCAG AATGGTTGAA ACCTGCCGTG CGCACCGAAT ACTATATTAA AACTGGGTTA GTCATTTTGG GAGCAGAAAT TCTGTTTAGT CGCATTGTCG AATTCGGACC CTATGGTTTA GCGATAGCCT GGGGAGTCAC GCCAATAGTA GTGATATTCA TGTATTTATT TGGCACTCGC ATCCTGAAAA TGCAGCAAAA AACATTAGTA ATGATAGTAG CAACTTCAAC CTCTGTCTGT GGAGTCTCGG CAGCTATTGC AGCTGCAGCA GCTTGTCGCG CAAAAAAGGA TGAGTTAACT TTGGCCGTCG GAATGACTCT GATATTTACA GTCATAATGA TGGTTTTCAT GCCCTTGTTT GTCAAAGCTG TCAACATGAA TGAATTAGTG GGAGGAGCCT GGTTAGGTGG CACCATTGAT GCTACTGGCG CTGTTGTTGC AGCAGGCGAA GCATTGGGGG ATGTGGCCGG AAAAGCGGCA GCTTTAGTAA AAATGATTCA AAATATGTTG ATCGGAGCGA TCGCTTTTGC CATAGCAATT TTTTGGGTTA CATCTGTGGA AAAGCAACCA GGATCTTCTA CTCCTACTTT AAATGAATTA TGGGTGCGTT TTCCTAAGTT TGTTTTAGGT TTTATTGTAG CATCTTTATT AGTTTCTTTT GTATTTTTAC CCGTTATAGG TTTGAATAAT GTTCAAGCTT TATTGAAGCA GACTAAAAAT TATCGTGGTT GGCTATTTTG TATGGCGTTT ATCTCAATTG GGTTAGAGTC TAATTTTCAA ATTTTAGCCC AACAAATGCG AGGCGGTAAA CCGATGATTT TGTATATTGT AGGGCAAAGT TTTAATATAT TGCTAACTTT GGCTGTGGTG TGGTTAATGT TAAGTGGAAA CTTCTTTCCA GTTCCAGAAT TAGGAGGGTA G
|
Protein sequence | MSKTKWSDLY KLEDWWAVWI GLLLLASVFS GWVTLVPKIP KWKGIDILAA LPIDLIPSLV LLAIALCAVF TFGNILMQGR RANAMIPGFL AIFILATLAY VLGNSQAAKA INLGYAFWAL LIGLLISNTV GTPEWLKPAV RTEYYIKTGL VILGAEILFS RIVEFGPYGL AIAWGVTPIV VIFMYLFGTR ILKMQQKTLV MIVATSTSVC GVSAAIAAAA ACRAKKDELT LAVGMTLIFT VIMMVFMPLF VKAVNMNELV GGAWLGGTID ATGAVVAAGE ALGDVAGKAA ALVKMIQNML IGAIAFAIAI FWVTSVEKQP GSSTPTLNEL WVRFPKFVLG FIVASLLVSF VFLPVIGLNN VQALLKQTKN YRGWLFCMAF ISIGLESNFQ ILAQQMRGGK PMILYIVGQS FNILLTLAVV WLMLSGNFFP VPELGG
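The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sketch assumes the sequence contains only unambiguous bases (A, C, G, T).

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumes only A, C, G, T are present)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First two blocks of the gene sequence in this record, used as a small example
prefix = clean_sequence("ATGAGTAAAA CAAAATGGTC")
print(prefix, gc_fraction(prefix))
```

Applied to the full gene sequence, `gc_fraction` gives a value that can be compared with the GC content field in the record.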