Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1933 |
Symbol | |
ID | 4242682 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 2998586 |
End bp | 3000205 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 638107054 |
Product | hypothetical protein |
Protein accession | YP_721661 |
Protein GI | 113475600 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03187] DGQHR domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.258826 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0613304 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAAAA AATTAATGAT AAAAAAGGCC AATAAACCGA CATCAGAAAT AGCAAAAGAA ATTTTAGAAA ATGATAATCG AGAAAAAGAA GCGATCGCTA TCCTATTAGA CAAACATATA GGCAAAGACA ATAGATTATT AGTACAAAAA ACCATGATGG GAAACACAGA AGCTTACATT GGTTCGGTCA CTCTAGAATG GTTAGATAGT CGTGTACGCT TTGCCTCTCA ACTACCATTA TTTAGGCAAA AATTTGACAT AGAAACTGAT AATATAATTC GGGATTCAGA AACAATAGAT GAAATTCAAC AACGACCCTT AGACTGGTCT CGTCAAGCTC CATTAACTTT ATATTTAGCA ACTAGAAAAT CCCATAAATT TCCGGCAGTA TTAGTAGTAA TTAGTCCGAG TTGGGTAGAT AATCCTAAGG CAGAAGAATG GAATAAAAAT GGTGAAGCAA ATAAATCTGC AACTGATTTT TTTTCCCTAG ATTCACAGGG AAAAGTAGGA TTATTAGACC TACGTTTAGA AGTAGCAGTA TTTGCCTTAG ATGGTCAACA TAGACTGATG GGAATTCAAG GATTAATGGA ATTAATAAAA ACTGGTAGAT TACCAAGATA TAACAAACAA AAGAAATCAG TAGGTGCAGC AATTACTATT GATGATTTAA CTATAACTCA TCAGATTGAA TTACCAGAAA TACAAAAATT AGCTTACGAA CAAATAGGAA TAGAATTTAT TCCAGCAGTA GTAAAAGGGG AGACAAGAGC ACAAGCACGA CGCAGAGTTA GGTCAGTTTT TGCTCATGTA AATTTGACAG CAGTAAAATT AAGTAAAGGG CAATTAGCAT TATTAAATGA AGATGATGGA TTTGCTATTG TAGCGAGAAA AATAGCAATT TATCATCCTA TTTTAAAGGA GAAAGATGGT AGAAATCCAA GAATAAATTG GGATAGTGCA ACTGTGGCAG CTAACTCTAC TGTTTTAACT ACTCTCCAAG CATTGCAAGA AATGTCTGAA AGATATTTAA AACCTCGTTA TCCCTATTGG AAACCTTCAG ATAGAGGTTT AATTCCTATG CGTCCTGCAG AAGAAGAGCT AGAAGAGGGG GTAGAAGAAT TTATGGTACT TTGGAATTAT TTGTCTAATT TGCCTAGTTA TTCTAGATTA GAAAATGGCT CTGAAACTTC GGAATTAAGA AGATTTAGTT TTGAGAGAAA ACCGGGAGAA GGTCATGTTT TGTTCCGCCC TATTGGGCAA ATTGCTTTTG CTGAAGCTTT AGGGATTTTA ATATATAAAA AAGAATTTTC TCTCAAAGAA GTTTTTCATA AATTAAATAA GTATGATGTG GATGGTGGTT TGAGTGGAAT AGAATTTCCT GACTCAATTT GGTATGGGGT TTTATATGAT TTTAATCGGA AACGAATGTC GGTAGCTGGT AGAGATTTAG CAATGAGATT ATTTATCTAT ATATTAGGTG GAGTTTCTGA CAAAATGGAG CGGGCAGAAG TTCGTCGGCA GTTGGCGCAA GCAAGACGAG TTGGAGAGGA TCAAGCTGTA GATTTTCAGG GTAAGTTTGT CGAATTAAAA AAAGTAGGAT TACCTGAAAT TTTATATTAA
|
Protein sequence | MSKKLMIKKA NKPTSEIAKE ILENDNREKE AIAILLDKHI GKDNRLLVQK TMMGNTEAYI GSVTLEWLDS RVRFASQLPL FRQKFDIETD NIIRDSETID EIQQRPLDWS RQAPLTLYLA TRKSHKFPAV LVVISPSWVD NPKAEEWNKN GEANKSATDF FSLDSQGKVG LLDLRLEVAV FALDGQHRLM GIQGLMELIK TGRLPRYNKQ KKSVGAAITI DDLTITHQIE LPEIQKLAYE QIGIEFIPAV VKGETRAQAR RRVRSVFAHV NLTAVKLSKG QLALLNEDDG FAIVARKIAI YHPILKEKDG RNPRINWDSA TVAANSTVLT TLQALQEMSE RYLKPRYPYW KPSDRGLIPM RPAEEELEEG VEEFMVLWNY LSNLPSYSRL ENGSETSELR RFSFERKPGE GHVLFRPIGQ IAFAEALGIL IYKKEFSLKE VFHKLNKYDV DGGLSGIEFP DSIWYGVLYD FNRKRMSVAG RDLAMRLFIY ILGGVSDKME RAEVRRQLAQ ARRVGEDQAV DFQGKFVELK KVGLPEILY
|
| |