Gene Information |
Locus tag | Tery_1091 |
Symbol | |
ID | 4241664 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 1709719 |
End bp | 1711734 |
Gene Length | 2016 bp |
Protein Length | 671 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638106318 |
Product | phage tail Collar |
Protein accession | YP_720930 |
Protein GI | 113474869 |
COG category | [S] Function unknown |
COG ID | [COG4675] Microcystin-dependent protein |
TIGRFAM ID | |
|
|
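The coordinates, gene length, and protein length listed above are mutually consistent, and a little arithmetic confirms it. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon (consistent with the gene sequence ending in TAA). The function and variable names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 1709719
END_BP = 1711734


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded, assuming one terminal stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2016
print(protein_length(length_bp))  # 671
```

Both results match the Gene Length (2016 bp) and Protein Length (671 aa) fields of the record.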
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.152489 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACCCA TAATGCCGTT TGAGTTTTTC TGGTGTTATG AGCCAGAAGA CGAAGATGAG GAAGAATATT GCCAATGTTT ACTGTTAGGA ACACCTGAGA CAGTTACCCC ACTCCATATT GGTCTCAGAA ATACTCTTTC TGATAAGGTT ATCACAATTG CTTCCATTCC AGAGGTAACG ATAGCTAGTG CTGAACATTA TCATTTTAAA TTAACGTTTC AGCCAGGAAT TTTAGCTGAT CCCGACCAAA TTAAGATTGA AGAAACCGAA AGTTGGTCAC TTTATATCAA AACAGATCCT AACTATGTGT GTTTGTATTT ATTGTGGACA GAAGCACCAA TAATCCTCAA CCCTGGTCAA GAAACAGAGG TTATCTTAAC AGGGGTTGCT GGTTTGGCTA ATAACAGTGC TACAGAGGCT ATCTTAACAG AGGTTATTGA TTTGGCTAAT AACAGTGCTA GCGCCAGAGC AGTCGCTGAT GTCAACATTA ACTGGATAAT TGGAGAGAAA AACATTAGCA TAGCGGAAGT TAAATTCCCT GGACAAGATG ATTCGTATGA TACAACAACT ACCCTAACAT TAGAAATGAA AAAAACATCT GGAAAATCTA ACATCCCTCT CTATGTCGGT TTTGTGGGTT CTAATAAAGT TTTCAATACC CACGATAACA ATAGCCGTCT GCAACTGCGA ATTACTAACA CAAATCTTTC AAATGCAGAT AACTCAGCAA TTACCTTTAA TTATAATGCA AACTCGGATC AATGTTCTCA GTTAGTGATT GTTTTAGAAG TAGGAGATAA AGATTCGGTT CCTTGGGCAT TAGGTACTCA AGATGATGTC AATGGTGTTA ATATTACTAT TAATAATTGG CAGCAACTTA GTGATGGTCC GGAAGAAATT CAGGTTAACG GTCAAACTAA GGCTTTGGAA TGGACGTTTA TTCCTCAAGA ATCTGATGTG GTCTTAAACT CGCAAGAAAC CTTGTTAATT AACTTGGATG GTATTACTAC AGCGCACCCC ACAGGAGAGA CTAACCTGTA TTTACGCTAT CAATATATTG ATGGTTACCG AGACGGACAA TTTGTTTGTC AAATTGAAAA AGCTCCCTTA GTTTTTGATG AGAAAGTCCG CATAGGAACC AACAAATCAT TAGATGGACT TCTCACCGTG AAAGATGGGA TTAGTCTAGA TGCTCCAGGG GAAATTAACC ACACGGGCAC TTTGATTTTT CGGTCTGATA CCGATACATC AAATGATGAT ATATCTGTCA AATTTTATAA ACAGAAAAAT GACTTACCCT TGATGGAATT AACATCGCGC GGTGATTTGC AATTTCCTGC CAAAAGTGGT CAGACTGGAC AAAGAATTAA CTTATGGGGT CCAGAGTATG GAGTTGGAAC ACAAGGCAAT ACCACTTATT TCCGAACAAA CGGATACTTT GCTTGGTATC TAGGAGGTAG TCATGAAGAA ACAAGTACAA CCCCTGGTGA TGGGGGTCGC ACACAGATGC TGCTTGATCA GGAAGGTAAC CTAGAAGTTG GGTGTGGACG AATTAAAGAT AGCACTGGCT GGGTCGTTCC TGTAGGGACA ATTGTTCCTT ATGCAGGTTT AACAGCCCCT GAAGGTTGGT TGTTATGCAA TGGTCAGTCA TACGACTGGG AGCAGTATTC AGAATTATAC AAGGTGCTTG ATGAAATAAA GGTTCTTCCA GATTTAAAAG GAAGATTTAT CATCGGTGTT GGAGATAAAG ATGGCTACTC CTACAGTCTT AATGCCAAAG GCGGAGAAGA AAAGCACACG 
TTAACAAAAG ACGAAATGCC AAGCCATGAT CATAGTAAGG GTGAATATAA ATTCATTTTA AAAAAGGATG GAAAAGTGAC TACCTCAAAC AATGTTAATA ACAGTCTCAG AGAGCCAAAT CTTGGCTCTT GTGAAGCTCT TCAGGTAATA GGAAACAATA AACCTTTCGA AAACCGACCA CCTTATTATG CACTTAATTA CATCATCAAA ACATAA
|
Protein sequence | MSPIMPFEFF WCYEPEDEDE EEYCQCLLLG TPETVTPLHI GLRNTLSDKV ITIASIPEVT IASAEHYHFK LTFQPGILAD PDQIKIEETE SWSLYIKTDP NYVCLYLLWT EAPIILNPGQ ETEVILTGVA GLANNSATEA ILTEVIDLAN NSASARAVAD VNINWIIGEK NISIAEVKFP GQDDSYDTTT TLTLEMKKTS GKSNIPLYVG FVGSNKVFNT HDNNSRLQLR ITNTNLSNAD NSAITFNYNA NSDQCSQLVI VLEVGDKDSV PWALGTQDDV NGVNITINNW QQLSDGPEEI QVNGQTKALE WTFIPQESDV VLNSQETLLI NLDGITTAHP TGETNLYLRY QYIDGYRDGQ FVCQIEKAPL VFDEKVRIGT NKSLDGLLTV KDGISLDAPG EINHTGTLIF RSDTDTSNDD ISVKFYKQKN DLPLMELTSR GDLQFPAKSG QTGQRINLWG PEYGVGTQGN TTYFRTNGYF AWYLGGSHEE TSTTPGDGGR TQMLLDQEGN LEVGCGRIKD STGWVVPVGT IVPYAGLTAP EGWLLCNGQS YDWEQYSELY KVLDEIKVLP DLKGRFIIGV GDKDGYSYSL NAKGGEEKHT LTKDEMPSHD HSKGEYKFIL KKDGKVTTSN NVNNSLREPN LGSCEALQVI GNNKPFENRP PYYALNYIIK T
|
| |
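The gene sequence above is printed in space-separated blocks of 10 bases and wrapped across lines, so it needs light cleanup before any analysis. The sketch below strips the whitespace and computes the length and GC fraction. For brevity it uses only the first three blocks of the gene sequence; the helper names are hypothetical and chosen for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks, and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three 10-base blocks of the Gene sequence field, as displayed.
raw_fragment = "ATGTCACCCA TAATGCCGTT TGAGTTTTTC"

fragment = clean_sequence(raw_fragment)
print(len(fragment))                    # 30
print(f"{gc_fraction(fragment):.2f}")   # 0.40
```

When the same functions are run on the full Gene sequence field, the length can be compared against the listed Gene Length of 2016 bp and the GC fraction against the listed GC content of 39%.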