Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4157 |
Symbol | |
ID | 4245808 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 6412451 |
End bp | 6415675 |
Gene Length | 3225 bp |
Protein Length | 1074 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 638109058 |
Product | hypothetical protein |
Protein accession | YP_723637 |
Protein GI | 113477576 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGGGA ATAAAACAAT ATTAACTGAA AATAGCTCCC CCCTGGGAAT TTGGGCTAAG GAAGCTTTAG GACTTCCAGA AGTACAAGTA CAGGTAAGAT TGCGAGGTAA TCATCTACAT ATTCTCTGTG AAGCAGAAAA ATGTCCTGAA ATGAGCTTTG CACTTGCCCA GTTTTCTCAG GCACTGAGTC AGATCAATAT AGAATCTTTA TTACCTCCTA ATCAACCTCG GATTTACCAG ACGTTTCTGT GTGGTCGTAC TTTGGGACGT AGGCGACCTG ACTGGACAGT AAAACTTGAT AGTAATAAGG TAAGGGCTCA AAGTGGGAGA CTAAATTCTC CTATCTTATC TAATGATTCA GAAACTAATT TCGATCCAAC ATCTACTTCT ACTGCTTCTA CTCCTCAAGA ATACTCACAA AATTATCATG GCACTGAAGG TTTACCTGAA GAAAAATTCA CAAAAATTGG TGTTGGACAA TCTTCTGAAC TGGATATCCA TCGCGATGTT GCCTCTGAAT TGCTGTTAGC TCCTGATCAG GATTTTGTCG GGTCTAATGA CACATTAGAT AATCTCAACT TTTTATCTCC AAAATTCGAC TCAGAGACTT TTTCTAGTAC TGAACCAGAA AAGTCTAAGT TAAAAACGCA GCTAGATAAA CTCAAGGAAT CAGATATTAA TCTGATTAAT TCTTCTTTAA CAGTTTCTTC TAAAAGGTTA GCTAAGTATG GTCACCCAGA TGCAATTGCT AGTTATCTGA GTGAAATTTT AGGTGAGTTA GGGGTCTGTG TAAATGTTAG TGTTAGAGAA AAACAATTTA AGGAAAAATT AACAGAAACA AGTTCACAAT TAGAGGATGT TACTAAAAAT TTTAAGTTAA AAACTCAAAA AATCTTATGG GTATCTTGTG AAGCAACTTA TAGTCCCGAT CCATCTCTGT TAGCTGAACC AATTACTCAA AAACTAAGAG ACCTCAAACT TAAAGATTTC CATGAAGCTT TAATTTCTCT ACTTGTTCGA GGTGAAACGG CTCCTGACTG GATGTTGCGA GTGGATCTAA CTCCACCCGA TCTGATGCTA CAAGAATGGG CGAGTTGGGG GGATGTAACT GCAATTGAAC GTTTGCTTAA ACAAAAGCTG GCTACCCTGG GAGTTGATAT TCGAGGGATC CTCAAAGAAT CAACTTTACA TTTATTTTGT ACTAGTACTA ATAATTCCAG CCAAGAGTAT CCAGATCAAC AGAAAACAAA AGCAGCGATC GCTTCTATAT TAGCTACAAT TATACCAAGA GGTATTCAAG CTGCTACCAT ATACGGTTGT ACGGTTAATG AAAGTAACTA CAAAAGGAAA GAATTTCCTC GCTGGATAGA CTGGTTGAAT TTACCGGCAT CAGAAAATCA AGATCTATCT GCTAGTGCTG AGGTTTTGGC AAGTCAAGGA AACTATGAAG CAATTAGTTT TTTGTTAAAT AGATTAGTTA ATTCTAACCT TGATCAAAGA CTTAAAACAG GTGGTATCCG GGTATTAATA TTATCGAAGC AAGAATTATT GCATGTTATG AGCGAAGCTC CTACAAGTCC ATCTCAATCT CAAGTAGGAC CATGCATTGC TAATTTTTTG CGTCAACTTC AAATTCCTGG AGTTAGTGGG GTAAGGGTTT ACGGTCGTCG TGCTGGTCAA AAATTACCTT TATGGCGCTA CGGTATTAAT TTTACTACGA AAAATCGACG ATATTCTGAA AAACCTCCAG AGTTTGCTGC AACTGCCGAA ATGGATTTTT TGTTGGGTAA AAGAGCAGAT CGTGCTTTCC TCAAGTTAGC ACCAAAAATG ACTGAAAAAA GTGACTCAAG TCTTTTATAT TTTCATGGGT ACACATTATT TAAAGGACGT CTGTTGACAG GTATTCAGCA GTTACTTATT GGCTCGCGTT TATTTATTCC TAATGAAGAA GATTTGACAA AAATAAGTAA TTCATTAACA TACAATAGTC GCAGTTATGG TAAATGGTTT GCTGCTATTT CTTATGCTGC TTTGGGAATT TTATTGACGG TTCAAACTGA TTTAAAAGTA GATGAAATAT TAAAGAAAGT TCCAGATGTT TCTCTGTATG GAAATATTTG TCAAGGTCGA AATTGCCAAA CTCAATCAGA AGAAATTGTA GATGAAAATC AAGCTTCAGG ATTAATGAAT TATCCTAGTT TTAATAGTCC GCAATTAGAT GAACAGTTGC TTCGTTATCA ACAATATTTA GAAGCAGAAA AAAAAATACC AGATATTTTA ATAGTTGGTA GTTCTAGAGC TTTGAGAGGG GTCGATCCCA CGGTGCTTGC TGAAAGTTTG GCAGCTTTAG GATATCCAAA TTTCAGAATT TATAATTTTG GGGTAAATGG TGCAACTGCT CAAGTAGTAG ATTTGATTGT ACGTCAGGTT TTACCACCAG AACAATTGCC AAAATTAATT ATTTTTGCTG ATGGAGTTCG GGCTCTAAAT AGTGGTAGAG TTGATAGAAC TTTTGATATT ATTGCTGGTT CTGAAGGTTA TGATCAAGTA GATGCTGGTT CGTTTATTAT TGGGAATAAT AGATCAGACT CATCAGTTAA TTATGATTTA AATAAATATA AGCAAATATT GAAAAAATTA GAGTCAAAAG TTAAGAAAAT ATCAGTGACT TATCAGCAGC GCGATCGCCT AAAAAGATTG ATAGTATCAA TTATTAAGAA TCCGACAAAT TTTGATTTTT TGTCTGGGGA AGGTAACCAG GATTTAGTAA AAGATCATAA CCTAGAGTTA GAGCAATTTC AAGCAAATGG TTTTCTGCCA ATTTCTATTA AATTTAAACC TGAAAGTTAC TATCAAAATT ATACTAAAGT TTCTGGTGAT TACGATGCTG ATTATGCTGA GTTTCAATTA CGAGGAAAAC AAACCACTGC ACTCAAAAAA TTACTACAAT ATACTCAGTC AATAGGGGTT AATTTTGTAT TTGTAAATAT GCCTTTAACT ATAGTATATT TGGATGAAGT ACGAACTGTT TATGAACAAG AATTTCAGGA ATATATGCAA CAGTTATCTG GGGAATATAC TAATTTTATA TTTCGAGATT TAGGCAGTGC ATGGCCAGAA ACTTATGACA ACTTTTCTGA CCCTAGTCAT CTAAATCTTT ACGGAGCGAT CGCTGTTTCT CAAACATTAG CTGATGATCA TATTATTCCT TGGCGAAAGC GTTAA
|
Protein sequence | MKGNKTILTE NSSPLGIWAK EALGLPEVQV QVRLRGNHLH ILCEAEKCPE MSFALAQFSQ ALSQINIESL LPPNQPRIYQ TFLCGRTLGR RRPDWTVKLD SNKVRAQSGR LNSPILSNDS ETNFDPTSTS TASTPQEYSQ NYHGTEGLPE EKFTKIGVGQ SSELDIHRDV ASELLLAPDQ DFVGSNDTLD NLNFLSPKFD SETFSSTEPE KSKLKTQLDK LKESDINLIN SSLTVSSKRL AKYGHPDAIA SYLSEILGEL GVCVNVSVRE KQFKEKLTET SSQLEDVTKN FKLKTQKILW VSCEATYSPD PSLLAEPITQ KLRDLKLKDF HEALISLLVR GETAPDWMLR VDLTPPDLML QEWASWGDVT AIERLLKQKL ATLGVDIRGI LKESTLHLFC TSTNNSSQEY PDQQKTKAAI ASILATIIPR GIQAATIYGC TVNESNYKRK EFPRWIDWLN LPASENQDLS ASAEVLASQG NYEAISFLLN RLVNSNLDQR LKTGGIRVLI LSKQELLHVM SEAPTSPSQS QVGPCIANFL RQLQIPGVSG VRVYGRRAGQ KLPLWRYGIN FTTKNRRYSE KPPEFAATAE MDFLLGKRAD RAFLKLAPKM TEKSDSSLLY FHGYTLFKGR LLTGIQQLLI GSRLFIPNEE DLTKISNSLT YNSRSYGKWF AAISYAALGI LLTVQTDLKV DEILKKVPDV SLYGNICQGR NCQTQSEEIV DENQASGLMN YPSFNSPQLD EQLLRYQQYL EAEKKIPDIL IVGSSRALRG VDPTVLAESL AALGYPNFRI YNFGVNGATA QVVDLIVRQV LPPEQLPKLI IFADGVRALN SGRVDRTFDI IAGSEGYDQV DAGSFIIGNN RSDSSVNYDL NKYKQILKKL ESKVKKISVT YQQRDRLKRL IVSIIKNPTN FDFLSGEGNQ DLVKDHNLEL EQFQANGFLP ISIKFKPESY YQNYTKVSGD YDADYAEFQL RGKQTTALKK LLQYTQSIGV NFVFVNMPLT IVYLDEVRTV YEQEFQEYMQ QLSGEYTNFI FRDLGSAWPE TYDNFSDPSH LNLYGAIAVS QTLADDHIIP WRKR
|
| |