Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1337 |
Symbol | |
ID | 4242797 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 2032658 |
End bp | 2034337 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 638106515 |
Product | FHA domain-containing protein |
Protein accession | YP_721126 |
Protein GI | 113475065 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.172478 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTTTCC CTTCAGAAAG TCCTTGCCTC AGCATAGCCC TCGCGCGCCT AAGCATCCCA GAAGCTAATC ACTTTGCCGT CTGGGTCATG CAGGCTCCTT TTCAGCGAGG ATATGTCCAC CATGACCAGG TCTGGCCAGA AACCTTATCA AAAGCTTGGC AGACTTGGTT AGAGGTCTTC TCTCCCCAAA GTTTACCGGC TATTCCAATA GGTAATTCTC AACCGACATT AGCATCAACC CCAAATATTC ATTCTGTTTC TAAATCTGGT ATCAAATTAA ATCTTACCAG TCGCCTAATG CAAAATCTAG GTATTAATCT CTGGCAGTGG CTATTTCAAG GAGAAATTGC TCATAGTCTC CACCAAAGTC AAGGTATTGC GATTGGCCAA GAGCTGCCCT TACGGGTAAG GTTAGATATT CGGGAACCTG AATTAATTGC TCTACCCTGG GAAATTATGC AACCAGGAGT GGCTTTACCC GCTTTTTCCC TAAGTCAGGA AATTTTGTTC AGCCGTACCA CCAGTGATGT TAACCCCTTA CGCAACCAAG CACCTTCCCA ATGCCTAAAT ATTTTGTTGG TAATTGGGGA AAGTAGCCCT AAGGGAAAAT CTTCTACAGG TAATGGCCAT AAATCTCTAG TTCTATCCAA ATTAAAACTT GAGGAAGAAG TTGCCCAATT AATTGAGGTT TTGAAGGCTC GCAACATCAC AAACTCAAAT ATACCTAGAG TTAATCCTAC TATTCCTTGT AGAGTGGATA CACTAATTCA GCCCACTCCC AAAGAATTAA CTTCCTATCT AGATAAAAAA ACTTATAATG TAGTTTTTTA TGCTGGTCAC GGTATACCAG GTCCAGATGG AGGTTGGTTA TTTTTAGCGC CTGATACAAC CCTCAATGGT ACTGAGTTAG CACAAATTTT AGTGCGCAAT GGAGTAAGGT TGGCAGTATT CAATGCTTGC TGGGGAGCAC AACCTGCTAC AGAGCGTCTA TCTTCTGGTG AGGTACAGGC AATACCACGC AGTAGTCTGG CAGAAGTATT AATCCATCAT GGAGTACCTG CGGTTTTAGG GATGCGAGAT GAAATTGCTG ATCGAGAAGC TTTAAGTTTT ATTCAAGTTT TTGCTCAAAG TTTGACGGAG GGAATGTTGA TAGATCAGGC AGTAGTAATC GCAAGACAGC AGTTATTAAC TCTTTATAGG TTTAATAAGC CAGCTTGGAC TTTGCCAGTA TTGTATATGC ACCCGGAGTT CAATGGTCAA TTAGTTCAAG TATTTGATGA ATTAGTAACT CAACTACCTA CTAACTCTCA GACTTGGATT AACGGTTACA CTTCTAAAGC TTTTTTACGT TCTCAAGATG ATAATAATCA AGTTTGGCCA ATTTTGATTG ATCCAATCGC TGTTGGACGT TCTCAGGAAA ATGATGTGGT GATTTGGGAG CGGTGGGTTT CCCAAAAACA CGCGGAAATT TTTTGTCGCT GCTTGCCGAA TGAAGAACTT GAACCTACTT ATTTTTTACG AGATATTTCT CGTTTTGGGA CTTTGATTTA TCGGTCTGGT ACTTGGCAAA GAATACATCG TGATCAGCTT GTTATAAAAT CAGGAACACT GTTAAAATTT GGCAGTTCTC AAGGTCAAGT TTTTGAGTTT GTGATTGAAA CAACAGAAGA CCTAAGTTAG
|
Protein sequence | MLFPSESPCL SIALARLSIP EANHFAVWVM QAPFQRGYVH HDQVWPETLS KAWQTWLEVF SPQSLPAIPI GNSQPTLAST PNIHSVSKSG IKLNLTSRLM QNLGINLWQW LFQGEIAHSL HQSQGIAIGQ ELPLRVRLDI REPELIALPW EIMQPGVALP AFSLSQEILF SRTTSDVNPL RNQAPSQCLN ILLVIGESSP KGKSSTGNGH KSLVLSKLKL EEEVAQLIEV LKARNITNSN IPRVNPTIPC RVDTLIQPTP KELTSYLDKK TYNVVFYAGH GIPGPDGGWL FLAPDTTLNG TELAQILVRN GVRLAVFNAC WGAQPATERL SSGEVQAIPR SSLAEVLIHH GVPAVLGMRD EIADREALSF IQVFAQSLTE GMLIDQAVVI ARQQLLTLYR FNKPAWTLPV LYMHPEFNGQ LVQVFDELVT QLPTNSQTWI NGYTSKAFLR SQDDNNQVWP ILIDPIAVGR SQENDVVIWE RWVSQKHAEI FCRCLPNEEL EPTYFLRDIS RFGTLIYRSG TWQRIHRDQL VIKSGTLLKF GSSQGQVFEF VIETTEDLS
|
| |