Gene Information |
Locus tag | Tery_3473 |
Symbol | |
ID | 4244473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 5348974 |
End bp | 5350791 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 638108447 |
Product | hypothetical protein |
Protein accession | YP_723036 |
Protein GI | 113476975 |
COG category | [S] Function unknown |
COG ID | [COG5305] Predicted membrane protein |
TIGRFAM ID | |
|
|
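The coordinate and length fields above are consistent with each other, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon that is not counted in the protein length. The function names are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Tery_3473.
start_bp, end_bp = 5348974, 5350791
length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1818
print(protein_length(length_bp))  # 605
```

Both results match the Gene Length (1818 bp) and Protein Length (605 aa) fields in the record.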
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGAAA ATAACATTAA ACTACTACTT AAAAACCAAT GGTTTCACCT AATAATATTA TTATTCTGGC TAACAGTTGG CATTATGATC CGGATCACAA ACTTAGCCGC AAAACCAGCC TCATCCATTG AAATTGCCAC ATTAGGCTAT AGCTTAGGGC ATAGCTTTTT TGACCTACCC TTAGATCAAA TTATCACCCT AAATGAACTA CTCTCACCCT TAAAATTTGA ATCAACCTCT ACCGCCATCG ACGTAGTTGA TCAGTTACTA AGAGAAGATA CCCATCCCCC AGTCTACTTT GTATTAAGTC ATTTGTGGCT TAAGTTATTC AGCACTGACG GAGAAATAGT TTCTCTCTGG GCTGGGCGTT CCCTAAGTGT TATTTTAGGC GTAGCAGCCA TTCCAGCAAT TTTTAGCTTA GGTAAGCTAG CATTTTCCCC ATTAGTAGGT CATATAGCAG CCGCATTAAT GGCTGTTTCT CCTTATGGCA TTTACCTCGC CCAAGAATGT CGCCATCATA CCCTAACCAT ATTATGGACT ATTGCTTCTA TAGCTTGCTT AATTAAAATA GCACCCTATA TCAAAAAACA TAAAGCATTT CCAATTTGGT TAGGCTCCGC CTGGGTTGCC ATCAATAGCT TAGGAGTTGC CACCCATTAT ATTTTTATTT TAGTGTTAGC TACAGAAGGT TTAGTTATAG GAGTTTTTTG GCTAAAAGAT ATAGAAAACA GACTTCAAAA TTATTGGTGG CGAATTTATC TAGTAGCTTT AGGAACATTT GTTAGTTGTT TAGTTTGGTT ACCCGTAGTC ACCAGCGCTG CTAATAATAA ACTTACTGGG TGGATATCAA CCAGCTTCGA TCTTGATGAA ATTTGGCACC CTATCCCCCG TTTGTTAGGT TGGACTCTGA CAATGGTATG GCTTTTGCCC GTTGAAGGAA CAAATTTATT TGTAACTATC TTATCCGGAG TCACCCTTTT AGTTGCATTA TTGTGGGTTA TTCCCAAACT ATGGCAAGGA GGGAAAGCAC AGATGAGGGA TCTTCCAAAC CGTTTATTTT TTCAAATATT TGTGAGTTTT TTAGTAGGAG CGATCGCCTT ATTTTTAGTA ATTATTTATG GTATGGGTAG AGACTTATCT CTTGCTGCCC GCTATCAAAT TGTTTATTTT CCTGTTGTAA TTATTTTATT AGCCGCAATA TTAGGGAAAT GTTGGAACAG CTCAGAAAAA GAAACAGAAG TAAAAAAAGT TTTTTCTGAC AAACAGACGG GTCAAGTCAA AAAGGAAAGA GTAATAAAAA AAGAGTTAAT TATTAGTTCT GCCACAAGGA TAATTGTTTC CAACTTAAAA CCAATCAATA AAAGAGTTGT AATTGTAGTT TTGTTAATCA GTTTTTGGGG TGGGTTAACA GTAATTAACA ACTATGGTTA TCAAAGGTCA AGGCGTGCAG ATATTCTAGT TAAAGAGATG CAAACACAAT CTAAAGCAGC GCCATTAATT GCGACAACCT ATCAAACCCA TGCAGAAATT CGTGCTTTAA TTGCACTTGG TTTGGAGTTA AAACGCCAAG AAGATAAAAG CAATACATCG GGAAATTTTC AACCTCAATT TATATTAGCT AAAAGACAAC AAAATAAAAA ATTAACCCCA GATTCAACTT TAGCTAAATT TCTATCTCAA AAATCAAAAC CAATTGACTT ATGGGGAATT AACTTAAAAA TAGAAGCCAG GGAATTAGAA GCTTTTAACT GTCAAAAATA TTCTGGGAAT CAACCAAAAA TTAATGGTTA TAGTTATAGG 
TTGTATCATT GTCGTTAA
|
Protein sequence | MKENNIKLLL KNQWFHLIIL LFWLTVGIMI RITNLAAKPA SSIEIATLGY SLGHSFFDLP LDQIITLNEL LSPLKFESTS TAIDVVDQLL REDTHPPVYF VLSHLWLKLF STDGEIVSLW AGRSLSVILG VAAIPAIFSL GKLAFSPLVG HIAAALMAVS PYGIYLAQEC RHHTLTILWT IASIACLIKI APYIKKHKAF PIWLGSAWVA INSLGVATHY IFILVLATEG LVIGVFWLKD IENRLQNYWW RIYLVALGTF VSCLVWLPVV TSAANNKLTG WISTSFDLDE IWHPIPRLLG WTLTMVWLLP VEGTNLFVTI LSGVTLLVAL LWVIPKLWQG GKAQMRDLPN RLFFQIFVSF LVGAIALFLV IIYGMGRDLS LAARYQIVYF PVVIILLAAI LGKCWNSSEK ETEVKKVFSD KQTGQVKKER VIKKELIISS ATRIIVSNLK PINKRVVIVV LLISFWGGLT VINNYGYQRS RRADILVKEM QTQSKAAPLI ATTYQTHAEI RALIALGLEL KRQEDKSNTS GNFQPQFILA KRQQNKKLTP DSTLAKFLSQ KSKPIDLWGI NLKIEARELE AFNCQKYSGN QPKINGYSYR LYHCR
|
| |
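The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as shown above, in blocks of ten bases separated by spaces; the helper name `gc_percent` is hypothetical. It is demonstrated only on the first ten bases of the gene, not on the full sequence.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100 * gc_count / len(bases)


# First block of the gene sequence: two G, no C, out of ten bases.
print(gc_percent("ATGAAAGAAA"))  # 20.0
```

Passing the full gene sequence string to the same function would give the value to compare against the GC content field.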