Gene Information |
Locus tag | Tery_3417 |
Symbol | |
ID | 4244454 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 5229829 |
End bp | 5231175 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 638108397 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_722987 |
Protein GI | 113476926 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| |
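The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the CDS is complete and ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 5229829
END_BP = 5231175
GENE_LENGTH_BP = 1347
PROTEIN_LENGTH_AA = 448


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon.

    Assumes an unspliced, complete CDS whose length is a multiple of 3.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1347
print(expected_protein_length(GENE_LENGTH_BP))  # 448
```

Under these assumptions both results agree with the Gene Length and Protein Length fields in the record.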
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.462451 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.146874 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAACC AAAAATCACT TGATAAATTA CAGAAACTAA TTAAAGATAC TATTAATCAA GCAATAAATA ATGGGGATCA AGCTAGCCAT CAAAAATTGA CTAACAATTT TGCTGGTGTC AACCTTGAGT CCGTTGACCT GAGTATGAGT GATCTAGATG ATGTTAATTT GAGTGAAACT ATTCTCCGGG GTGCTTATCT ACTTTGTGCT AATTTGAAAA GAACTAACTT TAGTAATAGT GATCTGAGTG GTGCTAATCT AAGTGGGGCA ATAATGTGGT TTGCTAACTT TAGTAAAGTA AATTTTAAAG GGGCAGATAT CAGAGGTGCT AGTTTCAAAA GTGCTAACCT CAAAGGTGCT GACCTAGAAG GTGCTAACCT CTGGCGTAGT GAGATTACAG ATGCATACTT AATTCAAGCT AATTTATTAT ACGCTAATTT AATTCGTGCT AATCTTAGAA ATAGTAATCT GACAAATGTT AACTTGAGTT ATGCGGAACT TAATGATGCA AACCTCAACG AAGCTAATCT TATAGGTACT AATTTTAGTT ATGCTAATTT GAGTAATGCT ACTCTTAAAG ATGCTAATCT CAAAGATTCT AATTTATCTA ATGTTAATCT TGTTGGAACT CAATTAAATG GAGCTAATCT TGAAGGTGCT AATCTTGAAG GTGCTAATCT TATAGGTACT AATTTTAGTG ATGCTAATCT TAATTACACA AAGTTGAGGA ATACTAATTT AAATCATGTT AATTTAAGAG GGGTGAAAAT TAATCATGGA ACAGAATTAG ATAATAAGTG GTATCTGGTT TGGGATATTA TTAATCATGG TGCTTATGGG AGAAATTTAA GTGGTGTTCA TCTGGAAAAT GCTGATCTTA AGGGTGCTAA TATCGGCACT GCTAATTTAA CTAATGCTAA TCTGGAATAT GCTAACCTCA GATATGCTAA CCTTAGTAAT GCTAATCTCG CTCATATTAA ATTAACTAAT GCTAATCTAG CAGATATTAA TTTGATTAAG GCAAATTTAT ATAGTGCAAA TATGCAAGGA GCTAACCTGA GCAATACTTT ATTATTTAAT TCTATTATGA CGGGTGCTTT ATTAAATCAA GCTAAGCTGC TAAAAGCACA ATTGTGTGAT GCGGAATTAT CAAATGCAAA GTTAATTATG GCAAATTTAA TGAACGCTAA CTTGAAAAAT GCTAGGTTAT TAGGGGCTGA TTTAAGGAAG ATAAATCTAG AAGGTGCAGA ATTAGATGGT GCTATATTTG GTAATAATAT GGGAGTTTCT GAGAAGATGA AGCAGGATTT AATTAAACGT GGGGCTGTTT TTAAAGATAA GTTTTAG
|
Protein sequence | MANQKSLDKL QKLIKDTINQ AINNGDQASH QKLTNNFAGV NLESVDLSMS DLDDVNLSET ILRGAYLLCA NLKRTNFSNS DLSGANLSGA IMWFANFSKV NFKGADIRGA SFKSANLKGA DLEGANLWRS EITDAYLIQA NLLYANLIRA NLRNSNLTNV NLSYAELNDA NLNEANLIGT NFSYANLSNA TLKDANLKDS NLSNVNLVGT QLNGANLEGA NLEGANLIGT NFSDANLNYT KLRNTNLNHV NLRGVKINHG TELDNKWYLV WDIINHGAYG RNLSGVHLEN ADLKGANIGT ANLTNANLEY ANLRYANLSN ANLAHIKLTN ANLADINLIK ANLYSANMQG ANLSNTLLFN SIMTGALLNQ AKLLKAQLCD AELSNAKLIM ANLMNANLKN ARLLGADLRK INLEGAELDG AIFGNNMGVS EKMKQDLIKR GAVFKDKF
|
| |
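The sequence fields can be checked against the GC content and Translation table entries in the same way. The following is a minimal sketch, not the database's own method: the helper names are hypothetical, the input is assumed to be the Gene sequence field pasted as-is (10-base blocks separated by spaces), and only the amino acid assignments of NCBI translation table 11 are modeled (its alternative start codons are ignored, which does not matter here because the gene starts with ATG).

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, listed in the
# conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the Gene sequence field above.
print(translate("ATGGCTAACC AAAAATCACT TGATAAATTA"))  # MANQKSLDKL
```

Passing the full Gene sequence field to `translate` and `gc_percent` gives values that can be compared with the Protein sequence field and the reported GC content of 33%.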