Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4098 |
Symbol | |
ID | 4245611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 6319091 |
End bp | 6321223 |
Gene Length | 2133 bp |
Protein Length | 710 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 638108998 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_723579 |
Protein GI | 113477518 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.202207 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAGCC GTGATTCAGG GATTAATAAC AACCCAGAGG CATCGTGGCA TTTAAAATCT AAGCAACTAC CAATGTGGGT TCGTAGATGT GGTGCTTGGG CTACGGAAAT ATCCCTCGTT CTAGTGAGTG GAACAGTTCC ATTTTGGCTA GGTCAGTTAG GTAACATAGG TAACAAATCT GTTCCTCTTA ATCCTGCTGT AGCAAAAACT AAAGAAGTAA TTGCCAATAC CTTCGGAATT CCTTTGCGAA ATCGTTATCA AAAAGTAACC CCTTTAGCTA ATTTACTATG GTCAACAGCA TTATTAACTC CTGCAGTAAT TATAAGTTGG CAGTTATTTT TATTAGCGAC AACAGGTCAA ACATTACCTA AACGTTGGTT TGAAGTTAGA GTAGTGGCAG CTTTAGGAAA TTCTCCTGGT TGGAAAAAAA CCTTGATTCG TGAGGGTTTG GGAAAGTGGG GATTTCCCAT GGGAACTGCC TACTTAATTT GGCGTATGAC TGGAGCATTT CCTTCTTTAG GAATCCTTTT TTTCTTAGGA GGATTAACTT CCTGTATTGA TATTTATTTT GTTCGTTTTA ATAAACTGGG TCGTACAGGG CATGATAAAT TAGGAAATAC TTTTGTTATA GATGCTCGCT CTGGTCAATT AGATTCTAAT CCCTTTAATT TAAGTGATAC TAATAGAAAA ACTTCCATAG CAACAATTGT TCTGAGTGAA AAAAAATCAG AAAGTTTTCA TCTATGGTCT TGGATGCGAA AAAATCCAGG GGTGACTTGG CTAATTGTGA CTAGTGTAGG TATAGTCTCA GTTTTGGGAA CTATTTTTGG TACTCACGTT TATATTCAAA ATCAAGCAAA TTGGCGTGAG TTTAAACAAC AAGATAATGA TATGTTTTTG GCTCTTATCA ATAAATTGTC ATTAACTTCC AATGAACCTC AGCAACGACG AGTATCAGTT TTAGCCCTTG GGTCCACTCA AGATCCCCGT GCCATACCAT TATTGGTAGA TTTGTTAGCA CAAGAAAAGG AACCAAGCAT ACTTGACGGA ATTCAACAAT CTTTAGTTAG TGCTGGACCT ACTGTGCTGC CAGAACTGCG TCGCTTAAAT CAAGCTCTTA AAAATGATCT GGAGTCTTTA AGTTATGGTA GTAATACTAA AGAGCAAGTC TTAGTATCTC TGCGAATACG AGCCACTCAA AAAGCGATCG CCAAAATTCT CAAAGTTTAT AGTGGCTCAC TTTCACTTAG AGATATTAAC TTGAGTGGGG TCGATTTAGG TCAAATTACT TCGACTCCAG CAAGTTTCAC CTTAATCTTA GAAAACACTG ATTTATCTGG AATTAAGCTC AGAAGTGCTA TTCTCAATCA GGCTAGCTTT AAAAATAGTC GCTTCTATGG ACCTGGAAAA GATAATCGTA TAGGAACTTT TGATGACTCA ATTGCCGATT TAAGTGGGGC CAATCTGATA GAAGTCAATT TAACAGGTGC TGTATTAGGT CCAGTTATTA TGAAACGTGC AGATTTATTC AGAGCAACTC TTAGTAAAGC TATAATGCCA GGCTCCACCA TAACACAAGC AAATTTCAGT AGTGCCAAAT TAATAGAAAC TAACTTACAC CAAGCAAACT TAACAGAGGC AACTTTTACT GGTGCAGACC TAGGAAGTGC TGATTTATCA AAGGCTAATT TATACAGAGC TAATTTAAGT AAAGTCAAAG CTGAAGGTAC TACTTTCCAA TTATCTGATC TAAGAGAATC TAACTGGCAA GGAGCAAATT TATCGGGAGC AAATTTCAGT AGAGCTAATT TAAAAAAAGC AGACCTAAGT TTAGCCTTAT TAACTAATGC CAACTTTCGG AATGCTCAAC TGCAAAATGC TAATTTGCGA AATACCGATA TAAGTTTAGC AGATTTGCGA GGAGCAAATC TAAGTGGAAC TGATTTCAAA GGAGCAAAAT TTGCAGCTCC AAAACCTAAT CAGACGGATC AATTTTTGGC CAGCCCAATA GATACATTTA AATCCGATCA TTTAAGAGGA GTTAATTTCA ATTCTGCTAA AAACTTAAGC CCTAATCAAA TTAATTATAT TTGTAAACAG GGAGGAATTC ACGAAAAATG TGGGGGCAAT TGA
|
Protein sequence | MTSRDSGINN NPEASWHLKS KQLPMWVRRC GAWATEISLV LVSGTVPFWL GQLGNIGNKS VPLNPAVAKT KEVIANTFGI PLRNRYQKVT PLANLLWSTA LLTPAVIISW QLFLLATTGQ TLPKRWFEVR VVAALGNSPG WKKTLIREGL GKWGFPMGTA YLIWRMTGAF PSLGILFFLG GLTSCIDIYF VRFNKLGRTG HDKLGNTFVI DARSGQLDSN PFNLSDTNRK TSIATIVLSE KKSESFHLWS WMRKNPGVTW LIVTSVGIVS VLGTIFGTHV YIQNQANWRE FKQQDNDMFL ALINKLSLTS NEPQQRRVSV LALGSTQDPR AIPLLVDLLA QEKEPSILDG IQQSLVSAGP TVLPELRRLN QALKNDLESL SYGSNTKEQV LVSLRIRATQ KAIAKILKVY SGSLSLRDIN LSGVDLGQIT STPASFTLIL ENTDLSGIKL RSAILNQASF KNSRFYGPGK DNRIGTFDDS IADLSGANLI EVNLTGAVLG PVIMKRADLF RATLSKAIMP GSTITQANFS SAKLIETNLH QANLTEATFT GADLGSADLS KANLYRANLS KVKAEGTTFQ LSDLRESNWQ GANLSGANFS RANLKKADLS LALLTNANFR NAQLQNANLR NTDISLADLR GANLSGTDFK GAKFAAPKPN QTDQFLASPI DTFKSDHLRG VNFNSAKNLS PNQINYICKQ GGIHEKCGGN
|
| |