Gene Information |
Locus tag | Tery_3869 |
Symbol | |
ID | 4243532 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 5980539 |
End bp | 5985200 |
Gene Length | 4662 bp |
Protein Length | 1553 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638108799 |
Product | WD-40 repeat-containing protein |
Protein accession | YP_723381 |
Protein GI | 113477320 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
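The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions fit the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but does not encode a residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length(5980539, 5985200)
print(length)                  # 4662
print(protein_length(length))  # 1553
```

Both results agree with the Gene Length (4662 bp) and Protein Length (1553 aa) fields listed for Tery_3869.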
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.113625 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0582417 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTATAG AAGCTTTAGT TGTTGGAATT AATGAACATG TATTTGAACC TGGTTTAAAC CTTAAAGCTC CAGTTAAAGA TGCTGAAGCT ATTGCTGAAA TGTTGGAAAA GTATGGCAAT TTTCATGTTC AAGGTTTGCC AAAAGATTAT GATGAAACTG GTAGGGAAAG ATTTATATCT GATGGCTTTG TTTATAGGAA AAACCTCAAA ATAAAAGTTA GTAATTTATT TAATCCTATC TCAAAAAATG AAATTCCTGA TGTGGCTTTA TTCTTTTTTG CTGGTCATGG TTTTGTAACA ACAGAAGGGG GTGTTAGAGA AGGATTTTTA GTAACTAGTG ATGTGCAATT AAAAAGGGAT ATTTATGGCA TATCTTTAAG TTGGTTGAAA GATTTATTAC GCCAAAGTCC GGTGAAAAAA CAAATTGTTT GGTTAGACTG TTGTTTTGGT GGGGAATTAT TAAATTCTCA AGAGGCTGAC CCTGGTACTG GAAAAGAAGT AAGTCGCTGT TTTATTACTT CTTCTCGTTC TTTTGAAAAA AGTGTGTCAG AAATAGATGG AAAACATGGT GTTTTTACGG CTAATTTACT TAAAGGTTTA AATCCAGAAA GTTCTATTGA TGGTTGGGTT ACTAACTATA TTTTGGCAGA TTCTATTAAG AATAATATGC TGAATATTTC TCAATCTCCT GTTTTTCATA ATTCAGGAGA TGCGATTATT CTGACTACTA ATACTCAGAC TAAATCTCTA GACGGACGTT GGAAAAAAAC TCCTCCTTAT CGGGCTTTAT CTTATTTTAC TGAACAGGAA AAAGATGGGG TATTTTTTTA TGGTAGAACT CAGTTGACTG ATGAGTTAAT TGACCGAGTT AGAACTAATA ATTTTGTGGC GGTTTTGGGG GCTTCGGGTA GTGGAAAGTC TTCTTTGTTA CGGGCGGGTT TGTTATATCA ATTGAAGCGG GGGCAGAAAA TATCTGGAAG TGACAGATGG TTATATATTA AGCCTTTTAC TCCGAGTTCT TCTCCTCTGG AAAGTCTGCA AAAGGCAGTT AATATAAAGA GTGAAAAAAT AGAGAATTTA ACGGAAAAAC TGATCGATTT TATTAATGAA GTAAAGGCGG AAAAAGTCTT GATGGTAATT GACCAATTTG AGGAGTCTTT TACTCTCTGT GAAACTGATG AAAAACGACG GGAATTTTTT GATTGTTTTT TGTCTGTTTT GGCAAATGAA AAAACTAAAA ATAAGTTTTG TTTGGTCTTA GGAATGCGGG CAGATTTTCT TGACCAATGT TCAAAATATT CTGGGTTGGC AACAGAAATT AAGGAGCATC AATTATTAGT AACTCCTTTG GAAAAGGATG AGATAGAGGA AGCAATTAAG AAGCCGGCGG AATTAGTGGG AATGGGAGTA GAACCTAAGT TAGTTGCTCA AATGGCAGCA GATTTTTTGC GTAATCCTGG TAGTTTACCT CTGTTACAAT ATACTTTGGA TGCTCTGTGG AAATCTGCTA CTCAAGGGGA GGATAAAAGT CAATATTTAA CTCTGGCAAG TTATGAAAAG TTGGGTGGAA TTCAGGGTAC TTTGACGAAG CAAGCTGATG CAGTTTATGA GAGTTTGAAT AAAGAGAAAA AGTCGGTTGC CAAGAGGATA TTTTTAGAGT TAGTTCAGCC TGGGGAACGG TCAGTTAATT TTCTCAAGGT AACGGATACT CGGCGGCGAG TAATTTTGGA AAAGTTGCCT AATAAAGAAC ATACTTTAGA GCTTTTATTG GAGGTTAGTG ATAATTTGGC TGACCCAAAT 
AATCGATTAA TTACTAAGGA TAAGTCAGAG GCAGGAACAT TATTAGATAT TATTCATGAG GACTTAATTA GAAGTTGGAA AACTTTGAGG AAATGGGTGG AGGAATATCA AGAGGCTTTG CCGGTGGAAA GGAAAATTGA GGCTGATGCG GCTGGGTGGG AAAAGGATGG AAAAAATGAG GGTTTGTTAT TACGAGAAAC TCAGTTAACT AAGGCAGAAG AATATGTGAC AAAGTATGGG GATATGGGTT TGTTGGATGG GTTGGCTTAT GAATTTATTG AGGCGAGTCA GGAGTTGAGA ATTCGTGAGG AAAAGGAAGA AAAGGAACGT AAAGAAAGGG AGTTAGAGCA AGAAAAGAAA GCTAGAAAAT TAGCTCAAAG ACGGAATCAA ATTTTGGGTG TTTCTTTGGT TTTAATGACT GGGGTATCTG GTTATGCTTG GATGCAGCAA AACATAGCTA AACATAATAG TGAAATATTC TTTACCCGTC AGTTAGCGGC AAAAGCAGAA TTATTGACAA CTTATAACAG TTATGACACA ACTGTTTTGT TGGGAGTCCA GTCAATGAAC AGAATTCAAG AATTTAAAGA ATGGCAAGAC TCTGGGTGGC GAAAAGTGGT TAGGAAATTT TTAGGGAGTC AGTTCTCAGA TATACCTCAA AATGCCGCTG ATGGAGCTAT TCGCAAAGGG TTAACTCAAC TGCCTGATCA TCTCCATACT CTCAACCACC AGGACAGGGT AATAGCAGTA GCCTTTAGCC CCGACGGCAA AACCATTGCA ACTGCAAGTT ATGACAATAC CGCCCGCCTC TGGGATACTG AGAATGGCAA TGTATTAGCT ACTCTCAACC ACCAGTCCAG GGTAAGAGCA GTAGCCTTTA GCCCCGACGG CAAAACCATT GCAACTGCAA GTTCTGACAA AACCGCCCGC CTCTGGGATA CTGAGAATGG CAAAGAATTA GCTACTCTCA ACCACCAGGA CTCGGTAAGA GCAGTAGCCT TTAGCCCCGA CGGCAAAACC ATTGCTACTG CAAGTAATGA CAAAACCGCC CGCCTCTGGG ATACTGAGAA TGGCAAAGAA TTAGCTACTC TCAACCACCA GGACTCGGTA AGAGCAGTAG CCTTTAGCCC CGACGGCAAA ACCATTGCTA CTGCAACTTC TGACAAAACC GCCCGCCTCT GGGATACTGA GAATGGCAAT GTATTAGCTA CTCTCAACCA CCAGTCCAGG GTAAGAGCAG TAGCCTTTAG CCCCGACGGC AAAACCATTG CAACTGCAAG TTATGACAAA ACCGCCCGCC TCTGGGATAC TGAGAATGGC AAAGAATTAG CTACTCTCAA CCACCAGTTC TGGGTAAATG CAGTAGCCTT TAGCCCCGAC GGCAAAACCA TTGCTACTGC AAGTTCTGAC AATACCGCCC GCCTCTGGGA TACTGAGAAT GGCTTTGAAT TAGCTACTCT CAACCACCAG GACAGGGTAT GGGCAGTAGC CTTTAGCCCC GACGGCAAAA CCATTGCTAC TGCAAGTGAT GACAAAACCG CCCGCCTCTG GGATACTGAG AATGGCAAAG AATTAGCTAC TCTCAACCAC CAGTCCTCGG TAAATGCAGT AGCCTTTAGC CCCGACGGCA AAACCATTGC TACTGCAAGT AGAGACAATA CCGCCCGCCT CTGGGATACT GAGAATGGCA AAGAATTAGC TACTCTCAAC CACCAGGACA GGGTATGGGC AGTAGCCTTT AGCCCCGACG GCAAAACCAT TGCAACTGCA AGTTTAGACA AAACCGCCCG CCTCTGGGAT ACTGAGAATG 
GCTTTGAATT AGCTACTCTC AACCACCAGG ACTGGGTAAG AGCAGTAGCC TTTAGCCCCG ACGGCAAAAC CATTGCAACT GCAAGTTATG ACAATACCGC CCGCCTCTGG GATACTAAGA CTCGCAAAGA ATTAGCTACT CTCAACCACC AGGACTGGGT AATAGCAGTA GCCTTTAGCC CCGACGGCAA AACCATTGCT ACTGCAAGTA GAGACAAAAC CGCCCGCCTC TGGGATACTG AGAATGGCAA AGTATTAGCT ACTCTCAACC ACCAGCTCGA TATAAATGCA GTAGCCTTTA GCCCCGACGG CAAAACCATT GCTACTGCAA CTTCTGACAA AACCGCCCGC CTCTGGGATA CTGAGAATGG CAAAGTATTA GCTACTCTCA ACCACCAGTC CAGGGTATTT GCAGTAGCCT TTAGCCCCGA CGGCAAAACC ATTGCAACTG CAAGTTATGA CAAAACCGCC CGCCTCTGGG ATACTGAGAA TGGCAAAGTA TTAGCTACTC TCAACCACCA GTCCTCGGTA AATGCAGTAG CCTTTAGCCC CGACGGCAAA ACCATTGCAA CTGCAAGTTA TGACAAAACC GCCCGCCTCT GGGATACTGA GAATGGCAAA GTATTAGCTA CTCTCAACCA CCAGTCCTCG GTAAATGCAG TAGCCTTTAG CCCCGACGGC AAAACCATTG CAACTGCAAG TTCTGACAAA ACCGCCCGCC TCCATTGGAC AACACCAAAA GGCTTAATTC AAGAAGGTTG TCGCCGTCTG AGTCGGAATT TAACAGCAGA GGAGTGGCAG CAGTATATCA ACAGCGACTT GGAGACATAT CAGAAAACTT GCAAAAATAT TCCCGTTCAT CCTAGTTTAA TTGCAGAAGC TAAAAATCTT GCAAAAACAG GGGAAAAACC GAAAATCAAA CAGGCAATTT CTATCTTCAA AAAAGCTCTA GAATTGGAAC CAGAAATTGA CCTCGATCCT GATACAGAAA CTAGAGAAAC AGACCCCCAA CTTGTAGCAA ATAAACTTGC TGCTTCTGCC AAATTAAAAT AA
|
Protein sequence | MRIEALVVGI NEHVFEPGLN LKAPVKDAEA IAEMLEKYGN FHVQGLPKDY DETGRERFIS DGFVYRKNLK IKVSNLFNPI SKNEIPDVAL FFFAGHGFVT TEGGVREGFL VTSDVQLKRD IYGISLSWLK DLLRQSPVKK QIVWLDCCFG GELLNSQEAD PGTGKEVSRC FITSSRSFEK SVSEIDGKHG VFTANLLKGL NPESSIDGWV TNYILADSIK NNMLNISQSP VFHNSGDAII LTTNTQTKSL DGRWKKTPPY RALSYFTEQE KDGVFFYGRT QLTDELIDRV RTNNFVAVLG ASGSGKSSLL RAGLLYQLKR GQKISGSDRW LYIKPFTPSS SPLESLQKAV NIKSEKIENL TEKLIDFINE VKAEKVLMVI DQFEESFTLC ETDEKRREFF DCFLSVLANE KTKNKFCLVL GMRADFLDQC SKYSGLATEI KEHQLLVTPL EKDEIEEAIK KPAELVGMGV EPKLVAQMAA DFLRNPGSLP LLQYTLDALW KSATQGEDKS QYLTLASYEK LGGIQGTLTK QADAVYESLN KEKKSVAKRI FLELVQPGER SVNFLKVTDT RRRVILEKLP NKEHTLELLL EVSDNLADPN NRLITKDKSE AGTLLDIIHE DLIRSWKTLR KWVEEYQEAL PVERKIEADA AGWEKDGKNE GLLLRETQLT KAEEYVTKYG DMGLLDGLAY EFIEASQELR IREEKEEKER KERELEQEKK ARKLAQRRNQ ILGVSLVLMT GVSGYAWMQQ NIAKHNSEIF FTRQLAAKAE LLTTYNSYDT TVLLGVQSMN RIQEFKEWQD SGWRKVVRKF LGSQFSDIPQ NAADGAIRKG LTQLPDHLHT LNHQDRVIAV AFSPDGKTIA TASYDNTARL WDTENGNVLA TLNHQSRVRA VAFSPDGKTI ATASSDKTAR LWDTENGKEL ATLNHQDSVR AVAFSPDGKT IATASNDKTA RLWDTENGKE LATLNHQDSV RAVAFSPDGK TIATATSDKT ARLWDTENGN VLATLNHQSR VRAVAFSPDG KTIATASYDK TARLWDTENG KELATLNHQF WVNAVAFSPD GKTIATASSD NTARLWDTEN GFELATLNHQ DRVWAVAFSP DGKTIATASD DKTARLWDTE NGKELATLNH QSSVNAVAFS PDGKTIATAS RDNTARLWDT ENGKELATLN HQDRVWAVAF SPDGKTIATA SLDKTARLWD TENGFELATL NHQDWVRAVA FSPDGKTIAT ASYDNTARLW DTKTRKELAT LNHQDWVIAV AFSPDGKTIA TASRDKTARL WDTENGKVLA TLNHQLDINA VAFSPDGKTI ATATSDKTAR LWDTENGKVL ATLNHQSRVF AVAFSPDGKT IATASYDKTA RLWDTENGKV LATLNHQSSV NAVAFSPDGK TIATASYDKT ARLWDTENGK VLATLNHQSS VNAVAFSPDG KTIATASSDK TARLHWTTPK GLIQEGCRRL SRNLTAEEWQ QYINSDLETY QKTCKNIPVH PSLIAEAKNL AKTGEKPKIK QAISIFKKAL ELEPEIDLDP DTETRETDPQ LVANKLAASA KLK
|
| |
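The sequences in this record are displayed in space-separated blocks of ten characters. As a minimal sketch of how such a field could be cleaned up and its GC content recomputed for comparison with the GC content field, the helpers below strip whitespace and count G and C bases. The function names are hypothetical, and the record does not say how its own GC value was calculated or rounded.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# The first two blocks of the gene sequence, as displayed in the record.
fragment = clean_sequence("ATGCGTATAG AAGCTTTAGT")
print(fragment)
print(gc_percent(fragment))
```

Passing the full gene sequence field through `clean_sequence` and then `gc_percent` would give a value to compare against the listed GC content.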