Gene Information |
Locus tag | Tery_3266 |
Symbol | |
ID | 4243687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 5011083 |
End bp | 5015132 |
Gene Length | 4050 bp |
Protein Length | 1349 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638108260 |
Product | filamentous haemagglutinin outer membrane protein |
Protein accession | YP_722851 |
Protein GI | 113476790 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2831] Hemolysin activation/secretion protein; [COG3210] Large exoproteins involved in heme utilization or adhesion |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
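The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; both are assumptions that happen to fit the numbers in this record, not statements about how the database defines them. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies three bases but does not encode a residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(5011083, 5015132)  # Start bp and End bp from the record
print(length, protein_length_aa(length))   # prints: 4050 1349
```

Both values match the Gene Length (4050 bp) and Protein Length (1349 aa) fields listed for Tery_3266.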
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCTTTTTT TTTACAAAAA TCTATTCCTC AGAATTGTTC CGCTATCTTT GACTCTGGGA TATTTTGGGG TTGATATTTC ATCGGCGATC GCTCAACTTC AACCAGATAA TACATTGGGA GAAGAAAATT CTGTCGTCAC ACCGAATATC AATATTAAAG GGATAGAAAG CGATCGCATA GATGGCGGGG CCCAAAGAGG GGCCAACCTA TTCCACAGTT TTCAAGAATT CAACATTCAA CAGGGACGAG GGGTTTACTT TAGCAACCCT GATGGAGTTC AAAATATCCT GACTCGGGTC ACAGGAAACA ATGGCTCTAA CATATTAGGA ACCTTGGGTG TAGATGGAAA AGCTGATTTA TTTTTCATTA ACCCCAATGG CATAATCTTT GGGCCAGAAG CTAAATTAGA TGTCCAAGGT TCATTCTATG GAGCTACTGC CGATAGTATC TTATTTAAAA ATGGATTTGA ATTTGCTAGT TCAGACCCTC AAGCGCCCCC ACTACTGACA GTTAATGTTC CCATTGGGTT GAGGTTGCCC GAAAACCCTG GTAGTATTTT GGTTGAGGGT CAAGGTCATA ACATAAATTT TTATCTAAAC CCAAAAGACG GAATAGCAAG ATTAGAAAAA GAGGACAGGA ATCTAGGGTT GGAAGTTTCC CAAGGCCAAA CCTTAGGGTT AGTTGGTGGT GATATATTTT TGAGGGGAGG AAACCTCACA TCAATTGGAG GACGTGTTGA ATTAGGAAGT GTTCACAATG GGGTAGTAGG AATCACGAAA GACCAAACCC TTAGCTATGA TCCAAACCTG AAATTTGGCC AAATTCTAAT ATCTGAAGAG AGTTCTATAG ATGTAAGTGG AAGTGGAAGA GTTGAGTTTC AAATTCAAGG AGAAGAGATA AAAGTGCGAG ATGCTTCAAT AATAGTAGGA GAGATTAAAG GCGATGAAGG TGGGGGGGCT TCAATTCTTA AAGCATCTAA GTCTATTATA ATAGAAGGTA GAATACCTTT TGAGAATAAT ATTACATCAT CATTCAACGT GTCTTTATTT TCTGATAAGC TCGAAGCAGG AGAAGCTTTA AACATTAATA AGACTAACAG TGCTATTATT CTTATCACTT ACGGTAAGGG AAATGTCGGG GAGTTGACGG TAGAAACAGA AAGTTTAACG CTGGATCCTG GAGCGTTTAT TTCTACGACT ACTTTGGGTC AGGGAAATGG AGGGAATTTA ACAGTGTTTG CTAAAGACAT AGAAATGAAT GGTGGGGACT TTTTCGGTGG TTTGGTTAGT GTTAGTGTTG ATCAAGGTAA CGGAGGGAAT TTAACGGTGA AAGCTGCAGA AAGTTTAACG CTGCAAGATG GAGCCATTAT TTCTACGACT ACTTTGAGTC AGGGAAATGC AGGGAATTTA ACAGTGTCTG CTAGAGATAT ACTCATTGGT TATTCTACTA TGTTGTCTGC AGAAACTTTG GGTCAAGGTA ACGGAGGGAA TTTAACGGTG AAAGCTGCAG AAAGTTTAAC GCTGCAAGAT GGAGCTTTTA TTTCTACCAA GACTTCTGGT CAGGGAAATG CAGGGAATTT AATAGTCTCT GCTAGAGATA TAGAAATAAT AGGCAAATCT CCTGATGGTA AATCCTTCAG TAGCTTAACC GCAGAAGCAA AAAAAGAATC AACAGGAGCA GCAGGAACCA TAACCATCAA CACCGAAACC CTCAACCTCC GGAACGAAGC CGAAATTATA GTCACCTCTG CTACCCAACA AGCTGCAGGT GACTTAATAA TAAACAGCAA CAACATCCGC 
CTCGAAAACC AAGCTAGCCT CAAAGCTGAA ACCAAAGCAG GTGAAGGCAG CATTACCATC AATAATAATA AAGACCTCAT CCTGCGTAAC AACAGCAACA TCACCACTAA TTCTGAAGGC ACGGCTACTG GAGGTAATAT TACCATTAAT ACGGAAAACC TAGTTGCCCC AGACAACAGC GACATCTCTG CTAATGCTGT CGAGGCATTT GGAGGCCGAG TCAACATCAC CGCAGCCGGC ATATTCGGCA CAGAATTCCG TACTGAAGCA ACAGACAAAA GCGACATCAC CGCCACCTCA GAATTAGGAG CTTCATTCAG TGGAGAAGTC AATATCAACA CCCCCGATGT AGATCCTAGT GACGGACTGA TAGAATTTGA GGATACCGTC CCAGATATCT CCAACCTCCT CGACCAAAAC CCTTGCCGAC AAGGACAAAA AAGCAAATTC ATCATTACAG GAAGAGGCGG CTTACCACCA ACTCTCGAAG AACCCTTGGC TCCCACCTAC GTCTGGCAAG ACTCAGAAAC CTCAGTTACT CCTCCACCAG CTATCACTCC ATCAGAGCAA GAGTTCATTG AAGCACAAGG TTGGAGCTCT AATTCTCAAG GTATTGAACT GATCACTGAC CCCGAAACTG CTACTCCCAC CTCCCCTTGG TTAATACCTC CCAACTGCGA ACAATTAGAC ACCTTTGAAC CCCCACCCTA TCTACTCGCA AGCAGTAAGG GAAATATTCC CACCTCAACT TCTGAACCTC TAAAAGTTAC TATCAAACAA TTCAACTTTG AGGGTAACAC AAAATTTAGC AACCAAGAGC TACAACAACA ACTCACACCT TATCTAAATA AACCCATTAC CTTTGCTCAA CTCATAGCAG CACGCACTGC CATTACAAAA TATTATACCA AAAACAACTA CATCACATCG GGAGCATTTA TTCCTCCCCA AACTATGGCT GAAAATGGCA CAGTCACCCT CCGAGTAGTG GAAGGAAAAG TTGGTGAAAT CAATGTCAAT ATCCAAGGTA GATTAAATGA AAATTATATC AAAAGTCGGT TAGAAAAAGC CACCACGGCT CCTCTGAATC AAGAAAAGTT GTTGTCTGCC CTACAAATAT TACAAATAGA CCCTTTAGTC AAAACTCTAT CTGCGGAACT TTCATCCGGA GTCCGCCCAG ATACAAGTAG GTTAGACATA AGAATAGAAA CAGCTAACCC CTGGCAAATA GAAACTATCA GTAATAATGG CAGGGCACCA AGTGTGGGAA CATTCAGGCG AGGGGTAGAG GTAGACCATA GAAATGTTAC AGGTATTGGA GACAGTTTAA GTGCTCTCTA TACTAATACT GATGGCAGCG ATATGGTAGA AGTATCTTAT GCTATTCCTG TGAACTCCAG TAATGGTCGT ATCGAACTAT TTTATCGCCA CAGAGATAAT AAGGTGGTAG AATCACCTTT TGAAAGACTT GATATTGAGT CAAATTCAAA TACTTACAAG TTATCATTTA GTCAACCAAT AATGCAAACA CCCTGGCAAA CTTTAAGTTT GGGTTTATCT GCAATAAAGC GAGATAGTCA AACTTCTATA TTAGGAGAGA ATTATCCATT ATCTGAGGGT GCTGATAAAA ATGGGGAAAC GAAGTTATCA ATTTTGCAGC TTTTCCAAGA TTATGAACTA CGGGGAAAAA ATCAAGTTTT GGCGTTTCAT TCTCAGTTGA ATGTAGGGTT GGGAATATTG GATGCTACAA AGAATAGTAA TGAACCAGAT GGTCGATTTT TTTATTGGCG GGGTCAAGGA CAATGGGTGC 
GTGAGTTAGG CAAAAATACT TTGTTGTTGA TGGGAGCAGA TCTACAGTTA TCTCCATCTG ATTTGGTGCC TCAGGAAAGG TTTGGTTTAG GGGGTTATCG AAACGTGAGA GCTTATCGTC AGGATACTCG CTTGACAGAT AATGGAGCGT TAGGAACAGT GGAGTTGCGG TTGCCCGTGC CCTGGATATC TGGAAAAAAT AGATTATTTC AGGTAGTGCC GTTTATTGAT GGGGGGGTAG CATGGAATAG TGATAGTAAA GAGGAGGAAG GAAGTAAGGC TTTGGCGGCG GCCGGGGTGG GGTTGCAGGT AAATTTATGG GAGAAGATAA ATATGCGCTT AGATTATGGA ATTCCTTTAG TGGATGTGGA CTCGCGAGAT AAAACGGCTC AGGAGGAAGG GTTTTATTTT TCTTTTTCTA CTACTCCTTT TTCTTTTTGA
Protein sequence | MLFFYKNLFL RIVPLSLTLG YFGVDISSAI AQLQPDNTLG EENSVVTPNI NIKGIESDRI DGGAQRGANL FHSFQEFNIQ QGRGVYFSNP DGVQNILTRV TGNNGSNILG TLGVDGKADL FFINPNGIIF GPEAKLDVQG SFYGATADSI LFKNGFEFAS SDPQAPPLLT VNVPIGLRLP ENPGSILVEG QGHNINFYLN PKDGIARLEK EDRNLGLEVS QGQTLGLVGG DIFLRGGNLT SIGGRVELGS VHNGVVGITK DQTLSYDPNL KFGQILISEE SSIDVSGSGR VEFQIQGEEI KVRDASIIVG EIKGDEGGGA SILKASKSII IEGRIPFENN ITSSFNVSLF SDKLEAGEAL NINKTNSAII LITYGKGNVG ELTVETESLT LDPGAFISTT TLGQGNGGNL TVFAKDIEMN GGDFFGGLVS VSVDQGNGGN LTVKAAESLT LQDGAIISTT TLSQGNAGNL TVSARDILIG YSTMLSAETL GQGNGGNLTV KAAESLTLQD GAFISTKTSG QGNAGNLIVS ARDIEIIGKS PDGKSFSSLT AEAKKESTGA AGTITINTET LNLRNEAEII VTSATQQAAG DLIINSNNIR LENQASLKAE TKAGEGSITI NNNKDLILRN NSNITTNSEG TATGGNITIN TENLVAPDNS DISANAVEAF GGRVNITAAG IFGTEFRTEA TDKSDITATS ELGASFSGEV NINTPDVDPS DGLIEFEDTV PDISNLLDQN PCRQGQKSKF IITGRGGLPP TLEEPLAPTY VWQDSETSVT PPPAITPSEQ EFIEAQGWSS NSQGIELITD PETATPTSPW LIPPNCEQLD TFEPPPYLLA SSKGNIPTST SEPLKVTIKQ FNFEGNTKFS NQELQQQLTP YLNKPITFAQ LIAARTAITK YYTKNNYITS GAFIPPQTMA ENGTVTLRVV EGKVGEINVN IQGRLNENYI KSRLEKATTA PLNQEKLLSA LQILQIDPLV KTLSAELSSG VRPDTSRLDI RIETANPWQI ETISNNGRAP SVGTFRRGVE VDHRNVTGIG DSLSALYTNT DGSDMVEVSY AIPVNSSNGR IELFYRHRDN KVVESPFERL DIESNSNTYK LSFSQPIMQT PWQTLSLGLS AIKRDSQTSI LGENYPLSEG ADKNGETKLS ILQLFQDYEL RGKNQVLAFH SQLNVGLGIL DATKNSNEPD GRFFYWRGQG QWVRELGKNT LLLMGADLQL SPSDLVPQER FGLGGYRNVR AYRQDTRLTD NGALGTVELR LPVPWISGKN RLFQVVPFID GGVAWNSDSK EEEGSKALAA AGVGLQVNLW EKINMRLDYG IPLVDVDSRD KTAQEEGFYF SFSTTPFSF
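The sequences in this record are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup together with a GC-content calculation; the helper names are hypothetical, and the example string is only the first two blocks of the gene sequence, not the full 4050 bp.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, used here only as a short illustration.
example = "ATGCTTTTTT TTTACAAAAA"
seq = clean_sequence(example)
print(len(seq), round(gc_percent(seq), 1))  # prints: 20 15.0
```

Applied to the full gene sequence, the same two calls give values that can be compared with the Gene Length and GC content fields of the record.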