Gene Information |
Locus tag | Tery_3936 |
Symbol | |
ID | 4244019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 6087348 |
End bp | 6090377 |
Gene Length | 3030 bp |
Protein Length | 1009 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 638108858 |
Product | Na-Ca exchanger/integrin-beta4 |
Protein accession | YP_723440 |
Protein GI | 113477379 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
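The coordinates and lengths listed in the Gene Information table are mutually consistent, which a little arithmetic confirms. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The variable names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table.
start_bp = 6087348
end_bp = 6090377

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 3030 1009
```

Both values match the Gene Length (3030 bp) and Protein Length (1009 aa) fields.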
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00889284 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000639264 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACAGACG ATAGTGAATT CCTTGATATA ATGTCCGACT TGGAGCTCAA ACACATCAAC AATATTCCTC CTATAACTTT TCTGGGGCAA GTATCTGTGG AAGTATTGCC AGCGGTGGAG ATAATTGAAG AGGGGTTAGC TCAATTTCAG GAAAATTTAA CGAACTTTGC TGCTTCTGAG ACTTTTGAGG CGGATATGTT AAATGTCTTT GGGGAGTCGG GGAAAGTTGA TCTGGGAAAA ACTATTGTTG ATACTTTGGC AAAGGGTGAG AACTTGCCAC AGATAAATAT TGTGCCAGTA GAGCTGATGA ACGGTGCGGC GGGGGGTTTT GACTCTCTGA CTGGTATGGT GTATTTGACG GATAGTTTGA TTAACGAGAA TTCTGTTATT GGAAGTGAAA CGAGGCAGTT TCCACATCTG ACGGATGTAC TGGCAGAGGA GTTAGGACAT TACATTGACT CAAAGTTGAA TACAATAGAT ACTCCTGGAG ATGAGGGAGA GTGGTTTGCG GCGTTGGTTC GCGGTGATGT GTTGAGTGTG GAGGAAGTTG AGGGTTTGCG CGGCGAGGAT GATATGGTAA AAATTTTAAA TGGGTTGGTT GAGGTGGAGG CAAGTTCAGA GTTGAGCTTC AAGATCAGCA ACTTCCCAAC GGGTGATCAA CCTTTTGGGA TGGCGGTAGC AGATTTTGAT AAGGATGGTT TTAAGGATAT TGTGGCGGCA AACGTTGGGT CTACAACAGT ATCAGTGTTG TTTGGTGATG GCAAGGGAGG GGCATTAACT GCAACTACAC ATGAAGTGGG AGGTAAGCCT GTTTATGCGG CAGTGGGAGA TTTTAACAGA GACCGCAACC CAGATATCGC AACAACAAAC CAAGATGACG ATACTGTAGC AGTTTTGTTA GGAGATGGGA AAGGTCGGTT TAGTAGTCCT AGCGAATTTC CTGTAGGTGA TGGCCCATCT CAACTAGCGG TAGCTGATGT CAACCAAGAC CGTAAGCTAG ATATTGTAAC CGGGAATAGT GGTTCCGATA ATGTTTCAGT ATTGTTAGGA AGCGGCAATG GTAGTTTTGC TGAAAGCATT AGCTATCAGG TAGGAGCAGA TCTTCCTGCA AATTTAGTCG TCAAAGATTT TAACGGTGAT GGTAAGCTGG ATATTGTGAC AGCAAATGGT AATTCTAATA ACGTTTCGGT GCTGTTGGGA AATAGAGATG GTGGCTTTGA CTTCCTCACT AGCTTTATCG CGGGAGGAGA TACGCCGAGC GGTATTGTCG CGAAAGACGT GAACGGAGAC AAGAAGCAGG ACATTGTGAT AAACATGGAA GACTCAGACA AGGTTTCTGT CCTGTTCGGT CTCGGAGACG GACGTTTTGG GTTTCCAAAT AGTTTCCCTG TGGGAAATAG TCCTGAGGAT ATCGCTATTG GAGATTTGAA TGGCGATCGC AAGTTGGATA TTGTTACAGT TAATAGGGAG TCAAATAATA TCTCAATATT ACCAGGACAA GGAAAAGGAA GCTTTGGTAG TGCAATTAAT TTTCAGGTGG GGGATGCACC TGAAGATGTT GTAGTAGAAG ATTTTAATGC TGATGGTAAG CTAGATATTG CGACAACTAA TGGCGCTTCT GATACCATAT CCATTTTACT AAATACCACC AAAATGGCAA TATTTCCCAA CATTGCTATC TCTGACACCA AAATAACTGA AGGCGACAAG GGACGCAAAA ATGCCAAATT TACTGTTACC CTCGACAATG AAAGCGACCA AACAGTAAAA GTCAACTACG CCACCGCGAA TAAAACCGCC 
AAAGGGAACG AAGACTACAA ACCCACTAAG GGCACCCTCA CTTTCAAGCC TGGGGAAACT CAAAAAAATA TTACCGTTCC CATCCTGGGA GATAACAAAG TAGAACCTAA CGAAACTTTC AACCTCAACC TCAGCAAACC CAAAAATGCC AGACTCAAGG ATAAAGTCGG TCTTGGGACT ATTACCAACG ACGACAAAAA AAACCCGCCC CAAATATCCA TTAGCGACAC CAAAATAGTC GAAGGCAACC GAGGACGCAA AAACGCCAAA TTCACTGTCA CTCTGGATGC CAAACCAATA GAAACAGTTC AGGTAGACTA TGGTACTCGC AATCAAACTG CGAAAGCAAA CCAAGACTAC AAACCCACCA AAGGTACCCT CACCTTCAGA CCTGGTCAAA CCCGTAAAAC CATTACTGTT CCTATTTTTG GGGATAATAA AATAGAAAAT GACGAAACTT TCCAACTCAA CCTCAGCAAA CCCCGAAATG CCCAACTTAA GGACAGACGA GGCATTGGCA CCATCCGCAA TAACGACTTG CCTAAACTAT TCATCAAAGA TCGGGAAATA ACTGAGGGTG ATGATGGCAA AAAACAACTA ACATTTGATG TCACCCTCAA TGCTAAAACT CAAAAAAGAG TTGAGGTAAG TTATGCTACT GCTGACGGAA CTGCGAAAGA AGGTTCGGAC TACCAAAAAA CTCAAGGCAA ACTTATTTTT CAACCGAACC AGAAGAAAAA AACAGTAACA GTTCCTATTC TTGGGGATCT CTCGGAGGAA AGCAGCGAAA ACTTTACAGT TAATCTCAGG AAACCTAAAA ATGCTAGGTT GGGAGACAAG GGCGCCATTG GTACTATTAA AGATAATGAT CGAGGTGGGG AACAGCCAGG GGAAAGTTTC CAGACTGCCA TCAACTTAGG GCAATTCACA GGGGAAGTGG TGAGAACTGA CGAAGTCGGT TTTAGCAGGG GAATATATCG CAATACCAAC GACTTTTATA CCTTCCAAAC AGATAAAGAA AGTGCTTTTG TACTATTCCT TGACAATTTA TTACAAGATG CTAATATTGG TTTATATGGT AGCGAGGAAG AAGTGATCAA CCAGTCTAAA AATAGCGGTA TCGAACGAGA AAGTATTGTG ACTACATTAG ACCCAGGTAC TTACTATGTT CGGGTATATC CCCAAGGTGC CAGTCGCACG GACTACCGTC TCAGCCTCAA CTTACTTTAG
|
Protein sequence | MTDDSEFLDI MSDLELKHIN NIPPITFLGQ VSVEVLPAVE IIEEGLAQFQ ENLTNFAASE TFEADMLNVF GESGKVDLGK TIVDTLAKGE NLPQINIVPV ELMNGAAGGF DSLTGMVYLT DSLINENSVI GSETRQFPHL TDVLAEELGH YIDSKLNTID TPGDEGEWFA ALVRGDVLSV EEVEGLRGED DMVKILNGLV EVEASSELSF KISNFPTGDQ PFGMAVADFD KDGFKDIVAA NVGSTTVSVL FGDGKGGALT ATTHEVGGKP VYAAVGDFNR DRNPDIATTN QDDDTVAVLL GDGKGRFSSP SEFPVGDGPS QLAVADVNQD RKLDIVTGNS GSDNVSVLLG SGNGSFAESI SYQVGADLPA NLVVKDFNGD GKLDIVTANG NSNNVSVLLG NRDGGFDFLT SFIAGGDTPS GIVAKDVNGD KKQDIVINME DSDKVSVLFG LGDGRFGFPN SFPVGNSPED IAIGDLNGDR KLDIVTVNRE SNNISILPGQ GKGSFGSAIN FQVGDAPEDV VVEDFNADGK LDIATTNGAS DTISILLNTT KMAIFPNIAI SDTKITEGDK GRKNAKFTVT LDNESDQTVK VNYATANKTA KGNEDYKPTK GTLTFKPGET QKNITVPILG DNKVEPNETF NLNLSKPKNA RLKDKVGLGT ITNDDKKNPP QISISDTKIV EGNRGRKNAK FTVTLDAKPI ETVQVDYGTR NQTAKANQDY KPTKGTLTFR PGQTRKTITV PIFGDNKIEN DETFQLNLSK PRNAQLKDRR GIGTIRNNDL PKLFIKDREI TEGDDGKKQL TFDVTLNAKT QKRVEVSYAT ADGTAKEGSD YQKTQGKLIF QPNQKKKTVT VPILGDLSEE SSENFTVNLR KPKNARLGDK GAIGTIKDND RGGEQPGESF QTAINLGQFT GEVVRTDEVG FSRGIYRNTN DFYTFQTDKE SAFVLFLDNL LQDANIGLYG SEEEVINQSK NSGIERESIV TTLDPGTYYV RVYPQGASRT DYRLSLNLL
|
| |
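The relationship between the gene sequence and the protein sequence can be spot-checked by translating the listed nucleotides. This is a minimal sketch, not the database's own pipeline. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ in permitted start codons, which this sketch does not handle), and that the gene sequence is listed in coding orientation, since it begins with ATG although the strand is "-". The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and normalise case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence as listed in the record.
print(translate("ATGACAGACG ATAGTGAATT CCTTGATATA"))  # MTDDSEFLDI
```

The printed residues match the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.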