Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4653 |
Symbol | |
ID | 4246307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 7155056 |
End bp | 7158055 |
Gene Length | 3000 bp |
Protein Length | 999 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 638109520 |
Product | surface antigen (D15) |
Protein accession | YP_724096 |
Protein GI | 113478035 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4775] Outer membrane protein/protective antigen OMA87 |
TIGRFAM ID | [TIGR00992] chloroplast envelope protein translocase, IAP75 family [TIGR03303] outer membrane protein assembly complex, YaeT protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0482957 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGCTTGT ATCCGTTTTT AGTGGCCATT TTTGCGGCTT TAACAACTTT TGGAATCTCA AAGTCTGCTA ATGCTCAAAT ATTTGGGAGT AGTGTAGATA CTGATTTAGT TTTTATATCA GTTAATTCAA AAACACAGTT TGCTAAACTT TTTTCTACCA AATCTTTTTT AGATTTAGGA TCAAGTTTCC CTTCACAAAA ATTATTTGTT AGTAGGAATA ACTTGTTGAG TAAGTCTTGG TCGGAGGCAG AAATAAAGTT AGATTTTGAT AATTTATTAA AGATCAAAGA AATTACAAAA TCATCTTCTA TTTATAAGAA AGAATTGGCT AATTTTTGGG ATTTGTATGT TTTGGAAAAT CTAGAAAATC CTCAACTAAT CCTAACTAAT AAATCAACAA AAACTAAATC TGAATTTTTA GTAAATAAAC AATTTTCTGT CAGCATTTCT AATCTGGAAA TGTTTAACAA TAGGAATCAA AATTTTCAGA AATTGGTTTT GTCAAAATCA ACAGAAACTA AATCTGAACC TTTAATAAAT AAACAATTTA CTGCAACTAT TCCTCAAGGA AAAATAGTTG AGAATAATAG TCAAAATTCT CAGAATGTGG TTTTGTCAAA ATCAACAGAA ACTAAATCTG AATCTTTAGT AAATAAACAG TTTATTGCCA ATATTCCTCA AGGAAAAATA GTTGAGAAGG AGAGTCTTGA TTCTCAGAAT ATGGTTTTGT CAAAATCAAC AGAAACTAAA TCTGAACCTT TAGTAAATAA ACAGTTTATT GCCAATATTC CTCAAGGAAA AATAGTTGAG AAGGAGAGTC TTGATTCTCA GAATATGGTT TTGTCAAAAT CAACAGAAAC TAAATCTGAA CCTTTAGTAA ATAAACAGTT TATTGCCAAT ATTCCTCAAG GAAAAATAGT TGAGAAGGAG AGTCTTGATT CTCAGAATGT GGTTTTATCA AAATCAACAG AAACTAAATC TGAACCTTTA GTAAATAAAC AGTTTATTGC CAATATTCCT CAAGGAAAAA TAGTTGAGAA GGAGAGTCTT GATTCTCAGA ATGTGGTTTT GTCAAAATCA ACAGAAACTA AATCTGAACC TTTAGTAAAT AAACAGTTTA CTGCCAATAT TCCCACTACG GAAATAGTTA CAGAATCAGA AAAATATATT TCCCCTCAAT CACAACAAGC ACAAACCTCT ACAGAGGAAG AAGAACCCCT AGTGCTAGTA GCCGAAGTTG TAGTTACTGG AGTAGACGGA GAACTCCAAG ATGAAGTTTA CCGAGTCATT AACACTCAAC CAGGAGAAAC AACCACTCGC TCTCAACTAC AAAAAGATAT TAACGCTATT TTTGCTACTG GTTTCTTCCA AAATGTGAAA GCAATACCTG AAGATACTCC TCTAGGTGTA AGAATTATTT TTAAAGTCGA ACCAAACCCT ATATTGACTT CTGTAATAAT AGAGGGTGCA GAAGTATTTC CTGAGAGTGA AACAGAGAGA ATATTCAATG AGCAATATGG TGAAACTTTG AACTTGAAAG TTTTTGAACA AGGTGTTGAA CAAGTCGACC AATGGTATCA AGACAATGGT TATGTTCTTG GACAAGTTAT TGGTGCGCCC CAAGTTGGTG ATGATGGTAC AGTTACCTTA GAAGTTGCAG AGGGAAAAAT TAAAGATATT CAGGTACGCT TCCTAAATTC AGAAGGAGAA ACAGTAGATG AAGAAGGTAA TGTCATAAAA GGTCGTACTC GTGAATATAT TATCACTAGA GAAATTGAAC TGAAAACAGG AGATATATTT CAGCGACAGA CTGCAGAAAA AGACATTAGA AGAGTGTTTG ATTTAGGAAT TTTTGAAGAT GTGAGATTGG GGTTAGAACC AGCACCAGAT GATCCTAATA CGGCAGTTAT TGTTGTGAAT ATTGTAGAGA AAAGTACCGG TTCTCTTGCA TTTGGTGGTG GGGTTAGTTC TGCAAGTGGG TTGTTTGGTA CAGTAAGTTA TCAACAACAA AATATTGGTG GTAATAACCA AAAATTAGGT GGTGAATTTC AGGTTGGTGA ACGATTAGTA TTAGCAGATG TTAGTTTTAC AGATCCTTGG ATCGGTGGAG ACGATCATCG ACTCTCTTAC ACGGTGAATG CCTTCAGACG GCGAACTATT TCAGTGATTT TTGATTCCGA TGATGACGAT CAGCGTGATG TCGATTTGCC TAATGGTGAT AACCCACGAG TTATTCGTAC AGGAGGTGGG GTTAGTTTTA CTCGTCCGTT TATCCCTAAT CCATTTGTTG ATCCAGATTG GACTGCTTCT CTGGGATTAA AATATGAACG GGTGGAAATT CAGGATCGTG ATGGTGAAGT TGAACCTAGA GATGAGTTGG GCAATAAATT GACAGTCGAT GATTCTGGGA AAGATGATCT ATTCACTATT CAATTTGGTA TTGTCAATGA TCGACGCAAT AATCCCCGAC AACCTACTTC TGGTAGTTTG TTGCGGTTTG GTGCAGAACA GTCTATTCCT GTAGGCTCAG GAGAAATTGG TTTGAATCGT TTGCGGGGTA ATTTCAGTTA TTTTATTCCA GTAAGTTTTA TTAACTTTAC TGATGGACCT CAAGCTTTAG CTTTTAATAT ACAAGCAGGC CATATAATAG GAGACTTACC TCCTTATGAA GCTTTTGCTC TGGGTGGTAC TAATACAGTC CGTGGATATG ATGAGGGTTC GGTGGCAGCA GGTCGTACTT TTGTCTTAGG AACTGTTGAG TATCGCTTTC CTGTATTTAA ATTTCTTGGT GGTGCTTTAT TTGTTGATGC TGCAACAGTA TTTGATAGTC AAAGATCTGT TATTGGTAAT CCTGGAGGTG TGCGGGAGAA GCCAGGAGAT GGCATAGGTT ATGGTGGTGG TCTCCGGGTG AATTCTCCTC TGGGTCCGAT TAGAATTGAT TATGCTATTA ATGATGAAGG TGACACTCGT TTCCACTTTG GTATTGGTGA GCGCTTTTAA
|
Protein sequence | MCLYPFLVAI FAALTTFGIS KSANAQIFGS SVDTDLVFIS VNSKTQFAKL FSTKSFLDLG SSFPSQKLFV SRNNLLSKSW SEAEIKLDFD NLLKIKEITK SSSIYKKELA NFWDLYVLEN LENPQLILTN KSTKTKSEFL VNKQFSVSIS NLEMFNNRNQ NFQKLVLSKS TETKSEPLIN KQFTATIPQG KIVENNSQNS QNVVLSKSTE TKSESLVNKQ FIANIPQGKI VEKESLDSQN MVLSKSTETK SEPLVNKQFI ANIPQGKIVE KESLDSQNMV LSKSTETKSE PLVNKQFIAN IPQGKIVEKE SLDSQNVVLS KSTETKSEPL VNKQFIANIP QGKIVEKESL DSQNVVLSKS TETKSEPLVN KQFTANIPTT EIVTESEKYI SPQSQQAQTS TEEEEPLVLV AEVVVTGVDG ELQDEVYRVI NTQPGETTTR SQLQKDINAI FATGFFQNVK AIPEDTPLGV RIIFKVEPNP ILTSVIIEGA EVFPESETER IFNEQYGETL NLKVFEQGVE QVDQWYQDNG YVLGQVIGAP QVGDDGTVTL EVAEGKIKDI QVRFLNSEGE TVDEEGNVIK GRTREYIITR EIELKTGDIF QRQTAEKDIR RVFDLGIFED VRLGLEPAPD DPNTAVIVVN IVEKSTGSLA FGGGVSSASG LFGTVSYQQQ NIGGNNQKLG GEFQVGERLV LADVSFTDPW IGGDDHRLSY TVNAFRRRTI SVIFDSDDDD QRDVDLPNGD NPRVIRTGGG VSFTRPFIPN PFVDPDWTAS LGLKYERVEI QDRDGEVEPR DELGNKLTVD DSGKDDLFTI QFGIVNDRRN NPRQPTSGSL LRFGAEQSIP VGSGEIGLNR LRGNFSYFIP VSFINFTDGP QALAFNIQAG HIIGDLPPYE AFALGGTNTV RGYDEGSVAA GRTFVLGTVE YRFPVFKFLG GALFVDAATV FDSQRSVIGN PGGVREKPGD GIGYGGGLRV NSPLGPIRID YAINDEGDTR FHFGIGERF
|
| |