Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4465 |
Symbol | |
ID | 4246118 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 6886447 |
End bp | 6888165 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 638109348 |
Product | carbohydrate-selective porin OprB |
Protein accession | YP_723925 |
Protein GI | 113477864 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00227345 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAAAAC TTATATGGAA TACTTTTAAG CACAGTCCTA GTGTTTTTAG TATAGCATTA TTGATGGCAG GCTCAGCAAT CGCTGCTGAG ACTCCTCTAC AAAATTTAGG AACTGATGAA AGTCCTGTAA ATCAGAACTT AACCCAAGGC AGCATTGAAA TTGCTCAAAA TTTTGATACT CGGTTAATGC CAATGGATGA CCCTTCACTT ACACCAGTAG GGGTTTCAGA CTTAGAAAAT GGCGAGTACA TGGACCAGGT AACATCTGTA ACTCAGTTAT CAGATGTACG ACCTACTGAC TGGGCTTTCC AAGCTCTACA ATCCTTGGTA GAGCGTTATG GTTGTATAGC AGGTTATCCT GACGGTACTT ATAAAGGAAA TCGAGCGATG ACTCGCTTTG AGTTTGCAGC CGGTTTAAAT GCCTGCTTGG ATAGAGTCAC AGAATTAATT GCTGCTGCAA CTTCAGACCT AGTAACTAGA GAAGACTTGG CAGTTTTACA AAGACTACAA GAAGAGTTCA GCGCAGAACT AGCTGCTTTG CGGGGACGAG TTGATTCTTT GGAGGCCAGA ACATCAGAAT TAGAGGCCAA TCAATTCTCT ACTACAACAA AATTAAACGG TGAGGTATTG TTCTGGTTAA GTGATACCTG GGGAGAAAGA GCCGCCGGTC GTGGGACAAA AAAGAGTGAG GAGGACAAAA CTGAGACAAC CTTTGCTTAT CGAGTTCGTT TAATCTTTGA TAGTAGTTTT ACTGGGAAAG ACCGCTTGAG AACTCGTTTA CAAGCTCGTA ACGTTCCAAA ATACGACAGT CGAGATTTGA CGAACACTAT GATGACTCGT CTAGGTACTG ACGATGATTT TGATGACGAT TTTGTTCTCA ATAAATTGGC CTATCGTTTT CCTTTACTAA ATGGTAGAGG TCAAATAGAA CTAGCAGCTA ATGGTTATGG TCTGGACGAC TTCATGGGAC CCATTACACC TTTAGATAGT AGTGGTTCTG GTTCTATTTC AAGATTTGGG CGATTTAACC CTACTTTTTA TCGTGCTCCA GCTGATGCTG GTGTTAAATT TGCTTATGCC TTCAACGACG CAATTAAGTT GACAGTTGGT TATGCTGCAC CAGATCCAGA AGACCCTCAA GAAGGTAAAG GTATTTTTAA TGGTGGCTTC AGTGGATTTG GCCAAGTTAC CTTTGAGCCA AATGATAGGA TAGTTTTTCA AGCAGGTTAT GTGCGTGGCT TCCATCCTAA AGGAGATGTG AACTTAACTG GAAGTACAGG TAGCTTTAAA GCCAAAGATC CTTTTAAGGG TATACGGACT AGTGCAGATA ATATCAACTT TGAAGCCCAG TGGTTAATTA CTGAAGGTTT CCAAATAGGT GGTTGGTTTG GTGCCTCTTT TGCCCGTCCT GAAGATAACA ATGATACAGA TGATATCACT ATTGTTAATG GTGCTTTGAC TTTAGCTTTC CCAGACCTAC TCAAAGAAGG TAGCTTAGGA GGTATTATCA TTGGTGTACC ACCAATTATT ACTGATGGTG GTGATGATGA TTCTCTGAAA GACGATGATA CTTCTATTCA CGTTGAGGTA CTTTATCGCT TCCAAATGAA TGACAATATT GCCATTACTC CTGGTGTATT TGTGATTACT AACCCTAATC ACATTGAGGA TAATGAAACT CTCTGGGTTG GTACAATAAG AACTCAATTC AGATTCTAG
|
Protein sequence | MIKLIWNTFK HSPSVFSIAL LMAGSAIAAE TPLQNLGTDE SPVNQNLTQG SIEIAQNFDT RLMPMDDPSL TPVGVSDLEN GEYMDQVTSV TQLSDVRPTD WAFQALQSLV ERYGCIAGYP DGTYKGNRAM TRFEFAAGLN ACLDRVTELI AAATSDLVTR EDLAVLQRLQ EEFSAELAAL RGRVDSLEAR TSELEANQFS TTTKLNGEVL FWLSDTWGER AAGRGTKKSE EDKTETTFAY RVRLIFDSSF TGKDRLRTRL QARNVPKYDS RDLTNTMMTR LGTDDDFDDD FVLNKLAYRF PLLNGRGQIE LAANGYGLDD FMGPITPLDS SGSGSISRFG RFNPTFYRAP ADAGVKFAYA FNDAIKLTVG YAAPDPEDPQ EGKGIFNGGF SGFGQVTFEP NDRIVFQAGY VRGFHPKGDV NLTGSTGSFK AKDPFKGIRT SADNINFEAQ WLITEGFQIG GWFGASFARP EDNNDTDDIT IVNGALTLAF PDLLKEGSLG GIIIGVPPII TDGGDDDSLK DDDTSIHVEV LYRFQMNDNI AITPGVFVIT NPNHIEDNET LWVGTIRTQF RF
|
| |