Gene Information |
Locus tag | Tery_3032 |
Symbol | |
ID | 4244916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 4682776 |
End bp | 4683933 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 638108063 |
Product | cupin 4 |
Protein accession | YP_722656 |
Protein GI | 113476595 |
COG category | [S] Function unknown |
COG ID | [COG2850] Uncharacterized conserved protein |
TIGRFAM ID | |
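The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. Both are assumptions, not something the record states, though they agree with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4682776
end_bp = 4683933


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bp):
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the result agrees with the Gene Length (1158 bp) and Protein Length (385 aa) fields.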
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0159331 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00213714 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCGTATC TAACACATCT AATACAACCA TTAAAGCAAG AAGAATTTTT AGAAAACAAC TGGACTAAGA AAGCGATCGC TATCTCCAAT AAAGGAGAGA AAGATTTTAC AGACCTATTT TCATGGGAAA AACTTAACTA CCTACTAAAC TTTCATCAAA TAAAATATCC CGATGTGCGA CTTGCATTTG ATGGAAAAGT TCTGGAGGAA AAAGAAAATA GAAATTTTAC TCAATGGTGC GAAAAAGGAG CAACCTTAAT TCTAGATCAA ATTCATAGAA GGATTCCAGA AGTTGCTATA TTTACTTCAA AACTCAGCTA CGAATTAGGA TACCCAACTC AAGTCAATGC TTATTGTTCT TGGTCTAGTA AAAAAGGATT TTCTCCTCAC TACGACACTC ACGATGTATT TATTTTGCAG GTAGAAGGAA ATAAACAATG GTATGTGTAT AATGACACCT TCAAATATCC ATTGCCAAAC CAAAAATCAT CTTCTTTTAC ACCACCTGAA AAAGAAGCAT ATTTGAGTTG TATTTTACAT CCAGGAGATG TACTTTATAT ACCTCGTGGA CACTGGCATT ATGCAGTGAC AAAAGAGGAA CCATCTATTC ATCTTACTTT AGGTATTCAC TCTTCAACAG GTGTGGATTT ATTGGAATGG TTAATTGGTC AATTACAATA CAGAGAAGAA TGGAGAACAA GTCTAGCGTT AAGAATAGAT GATACTTCTT TTAATGTTAG TGTAGAAAAT TTAATTAAAG ACTTAAAAGA GTATATAAAT AATCACAATA TTAGCGAAGA ATATAATAAT TACTTGGATG GTTTAGCAAA ACCTTTTGAG CAGTATAATT TACCTTATCA AGCTGGATTT CATATTTTTC ATAGGGATAT CGATACTAAA TTCAAAGTGT CTCAGTTTCA ACGTTTAAAA ATTTCCAAAA TGGCTGATGA TGATGGATAT AAAATTTTAG TTTCTGGTAA AGAGGTATCT ATTCGAGGAG TACCAGAATA TTTAGTGAAA AATTTATTTA GCCGAGAAAC ATTTACTGGA AAAGATATCA TAAATTTATT ACCTGACTAT GACTGGGAAA TAGATATTAT GCCCATGTTA TCAAAGCTAG TTAATGAAAG AGTTATTTTT GTAGAATCAG GGCTGTAA
|
Protein sequence | MSYLTHLIQP LKQEEFLENN WTKKAIAISN KGEKDFTDLF SWEKLNYLLN FHQIKYPDVR LAFDGKVLEE KENRNFTQWC EKGATLILDQ IHRRIPEVAI FTSKLSYELG YPTQVNAYCS WSSKKGFSPH YDTHDVFILQ VEGNKQWYVY NDTFKYPLPN QKSSSFTPPE KEAYLSCILH PGDVLYIPRG HWHYAVTKEE PSIHLTLGIH SSTGVDLLEW LIGQLQYREE WRTSLALRID DTSFNVSVEN LIKDLKEYIN NHNISEEYNN YLDGLAKPFE QYNLPYQAGF HIFHRDIDTK FKVSQFQRLK ISKMADDDGY KILVSGKEVS IRGVPEYLVK NLFSRETFTG KDIINLLPDY DWEIDIMPML SKLVNERVIF VESGL
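The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It builds the codon table by hand rather than relying on a library. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard assignments are used here. The helper names `clean`, `translate`, and `gc_fraction` are illustrative, and only the first and last blocks of the gene sequence are used as input.

```python
# Standard codon assignments, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq):
    """Remove the spaces separating 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three and last two blocks of the gene sequence in this record.
first_blocks = "ATGTCGTATC TAACACATCT AATACAACCA"
last_blocks = "GTAGAATCAG GGCTGTAA"

print(translate(first_blocks))  # MSYLTHLIQP
print(translate(last_blocks))   # VESGL
```

The two outputs match the first block and the final five residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.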