Gene Information |
Locus tag | Tery_0803 |
Symbol | |
ID | 4241766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 1275919 |
End bp | 1277823 |
Gene Length | 1905 bp |
Protein Length | 634 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 638106081 |
Product | hypothetical protein |
Protein accession | YP_720693 |
Protein GI | 113474632 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.165447 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAATT TTCCGCAAAT CGAACCCCCT GCTGAAGGGG AAGCAGTCCA ACTTATTGGC CACAACAAGA AAGAGAGTTT TATAAATGGA TCCGACGATG CTGAGGTTCT GAAGGGTAAA AAGCGTGATG ATATTCTCGC TGGCTCCGGT GGAGAAGACT GGCTCTATGG AAAACAAGGT CAAGACATCC TGCTCGGCGA CAACGTACCT GACGAGACTG ACTTTGACGC AGATGGCAAG GTAACGGTCA ATCATACCTC AATGAATGAC TACCTTTTCG GGAATATTGG TGAGGATACT CTCATTGACC AATTCGGGAG TGACGGTTTA ATAGGTGGTC AAGGGAATGA TTTGATTATT AGTATGAGTG ACTCCAATAT TCCCCAGGAA AATCAGACTA TACCAGCCAA CGTAGATGAT GGTGATGACC TTGCTCGGCT TAATTTTTCT GATACATATA TTAATCCAGA CAACCTTAGC GCTAATGACA CATTGAAAGG TGGAAAGGAG GCAGATACAT TTTCCTTCCA ATTATTGATA AATGCTTCAA AAGAAATTGT CGAGAAGCAC ACTAATGACC AGGGAGTTAC CAACTGGGGA ATGAATGGTG TCGCCGGAGA AAATGATAAC TACCATGATC ATTGGGTGGA AGGCATTGGG CATGATAAAA TTATTGACTT TAATGCCAAG GAAGGAGACT CAATTAAAGT TTTCGGCCAT ACAGTACAAA AGCGACTCCT TGAGAATGAT GAACAATCTA AAATCGCTGT TATTGGTCTC TACAGTGATC AAGGTAATGA TGGACAGCGA GGTGGGGGTG CTCATGACCT AGATGTTCTC GGTAAAATTG AAGTGCATTA TAAAGGCAAG TTTGACTTTG ATTCAGATGT TACTGTAGAA AATGAAGATT ACGGAGCTTA TGGCTTGGAC AGTAATGCAA TGATTGAGGA ATATGATATT CCCAATAAAC TGGATGAAGC TAATACATCT GACTTTGTCG GAACTAACGG AGATGACGAA ATTGAAGGTA ATCTTGGCAA AAATAAGATA CAAGGTCTTC AAGGTGATGA TCATCTTATT GGCCAAGCGG GGAAAGACAA GTTAGAAGGT TCAGAAGGTG ATGATCATCT TATTGGTGAT TACATACCTA CAGAAGCTGA TTACGATGCT GAAGGAGTAC TCGACCTTCC CGAAGATGTA GTCTACGATG ACAAACTTTT TGGCGATACA GAATCTGATA CTCTTGCCGA TCAGTATGGG GATGATAAAC TCACAGGAGG TGATGGTAAT GATAGGTTGA TCAGTATTAG TGATAGTAGT ATTCCTCGTG AGAACGCTAA TATTCCAGCT GGTGTAGATG ATGGTGACGA CCTTAAGAAG CTTGATTTTT CCAATGAACT TATCAATCCA TATAATACCG CCAGTAAGGA TATTTTGACA GGAGGTTCCG GTGCTGATAC GTTTGAATGG AATCTTCTTA TCAATGCCTC AAAAGACATT GTCGAGAAGC ACACTGATGA CGACGAAATC ATCAACTGGG GAATGAATGG TGTGGCCGGA GAAAATGATA ACTACCATGA TCATTGGGTT GATGGTATTG GAAAGGATTT AATTATGGAT TTCAGTGGAC AGGGTGGTGA GGATGATAAG ATTATCGTCC GAGGACATAC GGTTAAAGTC AAGCTCTTGA GTGAAACGTC AGATAAGGCA GAGTTAGGTA TCTACAGCGA CCAAGGTAAT GACGGAGAAC GAGGTAATGG TGCCCATGAT TTTGATGTTC TAGGAAAAAT TGTAGTCAAA 
CATGATGGTA ACTTTAACTT TGATAATGAT GTACAAGTCG TAGGGGTTGA TTATGGTGCC TATGGGAATG GTGCAGAATT ATATCAAGTG TTTGGACTTG AATAA
|
Protein sequence | MDNFPQIEPP AEGEAVQLIG HNKKESFING SDDAEVLKGK KRDDILAGSG GEDWLYGKQG QDILLGDNVP DETDFDADGK VTVNHTSMND YLFGNIGEDT LIDQFGSDGL IGGQGNDLII SMSDSNIPQE NQTIPANVDD GDDLARLNFS DTYINPDNLS ANDTLKGGKE ADTFSFQLLI NASKEIVEKH TNDQGVTNWG MNGVAGENDN YHDHWVEGIG HDKIIDFNAK EGDSIKVFGH TVQKRLLEND EQSKIAVIGL YSDQGNDGQR GGGAHDLDVL GKIEVHYKGK FDFDSDVTVE NEDYGAYGLD SNAMIEEYDI PNKLDEANTS DFVGTNGDDE IEGNLGKNKI QGLQGDDHLI GQAGKDKLEG SEGDDHLIGD YIPTEADYDA EGVLDLPEDV VYDDKLFGDT ESDTLADQYG DDKLTGGDGN DRLISISDSS IPRENANIPA GVDDGDDLKK LDFSNELINP YNTASKDILT GGSGADTFEW NLLINASKDI VEKHTDDDEI INWGMNGVAG ENDNYHDHWV DGIGKDLIMD FSGQGGEDDK IIVRGHTVKV KLLSETSDKA ELGIYSDQGN DGERGNGAHD FDVLGKIVVK HDGNFNFDND VQVVGVDYGA YGNGAELYQV FGLE
|
| |
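The coordinates and lengths in the Gene Information table can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length; the record itself does not state either convention. The function names are hypothetical and exist only for illustration.

```python
# Values copied from the Gene Information table of this record.
START_BP = 1275919
END_BP = 1277823
GENE_LENGTH_BP = 1905
PROTEIN_LENGTH_AA = 634


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for the values listed in this record.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length and protein length are consistent")
```

Under these assumptions the listed values agree: the span from 1275919 to 1277823 covers 1905 bp, which is 635 codons, and the sequence shown ends in the stop codon TAA, leaving 634 amino acids.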