Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0337 |
Symbol | |
ID | 4243152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 516434 |
End bp | 517675 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 638105669 |
Product | hypothetical protein |
Protein accession | YP_720284 |
Protein GI | 113474223 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00896127 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACGCAG AAAAGGTAGT ATCCCAAACA TTTCGTCGAG TTCGTCCACG ATATCGTCCA GTAAAAAAAC GGCCAGTTAA AGTTCGTCGG CGAGTTAAAA CCAAAAATAT ACAGTTAATA TTATTACAGT TAAAATATAA AAATATATTT ATTTTAGTTG CTTTATTAAT GACAGTGAGC TGGATTATTA CCTTACCTTT TAGAGGTCGG CCAGCTTCAG AGAAACCTTT ACCTACTCCT GCATCTTCTG TATCACCTAC CCTAATACCA CCCTATGTAC CCCCAGTTCC TGAAGAAACT CCAAAAGATT TAGGATTTGC CTATAATGTT CGTAGACAAA CATACAAACG TAATAGCCCG CAGTTACAAG AAATAGTTGA TGAATTACTC AGTATTGCTA AAGAAAAAGG GCTGCCTACA GCGCCTCTAT CTATTAGTTT AATTGATGTT AGTAACCCAG ATCTTCACAC ATATGCTGGA TATAAAAATC AAGTTTTAAG ATACCCTGCT AGTGTAGCCA AATTATTTTG GATGGCTGCA TTTTATGGAG CAGTTGAACA AAGTTTAATT GATAATGAAC CGAAGTTTTA TGAAGATTTA AGATTAATGA TGCAGAAATC TCATAATGAT TCTGCTAGTA GAATTTTAGA TGCAATTACT GATACAAAAT CAGGGGTTAA ATTGGAAGGC AAAAAGTTAA ATACTTGGTT AGAAAAAAGA AAATCAGTCA ATGAATTTTT TCAAAAAGCT GGTTACCAAG ATTTAATAGT TAGTACAAAA AACTATCCAA GATATTCTCC TAGTCAAACA GGTCCAGTAG GTCGCGATCG CCAACTACGA AAGCAAGATG GTAAGTTTCT CCGAAATTTG ATTTCAACTG ACCAGGCTAC CAGATTAATA TATGAAATAT ACACTAGGCA GGCAGTTTCA CGAAAGTATA GTACGAGAAT GGCTTATTTA TTAACGAGAG ATTTAAGACC CCAAGTATGG CAGAATGATC CCTACAATGG AGTCAAAGGT TTTCTGGGAG AGTCTCTGCC TGCTAATATT TATTTCGGTT CTAAAGTTGG TTTGACTTCT AAAGATCGTA TGGATGTCGC CTTTGTCAGA ACTTTAGATA ATCAAGCTAT TTATATTTTA GCAATTTTTG CAGAAGATGC TGCTTATTCT AATGATGAAG AAATATTTCC TAAATTGTCT CGTCATGTTT ATGATCGCAT GATGGCAATG GATAGCAAAT AA
|
Protein sequence | MHAEKVVSQT FRRVRPRYRP VKKRPVKVRR RVKTKNIQLI LLQLKYKNIF ILVALLMTVS WIITLPFRGR PASEKPLPTP ASSVSPTLIP PYVPPVPEET PKDLGFAYNV RRQTYKRNSP QLQEIVDELL SIAKEKGLPT APLSISLIDV SNPDLHTYAG YKNQVLRYPA SVAKLFWMAA FYGAVEQSLI DNEPKFYEDL RLMMQKSHND SASRILDAIT DTKSGVKLEG KKLNTWLEKR KSVNEFFQKA GYQDLIVSTK NYPRYSPSQT GPVGRDRQLR KQDGKFLRNL ISTDQATRLI YEIYTRQAVS RKYSTRMAYL LTRDLRPQVW QNDPYNGVKG FLGESLPANI YFGSKVGLTS KDRMDVAFVR TLDNQAIYIL AIFAEDAAYS NDEEIFPKLS RHVYDRMMAM DSK
|
| |