Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3964 |
Symbol | |
ID | 4244047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 6127659 |
End bp | 6130481 |
Gene Length | 2823 bp |
Protein Length | 940 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 638108880 |
Product | hypothetical protein |
Protein accession | YP_723462 |
Protein GI | 113477401 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.820547 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTAA ACACAAAAAG TATAGTAATA ATAGATGCTA GTGTAGAAAA CTACCAGCAA CTATTAAAGG GAGTCGTTAC GGGAGTTAAA CCTTTTCTCC TCGGCGGCGA CACTGACGGT ATCCAACAGA TAGGGGATAT CCTCCAAAAA AATCCAGAAA CGGATACCCT TCATATTATC TCCCACGGTT CTCCTGGTTG TCTGTATCTG GGAAATAGCC AATTGAGTTT GGATACTCTC AAGGGCTATG AGTCCCAACT GCAACAATGG CAACTAGACA ACCTTCTGCT CTATGGTTGT AACGTCGCTG CCGGGGATGG GGGGGAAGAG TTTATTGATA AGTTGCATCG GTTGACGGGG GCTGAGATAG CGGCTTCTAA GTCTTTGACT GGGGCGGCAG TTAAAGGGGG GAACTGGGAG TTGGAGGTGA GGACGGGTAA AAGCAAGCTC TCTCTAGCAT TGCAAGTAGA AACGATGGCC AGCTACTCCG ATACCCTCAA CCTGCAATTT GAATGGGCTA AACAGATCGG CGCTAGCAGC TCTGGCGACG TTAGTAGCAT AACCACAGAC AGCAACGGTA ATGTCTTGGT GGGGGGTCTT TTTCAGGGCA ACATTGACAT CGACGGCGAT GGGAACAATG ATTTGACCTC TAATAACGAC TGGGATATTT ATGCAGCCAA GCTCGACAGC AATGGCAATT TGGTCTGGGC TAAACAGATC GGCGGTAGCA TTGATGACTA TGTTAATAGC ATAACCACAG ACAGCAGCGG CAATGTCTTG GTGGGGGGCA GTTTTCGGAG CAACATTGAT ATCGACGGCG ATGGGAACAA TGATTTTACC TCTAACGGCT TCGGGGATGG TGGGGATGGT TATGTAGCCA AGTTCGACAG CAATGGCAAT TTGGTCTGGG CTAAACAGAT CGGCGGTAGC TATTGGGACA ATGCTAATAG CATAGCCACA GACAGCAGCG GCAATGTCTT GGTGGGGGGT TCTTTTGAGA GCTACATTGA CATCGACGGC GATGGGAGCA TTGATTTGAT CCCTGATGGC TTCGGGGATG GTTATGTAGC CAAGTTCGAC AGCAATGGCA ATTTGGTCTG GGCTAAACAG ATCGGCGGTA GCAATTGGGA CTCTCCTTAT AGCATAACCA CAGACAGCAG TGGCAATGTT TATAGCATAA CCACAGACAG CAGTGGCAAT GTCTTGGTGG GGGGTTCTTT TCGGAGCAAC ATTGACATCG ACGGCGATTG GAACAATGAT TTGACCTCTA ATGGCGACCT GGATGGTTAT GTAGCCAAGT TCGACAGCAA TGGCAATTTG GTCTGGGCTA AACAGCTCGG CGGTAGCAAT TGGGACAATG TTAATAGCAT AACCACAGAC AGCAGCGGCA ATGTCTTGGT GGGGGGTTAT TTTGATGGCA ACATTGACAT CGACGACGAT GGGAACAATG ATTTTACCTC TAATGGATTC ACGGATGGTT ATGTAGCCAA GTTCGACAGC AATGGCAATT TGGTCTGGGC TAAACAGATC GGCGGTAGCA GTGATGACTA TGCTAATAGC ATAGCCACAG ACAGCAGTGG CAATGTCTTC GTGGGGGGTA TTTTTTCCGC CAACATTGAC ATCGACGGCG ATAGAAACAA TGATTTGACC TCTAATGGAT TCACGGATGG TTATGTAGCC AAGTTCGACA GCAATGGCAA TTTGGTCTGG GCTAAACAGA TCGGCGGTAG CAGTTTGGAC TATGCTAATA GCATAACCAC AGACAGCAGC GACAATGTCT TCGTGGGGGG TTCTTTTTAT GGCAACATTG ACATCGACGG CGATGGGAAC AATGATTTTA CCTCTAATGG CTTCGGGGAT GGTTTTGTCA TAAAATTATC GGAACAAACT AGCTCCCCCC AAACTAGCCC CCCTCCCACC GACACTGAAC CAACCCGGTT TGACTTCAAT GCCGATGGAG TCGCAGACAT TTTCTGGCGT CACCCAAATG GAGCTAACAG AATTTGGTTG ATGAACGATG AGGGCACACG GGATAGTACA GTTGACCCCG GAAAGTTTGG CAAAGCTTGG GATGTCGCTG GAGTCGCAGA TTTCAATACT GACGGAGTCG CAGACATTTT TTGGCGTCAC CCAAATGGAG CTAACAGAAT TTGGTTGATG AACGATGAGG GCACACGGGA TAGTACAGTT AACCCCGGAA AGTTTGGCAA GGCTTGGGAT GTCGCTGGAG TCGCAGATTT CAATACTGAC GGAGTCGCAG ACATTTTCTG GCATCACCCA AATGGAGCTA ACAGAATTTG GTTGATGAAC GATGAGGGCA CACGGGATAG TACAGTTAAC CCCGGAAAGT TTGGTAAAGC TTGGGATGTC GCTGGAGTTG CAGATTTCAA TACTGACGGA GTCGCAGACA TTTTCTGGCA TCACCCAAAT GGAGCTAACA GAATTTGGTT GATGAACGAT GAGGGCACAC GGGATAGTAC AGTTAGCCCC GGAAAGTTTG GTAAAGCTTG GGATGTAGCG GGGGTTGCAG ATTTCAATAC TGACGGAGTC GCAGACATTT TCTGGCGTCA CCCAAATGGA GCTAACAGAA TTTGGTTGAT GAACGATGAG GGCACACGGG ATAGTAGACT TAACCCCGGA AGCTTCAGGT CAGCTTGGGA TGTAGCGGGA GTCGCAGATT TCAATACTGA CGGAGTCGCA GACATTTTCT GGCATCACCC AAATGGAGCT AACAGAATTT GGTTGATGAA CGATGAGGGC ACAAAGGATA GTGGACTTAA CCCCGCAAGG TTCAGTTCAA CTTGGGATGT AGTTGGGATG TAA
|
Protein sequence | MKLNTKSIVI IDASVENYQQ LLKGVVTGVK PFLLGGDTDG IQQIGDILQK NPETDTLHII SHGSPGCLYL GNSQLSLDTL KGYESQLQQW QLDNLLLYGC NVAAGDGGEE FIDKLHRLTG AEIAASKSLT GAAVKGGNWE LEVRTGKSKL SLALQVETMA SYSDTLNLQF EWAKQIGASS SGDVSSITTD SNGNVLVGGL FQGNIDIDGD GNNDLTSNND WDIYAAKLDS NGNLVWAKQI GGSIDDYVNS ITTDSSGNVL VGGSFRSNID IDGDGNNDFT SNGFGDGGDG YVAKFDSNGN LVWAKQIGGS YWDNANSIAT DSSGNVLVGG SFESYIDIDG DGSIDLIPDG FGDGYVAKFD SNGNLVWAKQ IGGSNWDSPY SITTDSSGNV YSITTDSSGN VLVGGSFRSN IDIDGDWNND LTSNGDLDGY VAKFDSNGNL VWAKQLGGSN WDNVNSITTD SSGNVLVGGY FDGNIDIDDD GNNDFTSNGF TDGYVAKFDS NGNLVWAKQI GGSSDDYANS IATDSSGNVF VGGIFSANID IDGDRNNDLT SNGFTDGYVA KFDSNGNLVW AKQIGGSSLD YANSITTDSS DNVFVGGSFY GNIDIDGDGN NDFTSNGFGD GFVIKLSEQT SSPQTSPPPT DTEPTRFDFN ADGVADIFWR HPNGANRIWL MNDEGTRDST VDPGKFGKAW DVAGVADFNT DGVADIFWRH PNGANRIWLM NDEGTRDSTV NPGKFGKAWD VAGVADFNTD GVADIFWHHP NGANRIWLMN DEGTRDSTVN PGKFGKAWDV AGVADFNTDG VADIFWHHPN GANRIWLMND EGTRDSTVSP GKFGKAWDVA GVADFNTDGV ADIFWRHPNG ANRIWLMNDE GTRDSRLNPG SFRSAWDVAG VADFNTDGVA DIFWHHPNGA NRIWLMNDEG TKDSGLNPAR FSSTWDVVGM
|
| |