Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2766 |
Symbol | |
ID | 4244799 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 4287744 |
End bp | 4288979 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638107825 |
Product | hypothetical protein |
Protein accession | YP_722422 |
Protein GI | 113476361 |
COG category | [S] Function unknown |
COG ID | [COG3825] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.114767 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGTCG TTGAAGAAGT ACCTTTGTTG AAGGTCTTAC TGTCTCTTTT TTACAGTTTG CGCCAATATG GCTTGCCTTT GGGAGTTGAA GACTATATGT TAGTGCTGAG GGCATTGCAA GGTGGGTTTG GTATAGGCGA TCGCGACTCT TTAGAACGAC TATGTTGTAC TTTGTGGACA AAATCTGAAC AGGAGGCCCG CCTGTTACAT CAACTTCTAG GTCGGGCAAT AACCAATGCT CCTTCTTCTG CTGAATTACC ACAACCTGTC GAAGATACAC CGAATCCCTC CCCAACTAGC GCTTCAACTA GACCTGTATC AAAGGAGTCT GAGGAAGTTA TGGACTCTTC GACTTTACCA TCTACAGCGA CGCCTGTTCG CAACATATCT GAGCAACTTG AGGAAAAAGC TCCATTAAAA GAGCCAGAAA TACCTGTGTC AAAACCGAGT CCATTGACTG ATATTCCCCT AGAAATAGAT GAACCAGAAC TGGTCATTCA AGCTATCCGA CATTATAATA GGTCTAATGA AATGATTTCT GAATACCAAG ATCTAGCTGC TCAATACCTG CCAGTGACCC CTCGGCAGAT TAAGCAGAGT TGGCGTTTTT TAAGTCGTTC AGTCCCCCAA GGTATATCGG ACAAGTTGAA TGTACCAGCT ACGGTAGCCA AGATTTGTCA GCAATGCATC TTAATTGAAG CGGTGCTGAT GCCAAATTAT GTAAATCGAG TTAAACTTGT GTTACTGGTT GATCAAGGTG GCTCAATGAT TCCTTTTCAT CATTTATCCC GTCAATTAAT AGATAAAGCT CGACGGGGTG GAAATATTGA GCAGGCGAGT GTTTATTATT TTTATAATTA TCCTGAAATA TATTTTTATA GTGACCCAAC TCGCCTCAAA GCTCAACTAA TTACAGATAT TTTAGGAGCT ATCGATGAAA GAGCAGGAGT ACTTATGGTC AGTGATGCTG GAGCTGCCAG GAGTAATTAT AACCCAGAGC GAATTGAGTG CACCCAGAGG TTTATTGAAC AACTTCGGCA GTCAGTCCGT TATTATGCTT GGCTAAATCC TATGCCTAAT GATAGTTGGC AAGGCACGAC TGCTGGGGAA ATTGCTCGGT TTGTGCCAAT GTTTGAGATG AGTCCTCAAG GATTTAATGC TGCTATCAAT GCTTTGCGTG GTCGCTATGT GTATGGGAAA GATTTTTATG AGTTGAGCCG GCAAAAATCA TTATGA
|
Protein sequence | MNVVEEVPLL KVLLSLFYSL RQYGLPLGVE DYMLVLRALQ GGFGIGDRDS LERLCCTLWT KSEQEARLLH QLLGRAITNA PSSAELPQPV EDTPNPSPTS ASTRPVSKES EEVMDSSTLP STATPVRNIS EQLEEKAPLK EPEIPVSKPS PLTDIPLEID EPELVIQAIR HYNRSNEMIS EYQDLAAQYL PVTPRQIKQS WRFLSRSVPQ GISDKLNVPA TVAKICQQCI LIEAVLMPNY VNRVKLVLLV DQGGSMIPFH HLSRQLIDKA RRGGNIEQAS VYYFYNYPEI YFYSDPTRLK AQLITDILGA IDERAGVLMV SDAGAARSNY NPERIECTQR FIEQLRQSVR YYAWLNPMPN DSWQGTTAGE IARFVPMFEM SPQGFNAAIN ALRGRYVYGK DFYELSRQKS L
|
| |