Gene Information |
Locus tag | Tery_3845 |
Symbol | |
ID | 4242296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 5940562 |
End bp | 5942922 |
Gene Length | 2361 bp |
Protein Length | 786 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 638108777 |
Product | hypothetical protein |
Protein accession | YP_723360 |
Protein GI | 113477299 |
COG category | [R] General function prediction only |
COG ID | [COG3211] Predicted phosphatase |
TIGRFAM ID | |
|
|
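The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 5940562
end_bp = 5942922

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Split the coding sequence into codons; a clean CDS leaves no remainder.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result matches the listed Gene Length (2361 bp) and Protein Length (786 aa).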
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0999872 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0977424 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAGGC TAACTCGCAG AAAACTGTTA ATGTTCTTCG GATGTAGCGC AGCTGCTACA GCACTATCAC CAAAAATTGA AAATTTTTTG GGTAGTAACT CTGAAGTGGC TCTTGCTCAA ACTCAAGGTT TGAGTTTCAC ACCCCTGAAA CTGGCGCATC CTTTAGAAGC TTATGAAAAA CATTCTAGCT TTGTACCTTT AGGAACTGGT GGAGAAGGAG CAACTTTAGG AGCAGGTGTA GATGTAGCAC TACAATCATA TCAGTACTTT GATGATGTAA TAGTACCTCC CGAATATGAA AGGTATGTAA TTGTTAGTTG GGGCGATCGC GTATTCCCTG ACTCTGAAGA GTACTTTGGT TATAATGCTG ACTATGTAAG TTTTATTCCT GTTAATGGTA ATCCTGATGA TGGTTACTTA TGGACTAACC ATGAGTATGT CTCATATCCA ATGTCACCAT TATTAGCACG TAGTGATGAC TTAGAAGGTT TCCCCACAAC AGACAAACTT GTACTAGGTT TAGATTTATC TCAGTCTAGT ATCTCCACCT TAGGTGAATT TGGTTATAAC CAAGGTGGTT CTATTGTCAG AATTAAAAAA GGTAGTAACG GTCAATATGC TACTGTAGCA GACAGTGCTA ATCGTCGAAT ACACCTCTTA TCTGGACTAG GAATTAACTC CGAACGTTCT GATAACTATC AAAGAGTTAC ATCTTGGGGT ACAGCTAGCT ATCAGACTGG AGACAAAAAC TTCTTAATTG GTACTGGACC TGCTGCTGTT GAAGTATTCC CTCTAAGTTC TGATGGACTG GGTAATAAAA TCATTGGTAC AGCATTCAAC TGTTCTGGTG GTACAACTCC CTGGGGAACA GTTCTAACTG CAGAAGAGAA CTTCCAAGGT AGCGTTACCG AAGCTGTATC ACCTAATGGT ACTCAAACTG GTTATAAGGA AGAGGGTATA GGTTTTACTT TTGGTTTAGT TGGTGAAAAG TATGGCTGGA TGGTAGAAGT TTCTCCAGCA GACCCAAGTT TCCAAAATAA GAAACATACA GCTTTAGGTC GTTTTCGCCA TGAAAATATT GCGTTCCGAG TAGAAGCAGG TAAACCGTTA GCAGCTTATA TGGGAGATGA CCGTCGTGGT GGTCACACAT GGAAGTTTGT GAGTGATGGT ATTGTTTCTA ATCCTACCGA TCCAAGTAAC AGCAGATTAT TTAATAGCGG AACTCTCTAT GCTGCACGCT TAAATCCTGA TGGTTCTGGT CAATGGATTC CATTAATTCC TGCCACACGT ACTAACCCTC TATCACCAAG AGAACTTGCT GAAGCTGAAT TAAATGTTTT TGGTAAAGCT CAAAGAGATG GTCGCATCCG CTTACCTCAA CGTCTTGGTA TTGCTGGAGG AGAAGAAAAT GGTGGGTATT TCATTGTAGA CTTAACAAAT GAAAGTGCAT TATCTGATTA TCAAGGCAAA ACTCTAGCAG ATTTCTACGA CAGCCAAGGT GCGATTTTAG TTGATGCTTT CTTAGCTGCT AACTTAGTAG GTGCCACTCC TACTGCTCGT CCTGAAGATT TAGAAGTTCA CCCTGGTGAT GGTAGTGTAT TTATTGCTTA TACCGATAAT GGACCTGGTG GAGATGGATA TCCAGATTCC AGAGTCTTTG TTGTGAGTAA ATACTCTGCA GATGTTAATG CTGCTCAACC TTTTGGTGGT ATCTACCGAA TCATTGAAAC AAACAGTGAT GTTACCAGTA CCACTTTTAC CTGGTCAGCG TTTGAGCAGA GTGGAGAAAA TGGTGCTGTT 
AATGGTCCAG GTTTTGCCAA TGTAGACAAC CTAGAAATTG ATACTTTGGG TAATATTTGG GGCGTAACAG ATATGTCTAC TAGTAGTCAT AATGGTTTCA ATACTGGCGC TGCTGGAGAG ATAAAAGAGA TTGACCACAC TCAAACAGGA AGTGTTGGTA ACTTGAGAGG AACATTTGGT AACAACTGGT TATTCTATAT TCCTGTCGTC GGAGAAAACG CTGGAATGGT TATACCTTTT GCTTATGGTC CCCCTCGTTG TGAAATCACT GGACCATACT TTATCAAAAA TCGTAGTGGT GTAAATGAAA CTCTGCTATT AGCTGTACAA CACCCTGGTG AAAGTTGTCC TATTGGAGAT GAAGTTAAAC TAGGTCGTAA TATCGAGATG TTAAACTTAG ATGGTAGCCT TTTTACTCAG CAACGAAGCG TACCTCGTGG AAGTAATTGG CCTAGTAACA CAGGGTATGT AGGTAATCCT GGAGGCTTTT TTAATGGTTT ACTGCCACCA AGACCTTCTG TCATTGGTGT TACTCGTAGA GACGGTGGTA AGTTTGTTTA A
|
Protein sequence | MSRLTRRKLL MFFGCSAAAT ALSPKIENFL GSNSEVALAQ TQGLSFTPLK LAHPLEAYEK HSSFVPLGTG GEGATLGAGV DVALQSYQYF DDVIVPPEYE RYVIVSWGDR VFPDSEEYFG YNADYVSFIP VNGNPDDGYL WTNHEYVSYP MSPLLARSDD LEGFPTTDKL VLGLDLSQSS ISTLGEFGYN QGGSIVRIKK GSNGQYATVA DSANRRIHLL SGLGINSERS DNYQRVTSWG TASYQTGDKN FLIGTGPAAV EVFPLSSDGL GNKIIGTAFN CSGGTTPWGT VLTAEENFQG SVTEAVSPNG TQTGYKEEGI GFTFGLVGEK YGWMVEVSPA DPSFQNKKHT ALGRFRHENI AFRVEAGKPL AAYMGDDRRG GHTWKFVSDG IVSNPTDPSN SRLFNSGTLY AARLNPDGSG QWIPLIPATR TNPLSPRELA EAELNVFGKA QRDGRIRLPQ RLGIAGGEEN GGYFIVDLTN ESALSDYQGK TLADFYDSQG AILVDAFLAA NLVGATPTAR PEDLEVHPGD GSVFIAYTDN GPGGDGYPDS RVFVVSKYSA DVNAAQPFGG IYRIIETNSD VTSTTFTWSA FEQSGENGAV NGPGFANVDN LEIDTLGNIW GVTDMSTSSH NGFNTGAAGE IKEIDHTQTG SVGNLRGTFG NNWLFYIPVV GENAGMVIPF AYGPPRCEIT GPYFIKNRSG VNETLLLAVQ HPGESCPIGD EVKLGRNIEM LNLDGSLFTQ QRSVPRGSNW PSNTGYVGNP GGFFNGLLPP RPSVIGVTRR DGGKFV
|
| |
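The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the sequence is given 5' to 3' on the coding strand, in frame from the first base, and uses the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code (the tables differ in permitted start codons, which this sketch ignores). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, not part of any database API.

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G at each position).
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna) if dna else 0.0


# First 30 bases of the gene sequence, as grouped in the record.
prefix = "ATGTCCAGGC TAACTCGCAG AAAACTGTTA"
print(translate(prefix))  # MSRLTRRKLL
```

The printed peptide matches the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.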