Gene Information |
Locus tag | Tery_0447 |
Symbol | |
ID | 4242225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 705273 |
End bp | 706340 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 638105765 |
Product | hypothetical protein |
Protein accession | YP_720379 |
Protein GI | 113474318 |
COG category | [R] General function prediction only |
COG ID | [COG1611] Predicted Rossmann fold nucleotide-binding protein |
TIGRFAM ID | [TIGR00725] conserved hypothetical protein, DprA/Smf-related, family 1; [TIGR00730] conserved hypothetical protein, DprA/Smf-related, family 2 |
|
|
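The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, that the CDS is unspliced (as the record states), and that it ends in a single stop codon. The function name `feature_lengths` is hypothetical and not part of any database API.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS
    ends with exactly one stop codon that is not translated.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the terminal stop codon yields one residue.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


if __name__ == "__main__":
    # Values taken from the Start bp and End bp fields of this record.
    print(feature_lengths(705273, 706340))  # (1068, 355)
```

The result matches the Gene Length (1068 bp) and Protein Length (355 aa) fields.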
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.101585 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTCAA ATCTTGACAA ATCTACAATT GAGTTCATTG AGCAAGAAGT AAGCAAATTA CTCAAAAATT TGCCCAACTT GAAACAGCAA AAATGGATAA GGCGATCGCT TTCCACTATA GTTAATATCG CAGGAGAAGA ACTTGAGTAC CTGGACTGGA AGATATTAGC GGCCTCCTTA CAAGACATGG AAAATGGATT TAAAACATTT TATCCCTATC GCCATATACC TAAAATTACT ATATTTGGTT CTGCACGAGT ATCCTCAGAT ACCCCAGAGT ATAAAATGGC AGTAGAATTT GCTCATCATA TGACTAAACA AGGTTTTATG GTAATGACCG GTGCAGGTCC TGGGATAATG CAAGCTGGTA ATGAGGGAGC AGGAAGCAAC AAATCTTTCG GTCTCAATAT TCAGTTACCT TTTGAGCAAG AGTCTAATCC ATTTATAAGA GGTAACAATA AATTAATTAA CTTTAAATAC TTTTTTACTC GTAAATTATT TTTCCTGAAA GAAACTGATG CTATAGCTTT ATTTCCAGGT GGTTTTGGTA CCCTAGATGA GGCATTTGAA ACTTTAACAC TAGTACAAAC AGGTAAATTT GGCCCCGCTC CTGTAATATT AATAGATTAT CCTGGTGGAA ACTATTGGTA TGACTGGAAT GATTTTATAA ATAAACAATT ACTCCTACGA GGTTTGATAA GTCCAAATGA TTATAATTTT TACACTATTA CAGATAACTT AGAATCAGCT TATGAAGCGA TCGCCAATTT TTATAGAGTT TATCATTCTA GTCGTTATGT AGGAGAAAAG TTTGTAATTC GTCTTAAATC AGAAATTTCA GACAAAGATG TAGACTTTTT GAATCAAGAA TTTAGGGATA TTTTAACTGT AGGAAATATA GAGAAAAGTC AAGTATTACC AGAAGAGTCC GCAGATAAAA CCTTTCACTT ACCACGTTTA GTTATGCATT TTAACCACAA GGACATAGGC AGACTATATC AAATGATTGA TCAGATAAAT AAAGTGGGAG GCAATATACA AAAATTCAGT CATTTAGGAA GTAAGTAA
|
Protein sequence | MASNLDKSTI EFIEQEVSKL LKNLPNLKQQ KWIRRSLSTI VNIAGEELEY LDWKILAASL QDMENGFKTF YPYRHIPKIT IFGSARVSSD TPEYKMAVEF AHHMTKQGFM VMTGAGPGIM QAGNEGAGSN KSFGLNIQLP FEQESNPFIR GNNKLINFKY FFTRKLFFLK ETDAIALFPG GFGTLDEAFE TLTLVQTGKF GPAPVILIDY PGGNYWYDWN DFINKQLLLR GLISPNDYNF YTITDNLESA YEAIANFYRV YHSSRYVGEK FVIRLKSEIS DKDVDFLNQE FRDILTVGNI EKSQVLPEES ADKTFHLPRL VMHFNHKDIG RLYQMIDQIN KVGGNIQKFS HLGSK
|
| |
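The sequences above are displayed in space-separated blocks of ten characters. The following minimal sketch shows one way to strip that display formatting and compute a GC fraction like the one reported in the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the database's exact method for computing GC content is an assumption.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in ("G", "C"))
    return gc_count / len(seq)


if __name__ == "__main__":
    # First two ten-base blocks of the gene sequence in this record.
    fragment = clean_sequence("ATGGCCTCAA ATCTTGACAA")
    print(len(fragment), gc_fraction(fragment))
```

Passing the full Gene sequence field through `clean_sequence` first should give a string whose length equals the Gene Length field.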