Gene Information |
Locus tag | Tery_4358 |
Symbol | |
ID | 4246011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 6718583 |
End bp | 6719845 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 638109246 |
Product | cysteine desulfurase |
Protein accession | YP_723823 |
Protein GI | 113477762 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
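The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates (which is consistent with the listed gene length) and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


if __name__ == "__main__":
    # Values copied from the Tery_4358 record above.
    length = gene_length_bp(6718583, 6719845)
    print(length)                     # 1263
    print(protein_length_aa(length))  # 420
```

Under these assumptions, the computed values agree with the Gene Length (1263 bp) and Protein Length (420 aa) fields.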
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.124519 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.628114 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTATTA TTCAAGAAAG AACTTTGGCA GAAAAATTAC GTGCAGATTT TCCAATTTTA AATCAGGAAA TAAATGGTGA ACCACTTATT TATTTAGATA ATGCTGCTAC TTCCCAAAAA CCATTAGCAG TTATCAACGC TTGGCAAGAA TATTACCTCA AATATAATTC TAATGTGCAT CGGGGTATTC ATACTTTAAG CTCGAAGGCA ACAGATGCTT ATGAAGGGGC AAGAGATAAA GTGGCTGCTT TGATTAATGC GGCATCTCGG AATGAAATTA TTTATACTCG AAATGCTAGC GAAGCAGTTA ATTTGGTTGC TTATTCTTGG GGTTTAAATA ATCTTAAATC AGGAGATGAA ATAATTGTCT CTGTGATGGA ACATCATAGT AATTTTGTTC CCTGGCAAAT GGTTGCTCAA AAAACTGGAT CAGTTTTAAA ATTTGTTGAG TTGAATGAAA CTGAAGAACT TAATTTAGAA CAATATAAAG CTCTAATTTC AGACAAAACA AAGTTAGTTG CATTAGCTCA TGTTTCTAAT GTTTTAGGTT GTATTAATCC AATTCAAGAA ATTTGTTCAA TTGCTCATAA AAATGGAGCT AAAGTATTAA TAGATGCTTG CCAAAGTGTA CCTCATTGTG TGGTAGATGT GCAGTCAATA GATTGTGATT GGTTAGTAGC TTCCGGCCAT AAAATGTGCG CTCCTACTGG TATTGGTTTT TTGTATGGTA AGTTGGAATT ACTAAAAGAA ATGCCACCAT TTTTAGGAGG GGGTGAAATG ATTTCTGAGG TGTTTCTTGA TCATTATACT TATGCAGAAT TACCTCATAA ATTTGAAGCA GGAACCCCAG CGATAGGAGA GGCGATCGCT CTTGGTGCAG CAGTAGATTA TCTCACAAAT ATAGGTATGG AAAAAATTCA TAATTATGAA GTAGAATTAA CTACCTATTT ATTTAATAAA TTACGTCAAA TTCCTCAAAT TACTATTTAC GGACCTCAAC CAAATACCTA TGGAGAAGGT AGAGGGACAT TAGTATCTTT TACAGTAGAA AATATTCATC CTAACGATTT ATCAACAATG TTAGATGAAG CAGGGATAGC AATTCGTTCT GGTCATCATT GTGCTCAACC TTTGCATCAA TATTTAAAGG TTTCATCCAC AGCAAGAGCA AGTTTATCTT TTTATAATAC TCGTGATGAT ATTGATATTT TTGTTGATGC TTTGAAAGAT ACAATCAATT TTTTTGCTGA TATTATGGGT TGA
|
Protein sequence | MTIIQERTLA EKLRADFPIL NQEINGEPLI YLDNAATSQK PLAVINAWQE YYLKYNSNVH RGIHTLSSKA TDAYEGARDK VAALINAASR NEIIYTRNAS EAVNLVAYSW GLNNLKSGDE IIVSVMEHHS NFVPWQMVAQ KTGSVLKFVE LNETEELNLE QYKALISDKT KLVALAHVSN VLGCINPIQE ICSIAHKNGA KVLIDACQSV PHCVVDVQSI DCDWLVASGH KMCAPTGIGF LYGKLELLKE MPPFLGGGEM ISEVFLDHYT YAELPHKFEA GTPAIGEAIA LGAAVDYLTN IGMEKIHNYE VELTTYLFNK LRQIPQITIY GPQPNTYGEG RGTLVSFTVE NIHPNDLSTM LDEAGIAIRS GHHCAQPLHQ YLKVSSTARA SLSFYNTRDD IDIFVDALKD TINFFADIMG
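The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal standard-library sketch, not the database's own tooling, for removing the spacing, computing GC content, and translating the gene sequence. All function names are hypothetical. The codon-to-amino-acid mapping used is that of the standard genetic code, which translation table 11 shares (table 11 differs in its set of permitted start codons, which this sketch does not model).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the display whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 21 bases of the gene sequence above, pasted with display spacing.
    head = clean_sequence("ATGACTATTA TTCAAGAAAG A")
    print(translate(head))  # MTIIQER
```

Pasting the full Gene sequence field into `clean_sequence` and passing the result to `gc_percent` and `translate` gives values that can be compared with the GC content and Protein sequence fields of the record.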