Gene Tery_4358 details

Gene Information

Locus tag: Tery_4358
Symbol:
ID: 4246011
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6718583
End bp: 6719845
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 34%
IMG OID: 638109246
Product: cysteine desulfurase
Protein accession: YP_723823
Protein GI: 113477762
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
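
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(6718583, 6719845)
print(length, protein_length(length))
```

Under those assumptions the values reproduce the 1263 bp gene length and 420 aa protein length reported in the record.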


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.124519
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.628114
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTATTA TTCAAGAAAG AACTTTGGCA GAAAAATTAC GTGCAGATTT TCCAATTTTA 
AATCAGGAAA TAAATGGTGA ACCACTTATT TATTTAGATA ATGCTGCTAC TTCCCAAAAA
CCATTAGCAG TTATCAACGC TTGGCAAGAA TATTACCTCA AATATAATTC TAATGTGCAT
CGGGGTATTC ATACTTTAAG CTCGAAGGCA ACAGATGCTT ATGAAGGGGC AAGAGATAAA
GTGGCTGCTT TGATTAATGC GGCATCTCGG AATGAAATTA TTTATACTCG AAATGCTAGC
GAAGCAGTTA ATTTGGTTGC TTATTCTTGG GGTTTAAATA ATCTTAAATC AGGAGATGAA
ATAATTGTCT CTGTGATGGA ACATCATAGT AATTTTGTTC CCTGGCAAAT GGTTGCTCAA
AAAACTGGAT CAGTTTTAAA ATTTGTTGAG TTGAATGAAA CTGAAGAACT TAATTTAGAA
CAATATAAAG CTCTAATTTC AGACAAAACA AAGTTAGTTG CATTAGCTCA TGTTTCTAAT
GTTTTAGGTT GTATTAATCC AATTCAAGAA ATTTGTTCAA TTGCTCATAA AAATGGAGCT
AAAGTATTAA TAGATGCTTG CCAAAGTGTA CCTCATTGTG TGGTAGATGT GCAGTCAATA
GATTGTGATT GGTTAGTAGC TTCCGGCCAT AAAATGTGCG CTCCTACTGG TATTGGTTTT
TTGTATGGTA AGTTGGAATT ACTAAAAGAA ATGCCACCAT TTTTAGGAGG GGGTGAAATG
ATTTCTGAGG TGTTTCTTGA TCATTATACT TATGCAGAAT TACCTCATAA ATTTGAAGCA
GGAACCCCAG CGATAGGAGA GGCGATCGCT CTTGGTGCAG CAGTAGATTA TCTCACAAAT
ATAGGTATGG AAAAAATTCA TAATTATGAA GTAGAATTAA CTACCTATTT ATTTAATAAA
TTACGTCAAA TTCCTCAAAT TACTATTTAC GGACCTCAAC CAAATACCTA TGGAGAAGGT
AGAGGGACAT TAGTATCTTT TACAGTAGAA AATATTCATC CTAACGATTT ATCAACAATG
TTAGATGAAG CAGGGATAGC AATTCGTTCT GGTCATCATT GTGCTCAACC TTTGCATCAA
TATTTAAAGG TTTCATCCAC AGCAAGAGCA AGTTTATCTT TTTATAATAC TCGTGATGAT
ATTGATATTT TTGTTGATGC TTTGAAAGAT ACAATCAATT TTTTTGCTGA TATTATGGGT
TGA
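
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned up and a GC percentage computed, for comparison with the GC content field of the record. The helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied as displayed (six blocks of ten bases).
first_row = "ATGACTATTA TTCAAGAAAG AACTTTGGCA GAAAAATTAC GTGCAGATTT TCCAATTTTA"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Passing all rows of the gene sequence to clean_sequence, rather than just the first, gives the value for the whole gene.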
 
Protein sequence
MTIIQERTLA EKLRADFPIL NQEINGEPLI YLDNAATSQK PLAVINAWQE YYLKYNSNVH 
RGIHTLSSKA TDAYEGARDK VAALINAASR NEIIYTRNAS EAVNLVAYSW GLNNLKSGDE
IIVSVMEHHS NFVPWQMVAQ KTGSVLKFVE LNETEELNLE QYKALISDKT KLVALAHVSN
VLGCINPIQE ICSIAHKNGA KVLIDACQSV PHCVVDVQSI DCDWLVASGH KMCAPTGIGF
LYGKLELLKE MPPFLGGGEM ISEVFLDHYT YAELPHKFEA GTPAIGEAIA LGAAVDYLTN
IGMEKIHNYE VELTTYLFNK LRQIPQITIY GPQPNTYGEG RGTLVSFTVE NIHPNDLSTM
LDEAGIAIRS GHHCAQPLHQ YLKVSSTARA SLSFYNTRDD IDIFVDALKD TINFFADIMG