Gene Tery_4785 details

Gene Information

Locus tag: Tery_4785
Symbol:
ID: 4246439
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 7352883
End bp: 7355186
Gene Length: 2304 bp
Protein Length: 767 aa
Translation table: 11
GC content: 42%
IMG OID: 638109633
Product: selenophosphate synthase
Protein accession: YP_724209
Protein GI: 113478148
COG category: [C] Energy production and conversion; [E] Amino acid transport and metabolism
COG ID: [COG0709] Selenophosphate synthase; [COG1252] NADH dehydrogenase, FAD-containing subunit
TIGRFAM ID: [TIGR00476] selenium donor protein; [TIGR03169] pyridine nucleotide-disulfide oxidoreductase family protein
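
The start and end coordinates, gene length, and protein length in this record can be cross-checked arithmetically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; neither assumption is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length_bp(7352883, 7355186)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)  # prints: 2304 767
```

Under these assumptions the computed values agree with the listed Gene Length (2304 bp) and Protein Length (767 aa).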


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.211617
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTCC CTATACCACC AAAAACAGAT TTAGTCCTAA TAGGAGGCGG TCATAGCCAT 
GCGATCGCTC TCCGAAAATT TGCCATGAAC CCTATACCAG GGGTTAGACT AACCCTAATC
ACAGATGTAT ATCACACTCC TTACTCTGGA ATGTTACCAG GATATGTGGC AGGTTTATAC
AACTTCGATC AATGTCACAT AGATCTACTC CCCCTCGCCA AATTTGCTGG GGCTAGAATA
TTTTTAAGCC ACGCTATAGG TCTAGACCTG GAGAAAAATC AGATACTTTG CACTAGTCGT
CCCCCTGTAA ATTTCGATTT ACTTTCTATT GATATTGGTA GCACTCCCGC TAGTTTAACT
GTACCGGGGG CAATAGAATA TGCTATTGCG GTTAAACCAA TATCAAAATT TCTCACCTAT
TGGGATAAAA TAACTACAAT GATTTCCAAA TCTCCAACAC AGAAAACGCG GATCGCAATA
GTGGGTGGTG GTGCTGGGGG GGTAGAGTTA GCTTTCGCGG TTCAAAGTCA TCTCCATCAG
ATTTATCGCA ATGCTAAACA ACCTACTGAT AATTTGGAAT TGCATCTATT TCATAGAGGG
AAGAGATTAT TACCACAGCG TCATCATTGG GTAGGAAAGG AAGTTGAGAA AATTCTTAAA
AGTCGCGGTG TGCTATTGCA TTTAAAGGAA AGTGTTGAGG AGGTTAGGGG TGATGACCAG
GAGCTTCCCA CGAATGTAGA GGGAAATAAA ACATTCCATT CTTTATACTC TACTCTCAAA
ATGATTTGTT GTGCTTCAGG GTTGGAAATA GAGTGTGATA TTCTGTTTTG GGTGACTCAA
GCATCGGCCT CTCCTTGGTT ACAAAAAGCA GGTTTGGCAA CAGATACCAG AGGTTTTGTT
TTGGTTAATG AGAAATTACA GTCAATTTCT CATCCTCAAG TCTTTGCGGC CGGAGATGTG
GCAACAATAA TCAATCATTC TCGCCCCAAA GCTGGAGTGT TTGCGGTGCG ACAGGGACAA
CCTTTGTTTG AAAATCTACA ACGGGCTCTT AAGAAAAAAT TGCTGAAGTC ATTTATACCT
CAGAAAAAGT TCTTGATTTT AATTGGTACT GGTGATGAAC GGGCGATCGC TTCCCGTGGC
AGAATAGGTT TTGGTCCTCA TAAGTTACTT TGGCGTTGGA AAAACCGCAT TGACCGTAAA
TTTATGGCTC AATTCTCTAA TCTAAAAATG GAAGACAGAA CGCAGAAGGC AGAAGACAGA
AAACAAGATT CCCAAATGTA CTGTGCTGGT TGTGGAGCAA AAGTTACTAG TAGGGTTTTG
GAAAATGTGC TGCATAAAAT TCCACAGGAA ATCAGGAGGG ATGATATTTT GATTGGTATG
GATACACCTG ATGATGCTGC AGTGATTAGG GTACCAGCGG ATAGGGTAAT GGTACAGACG
ATCGATTATT TTCGAGGGTT GCTTGATGAC CCCTATTTGT TGGGAAAAAT TACGGCTAAT
CATTGTTTGA GTGATTTATT TGCTATGGGG GCAATGGCTC AGAGTGGGTT AGCGATCGCT
ACTATTCCCT ACGCTGCCCC AAGTAAACAG GAGGATACTT TATATCAATT GTTATTGGGA
GCAACGGAAG TTTTGAATCA ATCTGGTGCT GTTCTGATAG GTGGTCATAC AACAGAGGGA
GAAGAGCTGG CTTTTGGTCT CACCTGTAAT GGTTTAGTGT CACAAAAAAA ACTATTGTAC
AAGGGAGGGA TGCAACCGGG GGATATGCTG ATATTAACGA AAGCTTTGGG AACAGGGACG
TTATTTGCTG CAGATATGCG TCTGAAGGCA AGGGGGCGTT GGATTGAAAG TGCTATTAAG
TCAATGTTGG GTTCTAATCA GAATGCTGCA CAGTTATTAT TAGAGTATGG TGTTACTGCT
TGTACGGATG TGACTGGATT TGGATTAGTC GGACATTTGT TGGAAATGTT AAAAGGGCAA
AGAGTGGGCG TGGAATTGGA TATGGAAGCT ATTCCTGTGT TGCCAGGTGT AGCTGAAACT
TTGGAACAAG GTATTTTTAG TTCTCTTTAT CCGGGAAATT TGCAAATGTC TGCATCAATT
CAAAATCGTG AACAAGCGAG TATGTATCCT CTTTATCCTC TGTTGTTTGA CCCGCAAACT
TCTGGTGGGT TGCTTGCTAC AGTGCCAGCT AATTTAGCGA GTGCTTGTTT GAATGCGCTA
CAACAGGAGT ATTTTGAAAC TAGAATTATT GGGCGAGTAT TATTGTTAGA TACAGGGATG
CTGCCCATTA GAATTCATTT TTGA
 
Protein sequence
MKFPIPPKTD LVLIGGGHSH AIALRKFAMN PIPGVRLTLI TDVYHTPYSG MLPGYVAGLY 
NFDQCHIDLL PLAKFAGARI FLSHAIGLDL EKNQILCTSR PPVNFDLLSI DIGSTPASLT
VPGAIEYAIA VKPISKFLTY WDKITTMISK SPTQKTRIAI VGGGAGGVEL AFAVQSHLHQ
IYRNAKQPTD NLELHLFHRG KRLLPQRHHW VGKEVEKILK SRGVLLHLKE SVEEVRGDDQ
ELPTNVEGNK TFHSLYSTLK MICCASGLEI ECDILFWVTQ ASASPWLQKA GLATDTRGFV
LVNEKLQSIS HPQVFAAGDV ATIINHSRPK AGVFAVRQGQ PLFENLQRAL KKKLLKSFIP
QKKFLILIGT GDERAIASRG RIGFGPHKLL WRWKNRIDRK FMAQFSNLKM EDRTQKAEDR
KQDSQMYCAG CGAKVTSRVL ENVLHKIPQE IRRDDILIGM DTPDDAAVIR VPADRVMVQT
IDYFRGLLDD PYLLGKITAN HCLSDLFAMG AMAQSGLAIA TIPYAAPSKQ EDTLYQLLLG
ATEVLNQSGA VLIGGHTTEG EELAFGLTCN GLVSQKKLLY KGGMQPGDML ILTKALGTGT
LFAADMRLKA RGRWIESAIK SMLGSNQNAA QLLLEYGVTA CTDVTGFGLV GHLLEMLKGQ
RVGVELDMEA IPVLPGVAET LEQGIFSSLY PGNLQMSASI QNREQASMYP LYPLLFDPQT
SGGLLATVPA NLASACLNAL QQEYFETRII GRVLLLDTGM LPIRIHF
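
The protein sequence above can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the standard amino acid assignments, which translation table 11 shares with the standard code (to my understanding the two differ only in the permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool. It translates only the first and last sequence blocks shown above.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA to protein, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the record) is ignored.
    Ambiguous bases such as N are not handled and raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 24 bases of the gene sequence in this record.
first_block = translate("ATGAAATTCC CTATACCACC AAAAACAGAT")
last_block = translate("CTGCCCATTA GAATTCATTT TTGA")
print(first_block)  # prints: MKFPIPPKTD
print(last_block)   # prints: LPIRIHF
```

Both results match the start and end of the listed protein sequence, and the final codon TGA is a stop codon, consistent with the 767 aa length.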