Gene Tery_4011 details


Gene Information

Locus tag: Tery_4011
Symbol:
ID: 4244578
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6202952
End bp: 6204331
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 41%
IMG OID: 638108923
Product: zeta-carotene desaturase / three-step phytoene desaturase
Protein accession: YP_723504
Protein GI: 113477443
COG category: [S] Function unknown
COG ID: [COG3349] Uncharacterized conserved protein
TIGRFAM ID: [TIGR02731] phytoene desaturase
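
The coordinate and length fields above agree with each other, which can be checked with a little arithmetic. The Python sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. Both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues in an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(6202952, 6204331)
print(length)                  # 1380
print(protein_length(length))  # 459
```

Under these assumptions the computed values match the Gene Length (1380 bp) and Protein Length (459 aa) fields.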


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTAGTGG CAATCGCGGG AGCAGGTCTA GCAGGTCTTT CCTGTGCTAA ATATTTGACA 
GATCTGGGCC ATACCCCAAT AGTCCTTGAA AGAAGGGACG TTCTTGGAGG TAAAGTAGCA
GCCTGGAAGG ACGAGGAGGG AGACTGGTAC GAAACTGGTC TACATATATT CTTTGGAGCT
TATCCAAATA TGTTACAGCT ATTTAAAGAG TTGGATATAC AGGATAGGTT GCAATGGAAA
GACCACACAA TGATTTTTAA TCAGCCAGAT CAACCAGGAA CATATTCTAG GTTTGATTTT
CCCAATATTC CAGCACCAGT AAATGGCATA GCGGCAATTC TGGGCAATAA TGATATGCTA
ACCTGGTTAG AAAAAATTAA GTTTGGCATA GGTTTAATTC CAGCTATGCT TCAAGGACAA
AAATATGTAG AAAAAATGGA CAAATACAGT TTTTCAGAAT GGCTGAAAAA ACAAAATGTC
CCTCCAAAAG TGGAGAAAGA AGTCTTCATC GCCATGTCAA AAGCATTGAA TTTTATCGGG
CCAGAAGAAA TATCCTCAAC TGTTATTCTC ACGGCTCTAA ACCGTTTCCT CCAAGAGAAA
AATGGTTCTA AAATGGCTTT TTTAGATGGT TCACCCACGG AGCGATTATG TCAACCTCTG
GTAGAATATA TTACTAAGGG AGGTGGGGAA GTATATCTAA ACTCACCTAT AAAAGAGTTT
TTGCTGAACG ACGATGGCAC AGTTAGTGGA TTCCTCATTA GAGGATTAGA AGCAGCAGAA
GATAGAGTTA TAAGTGTTGA TGCTTATGTC TCAGCAATGC CCGTGGATCC CTTAAAGGTA
ATGTTACCAT TACCCTGGCA ACAGATGGAA TATTTCCAAA AGCTCAAGGG TTTAGAAGGG
GTACCAGTAA TTAATCTGCA TTTGTGGTTT GATCGCAAGC TCACAGATAT AGATCATTTA
CTATTTTCAC GCTCACCCCT ACTCAGTGTT TATGCAGACA TGAGTAATAC CTGTCGCGAA
TATGCTAATC CCAATTGTTC CATGTTGGAG CTAGTATTAG CACCAGCTAA AGATTGGATT
AGTAAATCTG AGCAAGAAAT AGTCGCAGCA ACAATGGCAG AGTTAGAAAA ATTATTTCCG
GCTCACTTTA CAGGAGAAGA TCCAGCTAAG TTGTTGAAAT ATCACGTCGT CAAAACACCC
CGCTCTGTAT ATAAAGCTAC CCCAGGTCGT CAAGACTGTC GCCCTTCTCA AGTAACTCCA
ATTGCCAATT TCTTCCTTAC AGGAGATTAT ACAATGCAAC GCTATCTAGC TAGCATGGAA
GGAGCCGTAC TTTCTGGCAA GCTGACAGCG CAGGCGATCG CAAAAGCAAA GCTACCCTAA
 
Protein sequence
MLVAIAGAGL AGLSCAKYLT DLGHTPIVLE RRDVLGGKVA AWKDEEGDWY ETGLHIFFGA 
YPNMLQLFKE LDIQDRLQWK DHTMIFNQPD QPGTYSRFDF PNIPAPVNGI AAILGNNDML
TWLEKIKFGI GLIPAMLQGQ KYVEKMDKYS FSEWLKKQNV PPKVEKEVFI AMSKALNFIG
PEEISSTVIL TALNRFLQEK NGSKMAFLDG SPTERLCQPL VEYITKGGGE VYLNSPIKEF
LLNDDGTVSG FLIRGLEAAE DRVISVDAYV SAMPVDPLKV MLPLPWQQME YFQKLKGLEG
VPVINLHLWF DRKLTDIDHL LFSRSPLLSV YADMSNTCRE YANPNCSMLE LVLAPAKDWI
SKSEQEIVAA TMAELEKLFP AHFTGEDPAK LLKYHVVKTP RSVYKATPGR QDCRPSQVTP
IANFFLTGDY TMQRYLASME GAVLSGKLTA QAIAKAKLP