Gene Tery_3954 details

Gene Information

Locus tag: Tery_3954
Symbol:
ID: 4244037
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6113981
End bp: 6115432
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 39%
IMG OID: 638108872
Product: zeta-carotene desaturase
Protein accession: YP_723454
Protein GI: 113477393
COG category: [S] Function unknown
COG ID: [COG3349] Uncharacterized conserved protein
TIGRFAM ID: [TIGR02732] carotene 7,8-desaturase
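
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(6113981, 6115432)
print(length)                  # 1452
print(protein_length(length))  # 483
```

Under those assumptions, both results agree with the gene length and protein length given in the record.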


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.134571
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.596487
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAGTTG CGATCGCAGG TGCAGGATTA GCAGGAATGG CCACAGCAGT TGAATTGGCT 
GATGCTGGAC ATCAAGTGGA AATATTTGAA TCTCGCCCCT TTGTTGGAGG TAAAGTAGGT
AGTTGGGTAG ATAAGAATGG AAACCATGTA GAAATGGGTT TGCACGTTTT CTTTGGCTGT
TACTATCAAC TTTTTGAGTT AATGAAAAAA GTAGGGGCCT TTAAAAACCT CCGCCTCAAA
GAACATAGTC ATAATTTTAT TAATAAAGGG GGCAAAACAG GTGCTCTAGA TTTTCGCTTC
CTCACGGGAG CGCCTTTTAA TGGATTAAAA GCTTTTTTTA CCACATCCCA GCTGTCTGCG
CAAGACAAAC TACAAAATGC AATAGCCCTC GGCACCAGTC CTATAGTCAG AGGGTTGATA
GATTTTGATG GGGCAATGAA AACTATCCGA GATTTGGATA AAGTTAGTTT TGGAGAGTGG
TTCCGTCGCC AAGGTGGGAG TAATGGAAGT ATCAAAAGAA TGTGGAACCC TATTGCTTAT
GCTTTGGGTT TCATTGATGC AGATAATATT TCTGCCCGAT GTATGTTGAC AATATTCCAG
TTTTTTGCTG CTAAAACAGA AGCATCTGTA TTGCGGATGT TGAATGGTTC TCCTTATGAA
TATCTCCACA AACCTATTGT TGATTATTTG GAGGCACGGG GCACTAAAAT CTATACTAGA
AAAAAAGTTA GGCAAATACA ATTTTTAGAG AATGATGGAG AAACTCGTGT TAGTGGAATA
GTTATAGCTA ATGGGGATAC TGAAGTTACT ATTACTGCTG ATGCTTATGT TTTTGCCTGC
GATGTACCAG GTATTCAGCG TATTCTACCT GAAGCTTGGC GAAAATGGCC AGAGTTTGAC
AATATCTATA AATTAGATGC AGTACCAGTT GCTACAGTAC AGTTAAGATT TGATGGTTGG
GTAACCGAAC TACATAATAA AAACCAACGT CAACAATTAG ATCATGCTGC TGGTATTGAT
AATTTACTTT ATACTCCAGA TGCTGATTTT TCCTGTTTTG CTGATTTGGC ATTAACAAGT
CCAGAAGATT ATTATCGGCA AGGAGAAGGT TCTTTATTAC AGTTGGTATT AACACCAGGA
GATCCTTTTA TTAAAGAAAA TAATGAGGCG ATCGCTCACC ATGTTTTAAA ACAAGTCCAT
GAATTATTTC CTTCATCACG GGAGTTAAAT ATGACTTGGT ATAGTGTGGT GAAATTAGCT
CAGTCTTTAT ATCGGGAAGC TCCAGGTATG GATCCATATC GTCCGAACCA AAAAACACCA
GTGCCTAACT TTTTTCTGGC AGGTAGCTAT ACTCAACAAG ATTATATTGA TAGTATGGAG
GGGGCAACTA TTTCTGGGAA ACAAGCTGCT CAGATAATTT TGACAAATGT GGAAAAGATT
TTGGGGCAAT AG
 
Protein sequence
MRVAIAGAGL AGMATAVELA DAGHQVEIFE SRPFVGGKVG SWVDKNGNHV EMGLHVFFGC 
YYQLFELMKK VGAFKNLRLK EHSHNFINKG GKTGALDFRF LTGAPFNGLK AFFTTSQLSA
QDKLQNAIAL GTSPIVRGLI DFDGAMKTIR DLDKVSFGEW FRRQGGSNGS IKRMWNPIAY
ALGFIDADNI SARCMLTIFQ FFAAKTEASV LRMLNGSPYE YLHKPIVDYL EARGTKIYTR
KKVRQIQFLE NDGETRVSGI VIANGDTEVT ITADAYVFAC DVPGIQRILP EAWRKWPEFD
NIYKLDAVPV ATVQLRFDGW VTELHNKNQR QQLDHAAGID NLLYTPDADF SCFADLALTS
PEDYYRQGEG SLLQLVLTPG DPFIKENNEA IAHHVLKQVH ELFPSSRELN MTWYSVVKLA
QSLYREAPGM DPYRPNQKTP VPNFFLAGSY TQQDYIDSME GATISGKQAA QIILTNVEKI
LGQ
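
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is illustrative only: it hard-codes the standard codon-to-amino-acid assignments, which translation table 11 shares (the two tables differ in which start codons are permitted), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. It is applied here to the first 30 bases and the last 12 bases of the gene sequence rather than to the full record.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases; uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


print(translate("ATGCGAGTTG CGATCGCAGG TGCAGGATTA"))  # MRVAIAGAGL
print(translate("TTGGGGCAAT AG"))                     # LGQ
```

Both outputs match the corresponding ends of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content reported in the record.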