Gene Tery_3794 details

Gene Information

Locus tag: Tery_3794
Symbol:
ID: 4243742
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 5830592
End bp: 5831986
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 40%
IMG OID: 638108729
Product: carotenoid oxygenase
Protein accession: YP_723313
Protein GI: 113477252
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes
TIGRFAM ID:
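
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon; the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(5830592, 5831986)
print(length_bp)                  # 1395
print(protein_length(length_bp))  # 464
```

Both values match the Gene Length (1395 bp) and Protein Length (464 aa) fields of this record.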


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0662225
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTACAA CAACAAAATT TAACCCCTAC TTAATGAGTA ATTTTGCCCC AGTCAGCAAA 
GAAATAACCA CTAACAACCT AGAAATTAAA GGTGAACTCC CTAGAGACTT ATCAGGAATG
TATCTTAGAA ATGGCCCAAA TCCTCAATTT ACCCCTTTAG GTAAATATGA TTGGATGGAT
GGCGATGGAA TGATACATGG AGTAAGAATT AGTGGTGGTA GAGCAGAATA TCGAAACCGC
TATGTAAGGA CGGGAGGATT TGAAGTAGAA AAAGACTTAG GTTATGCCGT CTGGAGCGGC
AGATTAGAAC CTCCCCAAAT GAATAATCCC TATGGCCCAT ATAAAAACGT GGCCAATAAT
GGTCTAACTT TTCACAACGG AAAACTATTA GCCCTTTGGG AAGCAGGTTT ACCTTATGAA
ATTAGAGTTC CCAGCCTAGA AACAACAGGA CCATATTTGT GTGATGGTTA CTTAGACTCT
GCTTTTACTG CTCATCCAAA AATTGACCCC ATGACCGGAG AATTAATATT TTTTGGTTAC
GCCCTTGATG TAGCTCCATA TTTAAAATAT GGTATAATCT CAGCTAGAGG CGAGCTATTA
CAGATAACAC CAATTAATAT ACCCGTTCCA AGTGGTCCCC ATGATTTTGC CATCACAAAA
AATTATACAA TTTTAATGGA CTTACCATTA CGCTTCCGTC CAGAAAGAGA AAAACAAGGT
TTACCCGCTT GGATGTTTGA AACTGGAGTC CCCAGTCGTT TTGGTATTTT ACCTCGCTGT
GGAAACAATG AAGACATTCA CTGGTTTGAA GCTCAACCTT GCTATATGAA TCATATTCTC
AATGCTTACG AATTTGAAGA TGAAATAATA TTATACGGTT GTCGCATGAG TTCAACTTAT
TGGTATCCTG GTACTGCTAG AGACCCCAAT GAGAATATTC CCCGCATGTA TGGATGGGCA
TTTAATCTCA AAACCGGTGC TGTTAGAGAA GGGATGTTAG ACAAGAGACC CTCCGAATTT
CCTTGTATTA ATCATAGGTT TGTCGGTCGG CAAATGCGCT ATGGTTATAC TTCTAGAATG
GCATCCTTGC GACCATTGTT TGATGGAATT ATTAAATATG ACTTCAGTAC TGGTAGATCC
CTATCCCATG ACTTTGGTGA AGGTCGTTAT GGGGGCGCCA CAACCTTTGC TCCGAGAATA
GGTTCTATTG ATGAAGACGA TGGTTGGTTG TTAACCTTTG TATATGATGA AAAAGAGAAA
CATTCTGAAT TATTGGTTAT AGATGCTCAG GATATTATGG GAGAGCCTGT AGCACGGGTA
ATGTTACCTC AGAGAGTACC TTATGGTTTT CATGGTACTT GGGTATCTGA AATGCAGTTG
ATAGCAAGTA ACTAA
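
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be joined before any analysis. As a rough sketch of how the GC content field could be recomputed from such a listing, the hypothetical helper below strips the whitespace and returns the fraction of G and C bases. It is not the database's own method, and no result for this gene is asserted here.

```python
def gc_fraction(blocked_sequence: str) -> float:
    """Fraction of G/C bases in a sequence that may contain spaces and newlines."""
    seq = "".join(blocked_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Small illustration on a made-up 8-base string, not on the gene itself.
print(gc_fraction("ATGC GGCC"))  # 0.75
```

Passing the full gene sequence as one string and multiplying the result by 100 gives a percentage that can be compared with the GC content field.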
 
Protein sequence
MTTTTKFNPY LMSNFAPVSK EITTNNLEIK GELPRDLSGM YLRNGPNPQF TPLGKYDWMD 
GDGMIHGVRI SGGRAEYRNR YVRTGGFEVE KDLGYAVWSG RLEPPQMNNP YGPYKNVANN
GLTFHNGKLL ALWEAGLPYE IRVPSLETTG PYLCDGYLDS AFTAHPKIDP MTGELIFFGY
ALDVAPYLKY GIISARGELL QITPINIPVP SGPHDFAITK NYTILMDLPL RFRPEREKQG
LPAWMFETGV PSRFGILPRC GNNEDIHWFE AQPCYMNHIL NAYEFEDEII LYGCRMSSTY
WYPGTARDPN ENIPRMYGWA FNLKTGAVRE GMLDKRPSEF PCINHRFVGR QMRYGYTSRM
ASLRPLFDGI IKYDFSTGRS LSHDFGEGRY GGATTFAPRI GSIDEDDGWL LTFVYDEKEK
HSELLVIDAQ DIMGEPVARV MLPQRVPYGF HGTWVSEMQL IASN
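
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is a minimal, dependency-free translator and not the tool used by the database. It builds the codon table from the standard compact amino acid string, whose codon assignments table 11 shares with the standard code (the two differ in permitted start codons, which this sketch ignores). The names `CODON_TABLE` and `translate` are illustrative, and ambiguous bases such as N are not handled.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first base varies slowest, third base fastest).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(blocked_dna: str, to_stop: bool = True) -> str:
    """Translate a DNA string (spaces/newlines allowed) in reading frame 1."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and the last two blocks of the gene sequence.
print(translate("ATGACTACAA CAACAAAATT TAACCCCTAC"))  # MTTTTKFNPY
print(translate("ATAGCAAGTA ACTAA"))                  # IASN
```

The two outputs agree with the first ten and last four residues of the protein sequence listed above.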