Gene Tery_4346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4346 
Symbol 
ID4245998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6699593 
End bp6700990 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content34% 
IMG OID638109233 
Productpheophorbide a oxygenase 
Protein accessionYP_723811 
Protein GI113477750 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.18517 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGCTA ATACCATCAA AAAATCCGAC AATATTCCTG TAAGTGCAGG TGGAACAGAC 
CCAGAATATT TTGATTGGCA AGAAGTTTGG TATCCAATTT ATTATATAGA AGACTTAGAG
AAAAATAAAC CTACAACATT TACATTATTA GAACAAGATC TTGTGATTTG GTGGGAAGAA
AAAAACAATC AATGGCGGGT ATTTGAAGAT CAATGTCCCC ATCGTTTAGC ACCCCTTTCT
CAAGGTAGAA TTAATGAGGC TGGATGCTTA GAATGCCCCT ATCATGGTTG GGCATTTTCA
GGGGGAGGAA ATTGTGAAAT TATTCCTCAA CAAATTGCAG GAGGAAAGGC AGAAAAATCA
TCAAGAGCAA AAGTTAAATC TCTACCTACA AAAGTTTGTC AAGGTCTTTT ATTTGTCTAT
GCAGGAAAAA TCGAAAATGC TGCTCAAACA CCTATTCCTA AAGTTGATGT TTTAGATGAA
AATTCTAATG AATGGATTTG TTTAAATACT TTTCGAGATG TACCTTATGA TGGTTTAACA
TTAATGGAAA ATGTCTTAGA TGCTAGTCAT ATTCCTTATA CTCATCATCG TACTGTGGGA
AACCGCGCTA ATGTTGCTCC TGTAGAATTA GAAGTTTTAG AATCTGGAAA ATGGGGTTTT
AAAGGGGTCT GGGAAGAAGG TCCAAGAAAA GGAACTTTAG GAAGACAAGA GACTACTTTT
ATTGCTCCTG GAATGATGTG GCATGACCTT ACTTCTAAAC AATTTGGTCG TACATTAACA
GTTGTATATG CAACACCTAT TCGTAAAGGA GAATGTCGTT TATTTGCCCG TTTCCCTTTT
AAGTTTTCTT CTCCCTTACC GAAATTTTTT ATTCAGTTAA GCCCCCGTTG GTATTCTCAT
ATTGGGCAAA ATGGAGTATT GGAAGATGAC CAAATATTTT TACATTATCA AGAACGATAT
TTAGAAGCAA AGGGTGGTAG TGCTAATTTT TCTAAGGCTT TTTATTTACC TACAAAAGCA
GATTTATTTG TATTTGAATT GCGTTCTTGG GTGAATAAAT ATAATGCTCA ATTATTTCCT
AATGCAACTT TAAGTTCTGC TCTGAACTCA GAAATATTGT TAGATAGGTA TCATTCTCAT
ACTAAAAAAT GTAGTAGTTG TCGCAGAGCT TTAAAAAATT TACAACGAAT AAAGGTTGGA
GTCGTGCTTG TGACTTCATT TATCTGGGCA AGTATTTTTT TTATGCTGTT AATATTAGAT
GATTTTAATA TGACTTTGAT GACTTTTTTG ATTTTAAGTT TGCCTGTTGG AGTTGTTTTT
TGGTTACTGT TAAGTAAATT AGAAAAACAG TTTTATCAAG GACGAGAAAT ACCTCCAAGA
AATTTATTAA ATAATTGA
 
Protein sequence
MIANTIKKSD NIPVSAGGTD PEYFDWQEVW YPIYYIEDLE KNKPTTFTLL EQDLVIWWEE 
KNNQWRVFED QCPHRLAPLS QGRINEAGCL ECPYHGWAFS GGGNCEIIPQ QIAGGKAEKS
SRAKVKSLPT KVCQGLLFVY AGKIENAAQT PIPKVDVLDE NSNEWICLNT FRDVPYDGLT
LMENVLDASH IPYTHHRTVG NRANVAPVEL EVLESGKWGF KGVWEEGPRK GTLGRQETTF
IAPGMMWHDL TSKQFGRTLT VVYATPIRKG ECRLFARFPF KFSSPLPKFF IQLSPRWYSH
IGQNGVLEDD QIFLHYQERY LEAKGGSANF SKAFYLPTKA DLFVFELRSW VNKYNAQLFP
NATLSSALNS EILLDRYHSH TKKCSSCRRA LKNLQRIKVG VVLVTSFIWA SIFFMLLILD
DFNMTLMTFL ILSLPVGVVF WLLLSKLEKQ FYQGREIPPR NLLNN