Gene Tery_1778 details

Gene Information

Locus tag: Tery_1778
Symbol:
ID: 4242243
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 2711932
End bp: 2713704
Gene Length: 1773 bp
Protein Length: 590 aa
Translation table: 11
GC content: 42%
IMG OID: 638106902
Product: cytochrome-c oxidase
Protein accession: YP_721510
Protein GI: 113475449
COG category: [C] Energy production and conversion
COG ID: [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1
TIGRFAM ID: [TIGR02891] cytochrome c oxidase, subunit I
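
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS; the stop codon encodes no residue."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(2711932, 2713704)
print(length)                     # 1773
print(protein_length_aa(length))  # 590
```

Both results agree with the Gene Length and Protein Length fields of this record.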


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.307573
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACAAA CACAAATTCC TTTAGATACT CAATCAAAAC CGGAAAAACC ACACCCGAAG 
GCGTGGAAAT GGTATCACTA CTTCGGCTAT AACATAGACC ATAAAGTAAT TGGTATTCAA
TATCTTGTTA CTGCCTTCTT CTTTTATCTG CTTGGTGGTC TAATGGCAAT GGGCCTTAGA
GCAGAACTCT ATACGCCAGA TCCAGACGTT CTGGAACCAG ATATCTACAA TGCTTTCCTG
ACAAATCATG GCACTATAAT GATTTTCTTA TGGATTATTC CGGCTGCCAT CGGTGGTCTC
GGGAACTATC TCGTACCTCT GATGGTAGGA GCAAGAGATA TGGCATTTCC CAACTTGAAC
GCGATCGCCT TCTGGCTAAA CCCTCCTGCT GGAGCATTAT TAATTGCTAG TTTATCTCTG
GGAGGGGCAC AAAGTGGTTG GACAGCATAT CCACCCCTGA GCGTTATTAC CAATAACGGG
GGACAATCTT TATGGATTCT CAGTATTGTG TTAGTGGGAA CTTCTTCCAT TCTGGGTGCC
CTGAATTTTA TCGCTACGAT TCTGAAAATG AAAGTTCCTA GCATGAAATG GGATCAGCTT
CCCCTGTTCT GCTGGGCAAT TCTTGCTACT TCTGTTTTAG CTTTATGCTC CACTCCTGTT
TTAGCAGCAG GGTTGATCAT GTTATTATTT GATATCAACT TTGGCACCGG CTTCTTTAGA
CCAGAGATGG GCGGTGATGT AGTTGTATAC CAGCACTTAT TTTGGTTCTA TTCTCACCCG
GCAGTTTATC TGATGATTCT GCCAATATTC GGCATTCTCT CAGAAGTTAT TCCTATTCAC
GCTCGTAAAC CAATATTTGG TTATAAAGCG ATCGCCTACT CGAGTCTTGC TATTTGCTTA
GTAGGTTTGT TCGTATGGGT ACACCATATG TTCACCAGTG GAACCCCACC TTGGATGCGG
ATGTTCTTTA CCATCTCTAC TCTAATTGTT GCAGTTCCTA CAGGAATCAA AATCTTTAGT
TGGATAGCTA CTCTCTGGGG GGGAAAAATT CGCTTCACAA GTGGAATGCT TTTTGCCGTC
GGTTTATTAG CAATGTTCGT TATGGGAGGA CTAAGTGGGG TGACTCTGGG AACGGTTCCT
TTTGATGTTC ACGTTCATGA TAGTTACTAT GTAGTAGGTC ACTTCCATTA CGTTCTATTT
GGTGGTTCAG TATTTGGTCT GTATTCAGGT ATCTATCACT GGTTCCCCAA AATCACAGGC
CGAATGTACA ACGAAACCTG GGGTCGAATT CATTTTGTTC TCACTTTAAT TGGTACTAAC
CTCACTTTCT TACCCATGCA TCAATTGGGT TTACAAGGAA TGCCTCGTCG AATTGCTATG
TATGACCCTA AGTTTGAATC TCTGAATCAT ATCTGTACTT ATGGGTCAAT TCTCTTAGGT
TTATCTGTAC TTCCATTCTT AATTAACATG ATTTGGAGTT GGATGTATGG GCCAAAAGCT
GGGGATAATC CTTGGGGTGG TTTATCTTTA GAGTGGACTA CTAGCTCACC TCCCATTATT
GAAAACTGGG AAGTATTACC AGTAGTTACT GAGGGTCCTT ATGATTTTGG TTCTAAGAAA
ACTCTTGCTA TTCGAGAGCG GTTCAAGCTG ATTTCTACAC CAAGTTCAAA GGTTAAGGAA
ACAGGTGTAC CTGTTTCTGA AGCTGATTCT TCATCCTTAT ATAGTGGTTC CGGGTCTATT
AATGATTCTC AGGATCCTAC TAGGAAAAGT TAG
 
Protein sequence
MTQTQIPLDT QSKPEKPHPK AWKWYHYFGY NIDHKVIGIQ YLVTAFFFYL LGGLMAMGLR 
AELYTPDPDV LEPDIYNAFL TNHGTIMIFL WIIPAAIGGL GNYLVPLMVG ARDMAFPNLN
AIAFWLNPPA GALLIASLSL GGAQSGWTAY PPLSVITNNG GQSLWILSIV LVGTSSILGA
LNFIATILKM KVPSMKWDQL PLFCWAILAT SVLALCSTPV LAAGLIMLLF DINFGTGFFR
PEMGGDVVVY QHLFWFYSHP AVYLMILPIF GILSEVIPIH ARKPIFGYKA IAYSSLAICL
VGLFVWVHHM FTSGTPPWMR MFFTISTLIV AVPTGIKIFS WIATLWGGKI RFTSGMLFAV
GLLAMFVMGG LSGVTLGTVP FDVHVHDSYY VVGHFHYVLF GGSVFGLYSG IYHWFPKITG
RMYNETWGRI HFVLTLIGTN LTFLPMHQLG LQGMPRRIAM YDPKFESLNH ICTYGSILLG
LSVLPFLINM IWSWMYGPKA GDNPWGGLSL EWTTSSPPII ENWEVLPVVT EGPYDFGSKK
TLAIRERFKL ISTPSSKVKE TGVPVSEADS SSLYSGSGSI NDSQDPTRKS
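
The protein sequence can be checked against the gene sequence by translating it codon by codon, and the GC content field can be recomputed from the same string. The following is a minimal sketch using only the Python standard library. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is built from the amino-acid string of NCBI translation table 11, which assigns the same amino acids as the standard code. Alternative start codons are not handled specially, which is safe here because this gene begins with ATG.

```python
from itertools import product

# Codon table for NCBI translation table 11, in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGACACAAA CACAAATTCC TTTAGATACT"))  # MTQTQIPLDT
```

Passing the full gene sequence to `translate` should yield a string that can be compared with the protein sequence once its whitespace is removed with `clean`, and `gc_percent` on the same input can be compared with the GC content field.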