Gene Tery_1737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1737 
Symbol 
ID4245394 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2643845 
End bp2645032 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content39% 
IMG OID638106862 
Productglycosyl transferase, group 1 
Protein accessionYP_721471 
Protein GI113475410 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAATAC TACTAGTCTG CACAGAAAAA TTGCCAGTGC CTTGTATCCG AGGCGGGGCT 
ATTCAGACAT ATATAGATGG TATTCTACCT TTCCTGAAGC GAGATCATGA AGTTACAGTA
TTCTCTGTAG CTGATCCAGA ACTACCAGAT CAGGAAATTA GGGACAATAT TCTCTATAAA
CGTTCTAGTA GAAAAACTTC AGAGGAATAC TATCACGCTG TAACCAATTT TGTAGCAGGA
CGAGAGTTTG ATTGGATAGT AATTTACAAT CGTCCTAAGT ATTTACCAAT GGTGGCTGAA
GTTGCACCAA ACAGTCGTTT TATACTGAGT ATGCACAATG AAATGTTCCA TGCTAAGAAG
ATTGAACCTG AAGAAGCAAT TCTATGTTTG GAACGAGTGG AAAAAGTGGT GACAGTTAGT
AAGTTTATTG CTGATGGAAT AGCTAAATTA TTTCCAGGAT ATGAACATAA GTTAACACCT
GTATATGCAG GAGTAGACCT AAAGCTCTTT CAGCCTAGGT GGATAGAAGG ATTAGAAGGG
AAACGAAAAG AAAAGTTGGC GGCTCTGGGT TTAGAAGATA AACAAGTGAT CCTTTATGTG
GGCAGGTTAA CAGATAAAAA AGGGCCTCAT TTGTTGATAT CTGCTATGAC CAAAGTTATC
AAGAAACACC CATCTGCTGT ATTATTGCTA GTGGGTAGTA AATGGTATGG TAATAATGAG
GAAAACGATT ATGTCCGTGA AATTAAGGTC AAGGCTGAAC AATTGGGAGG AGCAGTTCAG
ATGACTGGTT TTATTCCCCC ATATGAAGTT GCAGATTATT TCTTATTAGG TGATGTATTT
GTATGTGCAT CTCAATGGGA AGAGCCTCTA GCTAGGGTGC ATTATGAGGC TATGGCAACT
GGGTTATGTA CTATAACTAC TGGTAGGGGA GGAAACCCGG AAGTAATTAT TCCTGGTAAG
AATGGTATTG TGATCACAGA CTATGAAAAT TCGGGTGCAT TTGCAGATTG TATAGATTAT
TTGTTGTCTA TGCCAAACAA GAGAGAAGAA ATGGGGAAAA GAGGGCGTGA GCTAGCGGAG
CTATATTACA GTTGGTCAAG GGTGGCTTGG GATATTTTAA GTATTATTAA TGATTCAACA
TCAATGTATT CATCTTCATC TTTAGAAAGA TTCGGACAAA CAGGATAA
 
Protein sequence
MKILLVCTEK LPVPCIRGGA IQTYIDGILP FLKRDHEVTV FSVADPELPD QEIRDNILYK 
RSSRKTSEEY YHAVTNFVAG REFDWIVIYN RPKYLPMVAE VAPNSRFILS MHNEMFHAKK
IEPEEAILCL ERVEKVVTVS KFIADGIAKL FPGYEHKLTP VYAGVDLKLF QPRWIEGLEG
KRKEKLAALG LEDKQVILYV GRLTDKKGPH LLISAMTKVI KKHPSAVLLL VGSKWYGNNE
ENDYVREIKV KAEQLGGAVQ MTGFIPPYEV ADYFLLGDVF VCASQWEEPL ARVHYEAMAT
GLCTITTGRG GNPEVIIPGK NGIVITDYEN SGAFADCIDY LLSMPNKREE MGKRGRELAE
LYYSWSRVAW DILSIINDST SMYSSSSLER FGQTG