Gene Tery_4767 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4767 
Symbol 
ID4246421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp7320747 
End bp7322795 
Gene Length2049 bp 
Protein Length682 aa 
Translation table11 
GC content31% 
IMG OID638109618 
Productsulfotransferase 
Protein accessionYP_724194 
Protein GI113478133 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.287127 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTACT ATGAATTAGG AGAAAAATTT CTAGAACAAC AACAGTGGGA GAAAGCTGTT 
ACTAGCTACC GCCAAGCTAT AAAATTAAAC CCTACTTTTT CTTGGCATTA CTACAAATTA
GGGCAAGCTT TAACTCAACT ACAAAAATGG GATGAAGCTA TTACTAATTA CCAAAAAGCT
ATAGAACTAA ATTCTGATTT TCCTTGGTCT TATCATCACT TAGGAAATGC TTTACTAAAA
CAGGAAAAAT GGGAAGAAGC AGTTAATGCT TACCACAATT TTATTAAACT TAATTCTGAT
AATTATTGGG CTTACCATAA ATTAGGAGAA GCCTTATTTA AAATAGGAGA ATTTGACGCC
GCAATTATTT CCTATCAAAA AGCCATTAAA ATTAATCCAG AAATTAAAGG AACTCATCAA
AAATTAGCAG ATATTTTATT TCATATTGGT CAGCTAGAAG CAGCAGAAAT CGCATATCGT
AAGGCCATAA AACTCAATCC AGAAGTAGTT TGGTATCGCC AATGTTTAGG AGATGTTTTA
CTTAGACAAA AAAGATTAGA CGAAGCGATC GCTACTTATT TAGAAGTTGC TAAAGTTAGC
CATAATCTAA CTTGGATGCA CCTACAGTTA GGAGATGCTT TTGCTCAACT TTGTCAATCA
AATTTAGACG AAACTATCAA TTACTATTGC CAAGCCATTA AAAATCCCAA TCAATATCCG
ATTTATCAAA AATCACTATA TTTAATCAAA ACAAATCCAG AAATATATCT AGAATTAGGA
AATTATCTAG CACAAAAAAA TCAAATTTAT GGAGCCATAA TTATTTATTA CTTGTTGCTA
GAAATATTAC CCATTCAAAT AAATATTCAT CAGCAACTTG ACCAGATATT CCGGCAAAAA
AATCAACTAG AACAACAGCT AAATTCATTT TATTCTGCCG TTAAAACTGA CGGAAATAGT
CAATCTTACT ATCACCTAGG ATTAGCATTA ACTAAACAAC AAAAATGGTT AGAAGCTACC
ATAATCTACC ATCGTGCTAT AGAAATTAAT CCTGATTTTG CTTGGTGGTC TGATATTCGT
CTGTGGGAAA CTTTTAGAAA GCAAGATAAA CTCCAAAAAA TAGTTGATTT GTTTCAAGAA
TTTATTAGTT TTCCAAATAA CTCACTCTGT CGTTATCTCA ATTTAGCTGA AGCTCTAACT
CAAGTAAATA GAAATACTGA AGCACTAGAA ATTTATCAAA ATGCTTCTCA ACAACAAATT
CAACAAAACT ATCCTAACTT TTTTCTAAAA TCCCAAAAGT CCCCTCAAAT ATTAGGACCT
AATTTTCTAG TAATTGGAGT TAAGAAAGGA GGAACTACAT CTATTTATCA TTATCTAATT
CAACATCCAC AAATTTTACC TGGAATCAAA AAAGAAATTG ATTTTTGGTC TTTCTATTTC
CATCGAGGTT TAGATTGGTA TCGGGCACAT TTTCCATCAA TTCCAGAGTC AGAAAAATAC
TTAACTGGAG AAGCAAGTCC TAGTTATTTT GATGCTCCAG ATGTTCCCGC TAGACTATTT
CATTTTTTCC CCAGAATTAA ACTAATTGTT TTATTAAGAA ATCCAGTGGA TAGAACTATA
TCTAATTACT ATCATGAAGT ACGTTCACAA GCAGAGAGCA TGTCTATTGA AGAGGTAATT
AATTCTAGGC TGGAAAAACT GAACAAAATT TCATCTAGTT TTATCACAGA AAAAGATTAT
TGGAATTATC AGGGAGATTA TATAGCTTCT AGTATTTATT TAGATTGGCT GAAGAAATGG
TTGAATATTT TTCCCAGAGA ACAATTATTA ATTTTGCCTA GTGAAAAATT TTATAGTCAG
CCAAAAACCA TAATGAAGCA AGTTTTCAAT TTTCTAGATT TGCCAGATTA TCAAATACAA
GATTATCCAA AATTTAATGC TGCTTCTTAT GCGTCTATTA GTAAATCACT TCGACAAAAA
TTAAACGATT ATTTTCAGTC TCATAATCAG CGTTTAGAGG AATATCTGGG TATAAAATTT
GGTTGGTAA
 
Protein sequence
MNYYELGEKF LEQQQWEKAV TSYRQAIKLN PTFSWHYYKL GQALTQLQKW DEAITNYQKA 
IELNSDFPWS YHHLGNALLK QEKWEEAVNA YHNFIKLNSD NYWAYHKLGE ALFKIGEFDA
AIISYQKAIK INPEIKGTHQ KLADILFHIG QLEAAEIAYR KAIKLNPEVV WYRQCLGDVL
LRQKRLDEAI ATYLEVAKVS HNLTWMHLQL GDAFAQLCQS NLDETINYYC QAIKNPNQYP
IYQKSLYLIK TNPEIYLELG NYLAQKNQIY GAIIIYYLLL EILPIQINIH QQLDQIFRQK
NQLEQQLNSF YSAVKTDGNS QSYYHLGLAL TKQQKWLEAT IIYHRAIEIN PDFAWWSDIR
LWETFRKQDK LQKIVDLFQE FISFPNNSLC RYLNLAEALT QVNRNTEALE IYQNASQQQI
QQNYPNFFLK SQKSPQILGP NFLVIGVKKG GTTSIYHYLI QHPQILPGIK KEIDFWSFYF
HRGLDWYRAH FPSIPESEKY LTGEASPSYF DAPDVPARLF HFFPRIKLIV LLRNPVDRTI
SNYYHEVRSQ AESMSIEEVI NSRLEKLNKI SSSFITEKDY WNYQGDYIAS SIYLDWLKKW
LNIFPREQLL ILPSEKFYSQ PKTIMKQVFN FLDLPDYQIQ DYPKFNAASY ASISKSLRQK
LNDYFQSHNQ RLEEYLGIKF GW