Gene Tery_4437 details


Gene Information

Locus tag: Tery_4437
Symbol:
ID: 4246090
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6839236
End bp: 6842421
Gene Length: 3186 bp
Protein Length: 1061 aa
Translation table: 11
GC content: 41%
IMG OID: 638109320
Product: glycosyl transferase family protein
Protein accession: YP_723897
Protein GI: 113477836
COG category: [N] Cell motility
    [R] General function prediction only
    [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1216] Predicted glycosyltransferases
    [COG3063] Tfp pilus assembly protein PilF
TIGRFAM ID:
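
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it reproduces the listed gene length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 6839236
END_BP = 6842421


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 3186 1061
```

Both results match the Gene Length (3186 bp) and Protein Length (1061 aa) fields.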


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTCTTA CTTCTTTTGC TAAAGGAAAC AAACTCTTAC GGGAGGGAAA ATTCGAGTCG 
GCGATCGCTT ATTATCAAAA AGCCATAGAA GAAAATCCCC AGTTTACCTG GTCTTACCAA
AATTTAGGGG AAGCTCTTGA GAAAACTGGG CGGATAGAAG AAGCGATCGC TTCTTTTCGT
CAGGCTGTGG CCATAGATCC GCAATCCCAC TGTTTTCTAT ACAAATTGGG GATAACATTG
AGTCGGCAAG GCCAGTTTCA GGAAGCTGTG GGTTACTTAC GTCGGGCGAT CGATTTAAAC
AAAAATGTGC CTGAGTTTTA TCTAGGTTTG GGAGCTGCCT TGGTGAAGTT GAGGCAATGG
TCTGAAGCAG TTGAATGTAT TCATCAGGCA TTGAGGGTGT TGGATGAAAA AGTAGGAACG
TTATATGAAA GAGCTCTACA GGCAGAGGGT TATTTTTATT TGGCAGAAGC TAAGTCTGGT
CAAGAGCAAT GGTCTGATGC AATAAAGTTA TATTCTCAGA GTTGGGAAAT TTATCCATAT
CGGGTTAACT GCTGTATCAG TTGGGCAGTA GCTTTAGGTA AGTTAGGAAG ATGGAGTGAA
GCGGTGGCTT TATATCGTCA AGCTGTAGCT TTCTCTGGGG AGTCTGGTGA GGTGTACTTT
GGTTTAGGGA AGGCTTTGGG ACAGTTAAAA CAATGGGAGG AGGCTGTTGT TGAGTATCGA
CGAGGGATAG ATTTTGGTTT TGATGGGGCG GAGGTGCGCC ATTCTCTGGG GTATGCTTTC
CTACAGTTAA AAAAATGGGA GGAGGCTATT GTTGAGTATC GTTTGGTGGT AGAGGTTGAT
CCTAAGTTCG CACCAGTTCG GCACCAGCTT GGGTATGCTT TGATGCAATT GGAGCATTGG
GAGGAGGCTG TGATTGAGCT GCGTCAAGCT GTGGAGTTAT ATCCTAGGTC GGCTATAGTT
TGGCAGCAGT TGGGAGATGT TTTATGGCAG TTGGAGGAAG ATGGGGAGGC TGAAGAAGCT
TATCAAAAGG CGACAGAACT AAATCCTGAC ATGGCTAATT TGCCAAAAGC AAAAAGTGTG
GCAAGTGCAC TTTCCAAGTC AGAGATAGGG ACAATAAATA ATACAGATCG GTTTGTTCGG
TTAGTACACG AAGCTGATAA GCGAGCTTTT ATAGACTACT CATACACAGA ACTATCCGCC
TCCCACAATT TCGACTTAAA TTGTCTTAAT CTACATTGGG TGACTTGTGA CTTTTCTCCC
GGTGCTGGGG GACACATGAC AATATTCCGA TTTATCAAAT TACTAGGTCA ACTCGGTCAT
CACAATACTC TCTGGATATA CCAACCAGTT GTTCATAAGT TTGAAACAGA GGCATTAGAG
ACTATATTAA AGTATTTTCA AACTTTACAG GTAGAAGTCA AGTTTATTAT CGGCAGGGAA
GAGTTTGAGA GTGCAGCAGG AGATGTTATT ATTGCCACAG ATTGGGGTTC GGTCCAGTTT
GCTGTCTCCA ATCCTAATTT CCACAACAGA TTTTACTTTG TTCAAGACTA CGAGCGGTTC
TTTTCTCCCC AAGGAACAAA AGCACTTTTG GCTGATTTGA CTTATAGTTA TAAACTAGAT
TGCATTTGTG CAGGGCCGTG GCTAGAAAAA ATAATGTCTG AAAAATACGG TTCTTGGGCT
TGTAAGTTTT GGCTAGCAGT GGATACTTCG GTTTATTTTC CTCAAACAGA TGAAAAAGTT
AATGATGTTG TCAAAATTGC CTTTTATTAT CGTCGTGGCA CAGAGAGAAG AGCTGTTGAA
TTGGGGTTGC TGGCATTAGA AAAGTTAGCA ACTTATAGAG AAGATTTTGA AGTGCATTTC
TTTGGGGGGA ATACTAATTT TGATCGTGCA CCTTTTCAGT TTAAGTCTCA TGGAATTTTA
ACTGCCCAAC AGCTAAGGGA ACTTTACCAA GATAGTGACA TTGGTATTGT ATTTTCATCA
ACTAATTATT CATTAGTTCC TCAAGAGATG ATGGCTTGTG GTTTACCAGT GATAGAACTG
GCTGGTGAAA GTACGGAAGT TGTATTTCCC CCAGGGGTAG TAAGGTTAGC CGGTCCGGCT
CCTCTGGATA TCACTGATGC TATTGTAGAG TTAATGGACT CAAAAACTCC CCGTGAAGAA
CAAGCACATT TAGCAACAGA ATGGGTAAAA CAGTTTACAT GGGAACAAGA AGTAGCTAAG
ATAAACGGTT TTATTCAAAA TAGGCTTCTT GAAAAAAAGC CAAATTCCAT AGTTGTTAAG
GCAAAACCCT CTAAACCAAA AGCTTCTGTT TTTATACCAA CCTTGAACGG TGGTGAGTTG
CTTAAACAGG TAATAGAAAG GGTGAAGGAG CAAGTCACAC CGTGGTTGTT TGAGATTGTA
GTTATTGATA GTGGCTCAAC AGATGGAACC TTAGAATGGA TGAAGGCAGA CCCAGTAATT
AGACTTTACG AGATACCTAA GTCCGAGTTT CAGCATGGTA AGACTAGGAA CTTGGGGGCA
TCTTTGTCTG AAGGGGAAAA TATTGCTTTT TTGACCCATG ACGCTCTGCC AGTGGATAAA
AATTGGTTAT ATTATCTGGT GACTACACTA GAAAATTTTC CTAATGCGGC AGGGATATTT
GGCAAGCATT TAGCATATCC TGATGCAGAT GCTTTTACTA AGCGAGACCT GGAAAACCAT
TTTCAGATTT TTGATGAACT ACCAGTGTAC CTTGATAAGA ATACAAATTT CAAGCTATAT
AAGAACAAAG ATTTGTCCTG GAAACAAAAA CTTCATTTTT ACAGCGACAA TAATTCCTGT
ATGCGTCGAT GTGTTTGGGA AAAAATTCCA TATCCAGAGA TTAGTTTTGG AGAGGATCAG
GCTTGGGCTT GGCAAGTTAT TGAGGCGGGG TATGGAAAGG TGTATGGGAG AGATGCAGTG
GTGTATCATT CACATAATTT TTTGCCGGAG GAGATTTTCA GTCGCAGTCT GGAGGAGGCT
TCATTTTTCC AAAAAACTTT TGGGTATGAG TTGGTTAATA AAGACAACAT TTACGAACAA
ATAAAGTTGC TAAATGAACA TGATAGCCAG TGGGGAAGGG ATAAAAATTT AGATGAAAAA
GTAATTATAA TGAGACAAAA AAATAATGAG GCAAGAATAC ATGGCTATAT GGCAGCTTTG
AAGTAG
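
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before any computation. The following is a minimal sketch of how a reader might do that and recompute the GC content reported above (41%). The helper names `join_blocks` and `gc_percent` are hypothetical, and only the first displayed line is used here as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence, used as sample input.
first_line = "ATGCTTCTTA CTTCTTTTGC TAAAGGAAAC AAACTCTTAC GGGAGGGAAA ATTCGAGTCG"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence through `join_blocks` should give a string of 3186 bases if the listing is complete.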
 
Protein sequence
MLLTSFAKGN KLLREGKFES AIAYYQKAIE ENPQFTWSYQ NLGEALEKTG RIEEAIASFR 
QAVAIDPQSH CFLYKLGITL SRQGQFQEAV GYLRRAIDLN KNVPEFYLGL GAALVKLRQW
SEAVECIHQA LRVLDEKVGT LYERALQAEG YFYLAEAKSG QEQWSDAIKL YSQSWEIYPY
RVNCCISWAV ALGKLGRWSE AVALYRQAVA FSGESGEVYF GLGKALGQLK QWEEAVVEYR
RGIDFGFDGA EVRHSLGYAF LQLKKWEEAI VEYRLVVEVD PKFAPVRHQL GYALMQLEHW
EEAVIELRQA VELYPRSAIV WQQLGDVLWQ LEEDGEAEEA YQKATELNPD MANLPKAKSV
ASALSKSEIG TINNTDRFVR LVHEADKRAF IDYSYTELSA SHNFDLNCLN LHWVTCDFSP
GAGGHMTIFR FIKLLGQLGH HNTLWIYQPV VHKFETEALE TILKYFQTLQ VEVKFIIGRE
EFESAAGDVI IATDWGSVQF AVSNPNFHNR FYFVQDYERF FSPQGTKALL ADLTYSYKLD
CICAGPWLEK IMSEKYGSWA CKFWLAVDTS VYFPQTDEKV NDVVKIAFYY RRGTERRAVE
LGLLALEKLA TYREDFEVHF FGGNTNFDRA PFQFKSHGIL TAQQLRELYQ DSDIGIVFSS
TNYSLVPQEM MACGLPVIEL AGESTEVVFP PGVVRLAGPA PLDITDAIVE LMDSKTPREE
QAHLATEWVK QFTWEQEVAK INGFIQNRLL EKKPNSIVVK AKPSKPKASV FIPTLNGGEL
LKQVIERVKE QVTPWLFEIV VIDSGSTDGT LEWMKADPVI RLYEIPKSEF QHGKTRNLGA
SLSEGENIAF LTHDALPVDK NWLYYLVTTL ENFPNAAGIF GKHLAYPDAD AFTKRDLENH
FQIFDELPVY LDKNTNFKLY KNKDLSWKQK LHFYSDNNSC MRRCVWEKIP YPEISFGEDQ
AWAWQVIEAG YGKVYGRDAV VYHSHNFLPE EIFSRSLEEA SFFQKTFGYE LVNKDNIYEQ
IKLLNEHDSQ WGRDKNLDEK VIIMRQKNNE ARIHGYMAAL K
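
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below builds the codon table by hand. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons. The function name `translate` is illustrative, and the input is assumed to contain only uppercase A, C, G, and T.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence, with the display spaces removed.
print(translate("ATGCTTCTTACTTCTTTTGCTAAAGGAAAC"))  # MLLTSFAKGN
```

The output matches the first block of the protein sequence. The final six bases of the gene, AAGTAG, translate to K followed by a stop, which agrees with the last residue listed.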