Gene Tery_2111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2111 
Symbol 
ID4243947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3296981 
End bp3298258 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content41% 
IMG OID638107219 
Productglycosyl transferase family protein 
Protein accessionYP_721820 
Protein GI113475759 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID[TIGR01426] glycosyltransferase, MGT family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000775281 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.550046 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCATT TTGGTCTGAT TTGTCCGGCA TTGACAGGAC ACCTAAATCC AATGCTTCCC 
ATAGGACAAG AATTAAAAAG GCGTGGTCAT CGTGTCACGA CGATAGGGAT ACTTGACGCT
GAAGCTAAGA CACTAGCAGC AGGATTAGAA TTTGTTGCCT ATGGCACGGA AGAATATTCT
AAGGGCAGCA CAGCAGAAGC TTTAAATCAC CTGAGTAAAC TCAGCGGGTT AGCTGCATTT
CGCTATACAA TTACACTATT AAAAGACTGG ACAAATGTTT TGCTTCGGGA TGCTCCGCAA
GTCATCAAAA ATGCTGGTGT AGATGCTTTG TTAATCGACC AGGCTTCATT AGGAGAATCT
ATAGGGGATT TTCTAGACAT TCCCTTTATT ACTATTTGTA GTGCACTGGT ACTCAATCAA
GATGAGAATG TTCCCCACCC TGTAAGCAAC TGGAAATATA ACCCCGCCTG GTGGGCAAAA
CTGCGTAATA GAGCTACTTG GAGTTTCTAT CAAATCTTAG GCAAACCTAT TAACAAGGTA
GTAGCTGAGT ATCGTCGTCA ATGGAATTTA CCTTTGTACT CTGACCCCAA TGATGCTTAT
TCTCTACTGG CTCAAATTAG TCAGCAACCT GCTGAGTTAG AATTTCCCAG AGAAAATTTA
CCTAAGTGTT TTCATTTCAC AGGACCTTAT CATTATTCAG GTACTAGAGA ACCTGTTTCC
TTTCCTTGGG AACAGTTGAC AGGTAAACCT TTAATTTATG CCTCTATGGG AACTATACAA
AATCGTTTGG TTGAGGTATT TTATCAAATT ACAGCAGCTT GTGAGGGGTT GGATGCTCAG
TTAGTTATTT CTCTGGGAGG TTCTGCCACT CCAGAATCTC TACCCAACTT AGCAGGAAAT
CCTCTAGTTG TTGAATATGC ACCCCAATTA GAAATACTGC AAAAAGCTAC TCTCACTATT
ACTCATGCAG GTATGAATAC AACTCTAGAA TGTTTAAGTA ATGCAGTACC AATGGTTGCT
ATTCCTATTG CTAACGATCA ACCAGGAGTA GCGGCACGAA TAGCTTGGGC TGGAGCTGGA
GTAGCGATAA CACTGAAACG TTTAACAGTA CCTCGGTTAC GAACAGCTAT TTCTCAGGTG
CTCACACAAC CGTCATATAA GCAAAATGCT TTGAGATTAC AGAAAGCAAT TAAACGAGCA
GGTGGAGTCA CTCGTGCTGC TGATATTATT GAACAGGCAG TATCAACAGG TAAACCAGTT
TTAACAGGAA CTATATAA
 
Protein sequence
MTHFGLICPA LTGHLNPMLP IGQELKRRGH RVTTIGILDA EAKTLAAGLE FVAYGTEEYS 
KGSTAEALNH LSKLSGLAAF RYTITLLKDW TNVLLRDAPQ VIKNAGVDAL LIDQASLGES
IGDFLDIPFI TICSALVLNQ DENVPHPVSN WKYNPAWWAK LRNRATWSFY QILGKPINKV
VAEYRRQWNL PLYSDPNDAY SLLAQISQQP AELEFPRENL PKCFHFTGPY HYSGTREPVS
FPWEQLTGKP LIYASMGTIQ NRLVEVFYQI TAACEGLDAQ LVISLGGSAT PESLPNLAGN
PLVVEYAPQL EILQKATLTI THAGMNTTLE CLSNAVPMVA IPIANDQPGV AARIAWAGAG
VAITLKRLTV PRLRTAISQV LTQPSYKQNA LRLQKAIKRA GGVTRAADII EQAVSTGKPV
LTGTI