Gene Tery_2067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2067 
Symbol 
ID4245715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3230557 
End bp3231741 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content36% 
IMG OID638107178 
Productglycosyl transferase, group 1 
Protein accessionYP_721781 
Protein GI113475720 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0419406 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.401472 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAATAG TACAAATATT ACCATCTATA TCTCTTGTTT ATGGCGGTCC GAGTCAAATG 
GTTATAGGAC TTTCTACCGC TCTTGCCACT CAGAACATAG ATGTTACCAT TCTCACCACA
AATTCTAATG GAGACACTGG TCAGCCACCC TTAGATGTTC CTATCAATAA ACCTGTCAAA
CAAAATGGTT ATCAAATTCG TTATTTTCCT TGTTCTCCGT TTCGTCGTTA TAAATTTTCT
CTACAATTAT TACAATGGTT AAATGAACAC GCTACTGAAT TTGATTTAGC TCATATTCAT
GCTCTTTTTT CACCAGTAAC AACCATAGCT GCAACTGTTG CTAGAACCAA TAACTTACCC
TATATTTTAA GACCATTAGG AACCTTAGAC CCCGCTGATC TACGCAAGAA AAAACAACTC
AAAAAAATTT ATGTTTCTCT CTTAGAAAAG CGGAATATTG CTCATGCTGC TGCCCTTCAT
TTTACCACAA CACAAGAGGC AAAAGTTTCC GAAAGATTTG GCTTATCTAC AAAAGACTTA
GTAATTCCCA ATGGAGTCAA TACTCTAGAG AATATTCAAG ATGAAAATTT AGTTAATAGT
CTCCGATCTC AAGGAGTAGA AGTGAAACAT CCCATAATTT TATTTATGTC TCGCATTGAA
CCAAAAAAAG GACTAGATTT ATTATTACCT GCTTTAGAAA AATTGTTAGC ACAAGGGGTA
GATTTTCAAT TTATCTTAGC AGGTGCAAAT CCTCAAGATC CTAATTATGA GGCACAAATT
TACTCACAAA TAAAGGCTTC ACCTATTGCT AAGTTTACCA AAATAATGGG GTTTGTTACA
GGGGAAATAA AGACAAGTTT ATTAAGAATT GCTGATTTAT TTGTACTACC TTCTTATTAT
GAAAACTTTG GTATTGCAGT AGCAGAGGCT ATGATAGCAG GTACCCCCGT AGTGATTTCA
GACCAAGTTT ATATTTATCA AGATGTAGCA AATGCAGAAG CAGGTTGGGT TGGTGGTTGC
AAAACAGAAG ACATGGCTGC TTTAATGAAA TTAGCTTTGC AGGATGAAGC AGAGAGAAAA
CGCCGGGGTT TGAATGCTCA AGAGTTAGCG AAAAATAATT ATAGTTGGCA AGCGATCGCC
ACACAAACCA TTCAAGCCTA TGAAAAAATT ATTTCATGTA AATAA
 
Protein sequence
MRIVQILPSI SLVYGGPSQM VIGLSTALAT QNIDVTILTT NSNGDTGQPP LDVPINKPVK 
QNGYQIRYFP CSPFRRYKFS LQLLQWLNEH ATEFDLAHIH ALFSPVTTIA ATVARTNNLP
YILRPLGTLD PADLRKKKQL KKIYVSLLEK RNIAHAAALH FTTTQEAKVS ERFGLSTKDL
VIPNGVNTLE NIQDENLVNS LRSQGVEVKH PIILFMSRIE PKKGLDLLLP ALEKLLAQGV
DFQFILAGAN PQDPNYEAQI YSQIKASPIA KFTKIMGFVT GEIKTSLLRI ADLFVLPSYY
ENFGIAVAEA MIAGTPVVIS DQVYIYQDVA NAEAGWVGGC KTEDMAALMK LALQDEAERK
RRGLNAQELA KNNYSWQAIA TQTIQAYEKI ISCK