Gene Tery_2749 details

Gene Information

Locus tag: Tery_2749
Symbol:
ID: 4244782
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 4257314
End bp: 4258822
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 38%
IMG OID: 638107808
Product: glycosyl transferase family protein
Protein accession: YP_722405
Protein GI: 113476344
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
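
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the protein length excludes the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields; names are illustrative.
start_bp = 4257314
end_bp = 4258822

# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
gene_length_bp = end_bp - start_bp + 1

# A CDS length is a multiple of 3; the final codon is the stop codon,
# which is not counted in the protein length.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1509 bp) and Protein Length (502 aa) fields.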


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.924892
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.61972
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGACA GTTATTTACC CAAAGACAGT TACTATCAGG AAACCAAAAT TTATCAAAAC 
GATACCTCAG ATAATGAACA GGAAAATTAT CAGGTATCAG AAAGGCAGAA AACTTTAGAT
GTTAACCAGA TAGAGCAAAA AATAGAAGTG GTGGAGAAAA AAAAGTTTCA GGATATTTAT
GATGGTCGTA GATGTAAAGC AGCACTTATG CTATGGCTTA TCTGGACTAC TACTATTATC
CTACATTTGC TTTCGTGGGG ATATTGGATA ATTCTGGGTT TGACAGGTTT ACTGTCAGTT
CAATTTTTGA GAATACTATT TGCTAAACCA AAATTAGCTC CAAAAACTCT GTCAGAAGAA
AATTTTACTG AATGGCCTTA TATATCTCTG TTAGTAGCCG CTAAAAACGA AGAAGCTGTA
ATCAGAAAGT TGGTAAAAAA TATGCTGGCT TTAGATTATC CTACTAATAG TTATGAACTT
TGGGTGATAG ATGACAATAG TACGGATAAA ACCCCTTTAT TATTAGAACA ATTGGCCCGG
GAATATGAAC AGCTAAAAGT GATTAGAAGA AGTCCAGATG CTGGGGGTGG TAAGTCAGGG
GCTCTAAATG CTGCTATACC TTTTGTGAAG GGAAAAATTT TAGGAGTATT TGATGCAGAT
GCACAAGTAA CACCAGATCT ACTCCAAAAG GTAGTACCAC TTTTTGCTAG GGAAGAAGTA
GGAGCAGTAC AAATCAGAAA GGCGATCGCT AATGCAGGTA TAAACTTTTG GACGAAGGGA
CAATCAGCAG AAATGGTTGT GGATGGTTTT TTTCAGGAAC AGCGAATTGC CATTGGCGGG
ATTGGAGAGC TCAGAGGAAA TGGCCAGTTT GTACGAATGA ATGCTCTGGA AGAATGTGGA
GGGTGGAATG AACAGACTAT TACTGATGAT TTAGATTTAA CTATTCGCCT ACACTTAAAC
CAATGGGATA TAGATTATCT GGCTTTTCCG GCAGTAACGG AGGAGGGAGT AACTAGCCCT
ATAGCTTTGT GGCATCAACG CTCGCGATGG GCAGAAGGAG GATATCAACG GTATTTAGAC
TACTGGAAAT TGATTTTGCG TAACCGGATG AGATTTAGTA AAACTTGGGA TTTATGGCAA
TTTTTGGTAA CACAATATCT ATTATCAGTT GCTGCTGTGC CTGATTTTTT AATGTCAATA
ATCTTACGTC GTTTACCAAT AACAAGTCCT TTAACTGTGT TTACTGTTAT GGTCTCTTTG
CTAGGTATGT TTATAGGTTT ACGCCGAACT CGGAAACAAC AGATGAACTT AGCAAAGGAG
GAAAAGGTGA TGGAGTTTAA TTCGAGTAAA GATAATCCAT TGTCTTTATT TCTAACTTTA
CTAGAAAGTG TGCGGGGAAC TTTTTATATG TTGCATTGGT TTGTAGTTAT GGGTGTTACT
ATTGCTCGAA TGTCTATATT ACCCAAGAGA CTAAAATGGG TAAAAACAGT TCATAGAGGT
GATGAATAA
 
Protein sequence
MPDSYLPKDS YYQETKIYQN DTSDNEQENY QVSERQKTLD VNQIEQKIEV VEKKKFQDIY 
DGRRCKAALM LWLIWTTTII LHLLSWGYWI ILGLTGLLSV QFLRILFAKP KLAPKTLSEE
NFTEWPYISL LVAAKNEEAV IRKLVKNMLA LDYPTNSYEL WVIDDNSTDK TPLLLEQLAR
EYEQLKVIRR SPDAGGGKSG ALNAAIPFVK GKILGVFDAD AQVTPDLLQK VVPLFAREEV
GAVQIRKAIA NAGINFWTKG QSAEMVVDGF FQEQRIAIGG IGELRGNGQF VRMNALEECG
GWNEQTITDD LDLTIRLHLN QWDIDYLAFP AVTEEGVTSP IALWHQRSRW AEGGYQRYLD
YWKLILRNRM RFSKTWDLWQ FLVTQYLLSV AAVPDFLMSI ILRRLPITSP LTVFTVMVSL
LGMFIGLRRT RKQQMNLAKE EKVMEFNSSK DNPLSLFLTL LESVRGTFYM LHWFVVMGVT
IARMSILPKR LKWVKTVHRG DE
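
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It builds a codon table from the standard amino-acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, not in how internal codons are read). The function and variable names are hypothetical, and only the first 30 bases and the last 9 bases of the gene sequence are used here rather than the full record.

```python
# Codon table built from the standard amino-acid assignment string,
# ordered by base T, C, A, G at each codon position. Translation
# table 11 uses the same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks and the final 9 bases of the gene sequence.
first_bases = "ATGCCGGACA GTTATTTACC CAAAGACAGT"
last_bases = "GATGAATAA"

print(translate(first_bases))
print(translate(last_bases))
```

The two fragments translate to the start (MPDSYLPKDS) and end (DE) of the listed protein sequence, with the terminal TAA read as the stop codon.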