Gene Tery_2950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2950 
Symbol 
ID4245292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp4584744 
End bp4585874 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content30% 
IMG OID638107989 
Productglycosyl transferase family protein 
Protein accessionYP_722586 
Protein GI113476525 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.268496 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTATAA ACAATTTTCC CAAAGTTACT GTTTGTTTGC CTACTTATAA CTCTGGAGAA 
TTTCTTAGAT ATGCAATAGA CAGTATTCTT GAACAGACAT TTACAGATTT TGAGCTGATT
ATTTCTGATG ACTGTTCTAC TGATAATACT CCAGAAATTA TTAGGAGTTA TTTGGAGAAA
GATAGTAGAA TTCAATACTT ACAAAATTCT CACAACTTAG GACTTTTTCC TAATTGGAAT
CGTTGTTTAG AATCTGCTTC TGGAGAATAT ATTACTGTCT TTGCTCAGGA TGATGTGATG
TTGCCAAAAA ATTTAGAGCA AAAGGTAAAA ATTCTAGAAA AATACCAAAA TGTTGGTTTA
GTTACTTCCT CTATTATGGT GGTGGATAGT GATAATAATT ATTTGAATTG GGATTGGGCA
AATTATGATG AGGATAGCTT AGTTAATGGT GAGGAATGGG TTAAGAAAAA TTTGGGAAAA
GCTAATCCTA TTTGTTGTCC GTTTGTATTG ATTAGAAGAT ATATTTTAGA AAAAGTTGGT
GGAAAATTTA ATGACAATTA TCTTTTTGCT GGAGATTTAG AATTATGGTT AAGAATTGCT
TTGGTTGCTG ATTTGTATTT TGTTAAAGAA ATCTTGGGAT ATTATCGCTG GCATAAAGAA
AATAAAACTC ATAGTTTTAA TGATTTTGAT CAGGTTAAGG AACATTTACA AATTTGTAGT
AATTTAATTG ATAGTTTAAA TTTATCAGAT TTAGAATTAA ATTATTGGGA GACTGAGGTG
TTATCTCGAA CTGTTAAATG GGTTAGTTAT TATCGAATTT ATCGTCATTT AGAAATTTCA
AATTTTGATG AAGCATTAAA ATTATGTGAG TTACTGGAAA GTTGGCGAGG TAGGTCGGGA
AAATTAGGTA TTTCTGTGCA GGAATTAGGT ACTAGAATAC AGAAAATTTT GCAAGTAAAT
TCTCGCCTCC ATTCAGAGAT TAATGAATAT AGTACCTGGG TAAATAATCT TGAGGGAAAA
AATTCTGCTT TAGAAAGAGA AAAATCTTGG CTAGAATCTC AGGTAAAAGC TTGGATGCAA
ACTGCACAAA AGTATTATCA TAAAATAAAA GAAAGTGGGA ATTGTTTATA G
 
Protein sequence
MPINNFPKVT VCLPTYNSGE FLRYAIDSIL EQTFTDFELI ISDDCSTDNT PEIIRSYLEK 
DSRIQYLQNS HNLGLFPNWN RCLESASGEY ITVFAQDDVM LPKNLEQKVK ILEKYQNVGL
VTSSIMVVDS DNNYLNWDWA NYDEDSLVNG EEWVKKNLGK ANPICCPFVL IRRYILEKVG
GKFNDNYLFA GDLELWLRIA LVADLYFVKE ILGYYRWHKE NKTHSFNDFD QVKEHLQICS
NLIDSLNLSD LELNYWETEV LSRTVKWVSY YRIYRHLEIS NFDEALKLCE LLESWRGRSG
KLGISVQELG TRIQKILQVN SRLHSEINEY STWVNNLEGK NSALEREKSW LESQVKAWMQ
TAQKYYHKIK ESGNCL