Gene Tery_0221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0221 
Symbol 
ID4241817 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp339480 
End bp340676 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content39% 
IMG OID638105565 
Producthypothetical protein 
Protein accessionYP_720182 
Protein GI113474121 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0510] Predicted choline kinase involved in LPS biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.676637 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.491604 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTTG TCTTAAACTC TCGAAATGTC TACGACTATT TAGTAGAAAA TGGTTTATGT 
GATTCCCTAC AGGAAGGGGA GCCATATCCT TCCCAGAACT CTCAGAGTAA TGTTGAGCTA
ATTAGTGCTA AGAATTTTAA TTTATTGGTG ACTTTGCCAG AGGGTCGAAA GCTTCTGGTT
AAACAAGAAC AATATAATTC TCAAGGAAAA ACCCTTGGTG AGTTGTTTGG AGAATGGCGA
ATTCAAAAAT TCTTACGGAC ATTTCCTGAA CTTGACCATT GGCGTGTCTT TCTGCCAGAG
TTGTTATTTC ACGACCCTGA AAAGTCTATT GTGGTCTCTA CTTACCTGGA TAATTATCGA
GATTTAAGTG ATTTCTATTC TAAGGAAAAT ACTTTCCCTA CAAAGGTTGC AGCTCAAATC
GGTAATTTTT TGGGGACTGT TCACCGCGAT ACTTGGAATA GAGAAAATTA TCGAGAGTTT
TTTGCTAGTG AAGTTGTTAG TACTAAACCT GCAGTCTCCC CACTTTTGGT GGATAGTCTC
GAAAGGATAG GACCAGACAT TTTTGGTATT GCTCCTGTAG ATGGGTTGAG ATTTTTTGCT
CTTTATCAAC GATATGACAG TTTGGGGAAA GCGATCGCTC AACTTCGCGA GACTTTATCT
CCTACTTGTC TGACTCACAA TGACCTGAAA CTAAATAATA TTCTCATGGC TCAAAATTGG
GAAAATTCCG GTGAAAATAT TGTGCGGTTA ATTGATTGGG AAAGGTCTAG TTGGGGAGAC
CCTGCTTTTG ATTTGGGAAC GGCTATCAGC AGCTATCTGC AACTTTGGTT GGGTAGTTTG
GTTATTAGTA ATTCTCTGAG TATAGAAGAA TCTCTCAAAT TTGCCACCAC TCCCCTTGAA
TTACTTCAAC CGTCTATTGC TGCTTTAGCT AAAGCTTATT TTGACACTTT CCCAGAAATT
TTGGCATATC GTCCTGATTT TTTGAGACAA GTAGTACAGT TTGCAGGTTG GGGTTTGATG
ACAGGAATTT TGTCAATGGT TCAATATCAA AAGACTTTTA ATAATACGGG TATTGCGATG
CTTCAGGTTG CCAAAGCATT ATTATGTCGC CCTGACTCAT CAATGTCTAC AATTTTTGGT
GTTGCTATTG AAGAAGTATT GAGGAGTCAG GAGCCAGGAG TCAGTAGAAT GCATTAG
 
Protein sequence
MKFVLNSRNV YDYLVENGLC DSLQEGEPYP SQNSQSNVEL ISAKNFNLLV TLPEGRKLLV 
KQEQYNSQGK TLGELFGEWR IQKFLRTFPE LDHWRVFLPE LLFHDPEKSI VVSTYLDNYR
DLSDFYSKEN TFPTKVAAQI GNFLGTVHRD TWNRENYREF FASEVVSTKP AVSPLLVDSL
ERIGPDIFGI APVDGLRFFA LYQRYDSLGK AIAQLRETLS PTCLTHNDLK LNNILMAQNW
ENSGENIVRL IDWERSSWGD PAFDLGTAIS SYLQLWLGSL VISNSLSIEE SLKFATTPLE
LLQPSIAALA KAYFDTFPEI LAYRPDFLRQ VVQFAGWGLM TGILSMVQYQ KTFNNTGIAM
LQVAKALLCR PDSSMSTIFG VAIEEVLRSQ EPGVSRMH