Gene Tery_1950 details


Gene Information

Locus tag: Tery_1950
Symbol:
ID: 4244374
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 3023400
End bp: 3025337
Gene Length: 1938 bp
Protein Length: 645 aa
Translation table: 11
GC content: 39%
IMG OID: 638107069
Product: protein serine/threonine phosphatase
Protein accession: YP_721676
Protein GI: 113475615
COG category: [T] Signal transduction mechanisms
COG ID: [COG0631] Serine/threonine protein phosphatase
TIGRFAM ID:
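
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; neither convention is stated in the record, although both are consistent with its values. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3023400, 3025337)
print(length)                           # 1938
print(expected_protein_length(length))  # 645
```

Both results agree with the Gene Length and Protein Length fields in the record.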


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.996513
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0303353
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTAATT CTGCGCCAAT AATTCATTGT CCAAATATTA ATTGTTCTAG TGTTTCTCAC 
TCTTGGGATC AGGAAGAGTG TGAAGCTTGC CAAACTAAAT TAATCTATCG CTATTTATGG
GCAGTTGCTC CAAAAGTCGA GATTTCTGTA GGGGAGTTAT TGGGCGATCG CTACTATGTT
GTAGCTCCAC AAGTTTGGCT AGATACGAAG CCTGGTGAAC CTCCATTTAT GCACGAGGAA
TTTCCAGATT ATATTATGCC TTATCTACGC TTATATCCTG AGAGGCTTCA TACACCAGAT
GTATATGGTT TTTATCAGTA TGGAGAAGAA ACATATCCTA CAGATATTCT ATTATTAGAT
AATGTTCCTC TGGACTCTAA AGGTAATTTG CTTCCTAGTC TTGTAGAAGC TTGGCCCACA
GCAAGTCCTG TACGTCAGGT CTACTGGTTG TGGCAAATGA TGCAATTATG GAAACCTCTA
TCAGAAGAGC GTGTAGCTTT TAGTTTATTG ACCAATAATT TACGGGTGGA AGATTGGCGA
TTACGATTGT TAGAGTTAAA CCTAGAAACC CGTAAAACTA AACTTAGAGA TTTTAAGATT
TTTTGGTCAA ATTTGCTTCC GACAGCTCAT CCTAACGTTC AACAACCATT GACAGAGATT
TGTCATATGA TTAGTGATAT TGGGGAATCT TTAGAGGCGA TCGCTCCGAA ATTAAACCAA
CTACTATTAG AACAGGCAGC CAAATTACCA TTAAAATCCG ATATCATGGG GGCGACGGAT
ACAGGTCCCG TGCGTATGCA TAATGAAGAC TGTTGTTATC CTACAGAAGA GGACTTAAAT
AGTAATCAAC TAGTGCCCAA TTTAGCAATA ATTTGTGATG GAGTGGGTGG CCATGATGGC
GGAGAGGTAG CTAGTCAACT TGCAGTACAA TCAATAAAAA AATTAGTTCA AAATTTATTA
ATAGAGGTAG AGCAACAAGA GGAGTTAACA TCCCCAAACT TAGTCAATAA ACAACTAGAG
GAAATTATTC GGGTAGTCAA TAATATGATT GCTACGGAGA ATGATGAACA AGGAAGAGAG
TCTCGACAGC GCATGGGTAC AACACTAGCC ATGGCATTAC AGTTACCCCA ACAGGTGAAA
CCTTCCCCAG AATCTAGTCC AAATAATGCC CATGAGTTAT ATATAGTTCA TGTGGGAGAT
AGTCGTGCTT ATTGGTTAAG TAGAAATACT TGTCAGTTGT TGACTGTGGA TGATAATGTT
GCTTCTAGAG AAGTTGGTTT GGGTCATTGT TTGTATTGGC AGGGTTTAAA ACGTCGTGAT
GGTGGTGCCT TGACTCAGGC TTTGGGTACA AGAGATGGGG AGTTTATTTA TCCTACTATT
CAAAGATTTT TGATCGAACA AGAGGGGATA TTACTTTTAT GTTCCGATGG CCTGAGTGAT
AATAATTGGG TTGAGAATTC TTGGGCTGAA TATGCTTCTA AAGTTTTCCA GGGTGAAATT
TCTCTAAAAC AGGGGGTAGA GTCTTTAATT GAGCTGGCGA ATCATAAAAA TGGTCATGAT
AATACTTCGG TGGTGCTTGT CCACTATCGT CTTTCTCCAG AGAAGTTAAT CTTATCTAAT
ATTCCTACAG TAACAACTCC AGAAAAGGAA ATTTTAATGA CAGAATTGTC TGAGGCTTCT
AAAGAGCTTC TTTACAATAA TGATGAAGAA GTATCAGAAG CTGAGTCTGA ATTTCAGTTG
GAACCTGTCC AAAATTTTTT CTGGTTAAAA CCCTGGATGT GTATGATGGG TGTCTTATTT
ATCTTGTTGG TGGGGGCAAT TATATCGGTA TGGCGCAATT GGGAAAATTT TCAGTCAACT
CCTAGGGATA GTCCTGCACC TTGGGAGAGT CCTATGAGAC CTGAGCCTTC AGAAACTAGA
ATTTCTCCTA GTTCTTAA
 
Protein sequence
MINSAPIIHC PNINCSSVSH SWDQEECEAC QTKLIYRYLW AVAPKVEISV GELLGDRYYV 
VAPQVWLDTK PGEPPFMHEE FPDYIMPYLR LYPERLHTPD VYGFYQYGEE TYPTDILLLD
NVPLDSKGNL LPSLVEAWPT ASPVRQVYWL WQMMQLWKPL SEERVAFSLL TNNLRVEDWR
LRLLELNLET RKTKLRDFKI FWSNLLPTAH PNVQQPLTEI CHMISDIGES LEAIAPKLNQ
LLLEQAAKLP LKSDIMGATD TGPVRMHNED CCYPTEEDLN SNQLVPNLAI ICDGVGGHDG
GEVASQLAVQ SIKKLVQNLL IEVEQQEELT SPNLVNKQLE EIIRVVNNMI ATENDEQGRE
SRQRMGTTLA MALQLPQQVK PSPESSPNNA HELYIVHVGD SRAYWLSRNT CQLLTVDDNV
ASREVGLGHC LYWQGLKRRD GGALTQALGT RDGEFIYPTI QRFLIEQEGI LLLCSDGLSD
NNWVENSWAE YASKVFQGEI SLKQGVESLI ELANHKNGHD NTSVVLVHYR LSPEKLILSN
IPTVTTPEKE ILMTELSEAS KELLYNNDEE VSEAESEFQL EPVQNFFWLK PWMCMMGVLF
ILLVGAIISV WRNWENFQST PRDSPAPWES PMRPEPSETR ISPSS
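
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is an illustrative sketch, not the tool the database used: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to internal codons as the standard code (the two differ only in which codons may act as initiators, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and the final two blocks of the gene sequence above.
print(translate("ATGATTAATT CTGCGCCAAT AATTCATTGT"))  # MINSAPIIHC
print(translate("ATTTCTCCTA GTTCTTAA"))               # ISPSS
```

Passing the full gene sequence to `translate` should give the protein sequence listed above, and `gc_percent` can be compared with the GC content field.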