Gene Tery_3845 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3845 
Symbol 
ID4242296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5940562 
End bp5942922 
Gene Length2361 bp 
Protein Length786 aa 
Translation table11 
GC content41% 
IMG OID638108777 
Producthypothetical protein 
Protein accessionYP_723360 
Protein GI113477299 
COG category[R] General function prediction only 
COG ID[COG3211] Predicted phosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0999872 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0977424 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAGGC TAACTCGCAG AAAACTGTTA ATGTTCTTCG GATGTAGCGC AGCTGCTACA 
GCACTATCAC CAAAAATTGA AAATTTTTTG GGTAGTAACT CTGAAGTGGC TCTTGCTCAA
ACTCAAGGTT TGAGTTTCAC ACCCCTGAAA CTGGCGCATC CTTTAGAAGC TTATGAAAAA
CATTCTAGCT TTGTACCTTT AGGAACTGGT GGAGAAGGAG CAACTTTAGG AGCAGGTGTA
GATGTAGCAC TACAATCATA TCAGTACTTT GATGATGTAA TAGTACCTCC CGAATATGAA
AGGTATGTAA TTGTTAGTTG GGGCGATCGC GTATTCCCTG ACTCTGAAGA GTACTTTGGT
TATAATGCTG ACTATGTAAG TTTTATTCCT GTTAATGGTA ATCCTGATGA TGGTTACTTA
TGGACTAACC ATGAGTATGT CTCATATCCA ATGTCACCAT TATTAGCACG TAGTGATGAC
TTAGAAGGTT TCCCCACAAC AGACAAACTT GTACTAGGTT TAGATTTATC TCAGTCTAGT
ATCTCCACCT TAGGTGAATT TGGTTATAAC CAAGGTGGTT CTATTGTCAG AATTAAAAAA
GGTAGTAACG GTCAATATGC TACTGTAGCA GACAGTGCTA ATCGTCGAAT ACACCTCTTA
TCTGGACTAG GAATTAACTC CGAACGTTCT GATAACTATC AAAGAGTTAC ATCTTGGGGT
ACAGCTAGCT ATCAGACTGG AGACAAAAAC TTCTTAATTG GTACTGGACC TGCTGCTGTT
GAAGTATTCC CTCTAAGTTC TGATGGACTG GGTAATAAAA TCATTGGTAC AGCATTCAAC
TGTTCTGGTG GTACAACTCC CTGGGGAACA GTTCTAACTG CAGAAGAGAA CTTCCAAGGT
AGCGTTACCG AAGCTGTATC ACCTAATGGT ACTCAAACTG GTTATAAGGA AGAGGGTATA
GGTTTTACTT TTGGTTTAGT TGGTGAAAAG TATGGCTGGA TGGTAGAAGT TTCTCCAGCA
GACCCAAGTT TCCAAAATAA GAAACATACA GCTTTAGGTC GTTTTCGCCA TGAAAATATT
GCGTTCCGAG TAGAAGCAGG TAAACCGTTA GCAGCTTATA TGGGAGATGA CCGTCGTGGT
GGTCACACAT GGAAGTTTGT GAGTGATGGT ATTGTTTCTA ATCCTACCGA TCCAAGTAAC
AGCAGATTAT TTAATAGCGG AACTCTCTAT GCTGCACGCT TAAATCCTGA TGGTTCTGGT
CAATGGATTC CATTAATTCC TGCCACACGT ACTAACCCTC TATCACCAAG AGAACTTGCT
GAAGCTGAAT TAAATGTTTT TGGTAAAGCT CAAAGAGATG GTCGCATCCG CTTACCTCAA
CGTCTTGGTA TTGCTGGAGG AGAAGAAAAT GGTGGGTATT TCATTGTAGA CTTAACAAAT
GAAAGTGCAT TATCTGATTA TCAAGGCAAA ACTCTAGCAG ATTTCTACGA CAGCCAAGGT
GCGATTTTAG TTGATGCTTT CTTAGCTGCT AACTTAGTAG GTGCCACTCC TACTGCTCGT
CCTGAAGATT TAGAAGTTCA CCCTGGTGAT GGTAGTGTAT TTATTGCTTA TACCGATAAT
GGACCTGGTG GAGATGGATA TCCAGATTCC AGAGTCTTTG TTGTGAGTAA ATACTCTGCA
GATGTTAATG CTGCTCAACC TTTTGGTGGT ATCTACCGAA TCATTGAAAC AAACAGTGAT
GTTACCAGTA CCACTTTTAC CTGGTCAGCG TTTGAGCAGA GTGGAGAAAA TGGTGCTGTT
AATGGTCCAG GTTTTGCCAA TGTAGACAAC CTAGAAATTG ATACTTTGGG TAATATTTGG
GGCGTAACAG ATATGTCTAC TAGTAGTCAT AATGGTTTCA ATACTGGCGC TGCTGGAGAG
ATAAAAGAGA TTGACCACAC TCAAACAGGA AGTGTTGGTA ACTTGAGAGG AACATTTGGT
AACAACTGGT TATTCTATAT TCCTGTCGTC GGAGAAAACG CTGGAATGGT TATACCTTTT
GCTTATGGTC CCCCTCGTTG TGAAATCACT GGACCATACT TTATCAAAAA TCGTAGTGGT
GTAAATGAAA CTCTGCTATT AGCTGTACAA CACCCTGGTG AAAGTTGTCC TATTGGAGAT
GAAGTTAAAC TAGGTCGTAA TATCGAGATG TTAAACTTAG ATGGTAGCCT TTTTACTCAG
CAACGAAGCG TACCTCGTGG AAGTAATTGG CCTAGTAACA CAGGGTATGT AGGTAATCCT
GGAGGCTTTT TTAATGGTTT ACTGCCACCA AGACCTTCTG TCATTGGTGT TACTCGTAGA
GACGGTGGTA AGTTTGTTTA A
 
Protein sequence
MSRLTRRKLL MFFGCSAAAT ALSPKIENFL GSNSEVALAQ TQGLSFTPLK LAHPLEAYEK 
HSSFVPLGTG GEGATLGAGV DVALQSYQYF DDVIVPPEYE RYVIVSWGDR VFPDSEEYFG
YNADYVSFIP VNGNPDDGYL WTNHEYVSYP MSPLLARSDD LEGFPTTDKL VLGLDLSQSS
ISTLGEFGYN QGGSIVRIKK GSNGQYATVA DSANRRIHLL SGLGINSERS DNYQRVTSWG
TASYQTGDKN FLIGTGPAAV EVFPLSSDGL GNKIIGTAFN CSGGTTPWGT VLTAEENFQG
SVTEAVSPNG TQTGYKEEGI GFTFGLVGEK YGWMVEVSPA DPSFQNKKHT ALGRFRHENI
AFRVEAGKPL AAYMGDDRRG GHTWKFVSDG IVSNPTDPSN SRLFNSGTLY AARLNPDGSG
QWIPLIPATR TNPLSPRELA EAELNVFGKA QRDGRIRLPQ RLGIAGGEEN GGYFIVDLTN
ESALSDYQGK TLADFYDSQG AILVDAFLAA NLVGATPTAR PEDLEVHPGD GSVFIAYTDN
GPGGDGYPDS RVFVVSKYSA DVNAAQPFGG IYRIIETNSD VTSTTFTWSA FEQSGENGAV
NGPGFANVDN LEIDTLGNIW GVTDMSTSSH NGFNTGAAGE IKEIDHTQTG SVGNLRGTFG
NNWLFYIPVV GENAGMVIPF AYGPPRCEIT GPYFIKNRSG VNETLLLAVQ HPGESCPIGD
EVKLGRNIEM LNLDGSLFTQ QRSVPRGSNW PSNTGYVGNP GGFFNGLLPP RPSVIGVTRR
DGGKFV